no code implementations • 9 May 2024 • Xue Geng, Zhe Wang, Chunyun Chen, Qing Xu, Kaixin Xu, Chao Jin, Manas Gupta, Xulei Yang, Zhenghua Chen, Mohamed M. Sabry Aly, Jie Lin, Min Wu, XiaoLi Li
To address these challenges, researchers have developed various model compression techniques such as model quantization and model pruning.