2 code implementations • 11 Apr 2024 • Muxin Zhou, Zeyuan Yin, Shitong Shao, Zhiqiang Shen
In this work, we consider addressing this task through the new lens of model informativeness in the compression stage on the original dataset pretraining.
1 code implementation • 29 Nov 2023 • Shitong Shao, Zeyuan Yin, Muxin Zhou, Xindong Zhang, Zhiqiang Shen
We call this perspective "generalized matching" and propose Generalized Various Backbone and Statistical Matching (G-VBSM) in this work, which aims to create a synthetic dataset with densities, ensuring consistency with the complete dataset across various backbones, layers, and statistics.