no code implementations • 1 Apr 2024 • Dhananjay Saikumar, Blesson Varghese
Pruning methods such as iterative magnitude-based pruning (IMP) achieve up to a 90% parameter reduction while retaining accuracy comparable to the original model.
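
To make the idea of iterative magnitude-based pruning concrete, here is a minimal sketch on a flat list of weights. It is not the authors' implementation: the function names, the `retrain` hook, and the per-round pruning fraction are all assumptions for illustration. In practice each round would retrain the surviving weights before pruning again.

```python
def magnitude_prune(weights, mask, fraction):
    """Deactivate the given fraction of still-active weights with the smallest magnitude."""
    active = [i for i, keep in enumerate(mask) if keep]
    n_prune = int(len(active) * fraction)
    # Sort active indices by absolute weight, smallest first.
    to_prune = sorted(active, key=lambda i: abs(weights[i]))[:n_prune]
    new_mask = list(mask)
    for i in to_prune:
        new_mask[i] = False
    return new_mask


def iterative_magnitude_pruning(weights, rounds, fraction, retrain=None):
    """Repeat (optional retrain, then prune) for a fixed number of rounds.

    `retrain` is a hypothetical callback taking (weights, mask) and returning
    updated weights; it stands in for a real training loop.
    """
    mask = [True] * len(weights)
    for _ in range(rounds):
        if retrain is not None:
            weights = retrain(weights, mask)
        mask = magnitude_prune(weights, mask, fraction)
    pruned = [w if keep else 0.0 for w, keep in zip(weights, mask)]
    return pruned, mask


# Toy example: ten weights, two rounds, half of the survivors pruned per round.
example_weights = [0.5, -0.1, 0.3, -0.8, 0.05, 0.9, -0.2, 0.4, 0.01, -0.6]
pruned_weights, final_mask = iterative_magnitude_pruning(example_weights, rounds=2, fraction=0.5)
```

Pruning a fixed fraction of the remaining weights each round means the overall sparsity compounds: with a fraction of 0.5, roughly a quarter of the weights survive two rounds.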
no code implementations • 21 Feb 2024 • Dhananjay Saikumar, Blesson Varghese
NeuroFlux segments a CNN into blocks based on GPU memory usage and further attaches an auxiliary network to each layer in these blocks.
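
As a rough illustration of segmenting a network into blocks by memory usage, the sketch below greedily groups consecutive layers so that each block stays within a memory budget. The greedy rule, the function name `segment_into_blocks`, and the per-layer memory figures are assumptions made for this example; the summary above does not specify how NeuroFlux chooses block boundaries, and the auxiliary networks are left out.

```python
def segment_into_blocks(layer_memory_mb, budget_mb):
    """Group consecutive layer indices into blocks whose total memory fits the budget.

    `layer_memory_mb` holds a hypothetical per-layer memory estimate in MB.
    """
    blocks, current, used = [], [], 0.0
    for idx, mem in enumerate(layer_memory_mb):
        if mem > budget_mb:
            raise ValueError(f"layer {idx} alone exceeds the budget")
        # Start a new block when adding this layer would exceed the budget.
        if current and used + mem > budget_mb:
            blocks.append(current)
            current, used = [], 0.0
        current.append(idx)
        used += mem
    if current:
        blocks.append(current)
    return blocks


# Toy example: five layers with made-up memory estimates and a 600 MB budget.
example_blocks = segment_into_blocks([300, 200, 400, 100, 500], budget_mb=600)
```

Each resulting block can then be handled on its own within the memory budget, which is the property the block-wise approach relies on.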