no code implementations • 14 Jun 2022 • Weishun Zhong, Ben Sorscher, Daniel D Lee, Haim Sompolinsky
Our theory predicts that the reduction in capacity due to the constrained weight-distribution is related to the Wasserstein distance between the imposed distribution and that of the standard normal distribution.