no code implementations • 21 Aug 2021 • Arwen V. Bradley, Carlos Alberto Gomez-Uribe, Manish Reddy Vuyyuru
Although the shift is unknown at training time, the shift-curvature can still be mitigated by minimizing overall curvature.