no code implementations • NeurIPS 2021 • Remi Bardenet, Subhro Ghosh, Meixia Lin
In particular, we show how specific DPPs and a string of controlled approximations can lead to gradient estimators with a variance that decays faster with the batchsize than under uniform sampling.