1 code implementation • 3 Nov 2022 • Arpan Losalka, Jonathan Scarlett
We consider the problem of sequentially maximising an unknown function over a set of actions while ensuring that every sampled point has a function value below a given safety threshold.
no code implementations • 7 Jul 2020 • Ilija Bogunovic, Arpan Losalka, Andreas Krause, Jonathan Scarlett
We consider a stochastic linear bandit problem in which the rewards are not only subject to random noise, but also adversarial attacks subject to a suitable budget $C$ (i.e., an upper bound on the sum of corruption magnitudes across the time horizon).
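One way to write this budget down, as a sketch only: the symbols below are assumptions not given in this listing (chosen action $x_t$, unknown parameter $\theta$, random noise $\epsilon_t$, adversarial corruption $c_t$, time horizon $T$).

$$
y_t = \langle \theta, x_t \rangle + \epsilon_t + c_t, \qquad \sum_{t=1}^{T} |c_t| \le C.
$$

The paper's exact formulation may differ, for example if the corruption is allowed to depend on the chosen action.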
no code implementations • 31 May 2018 • Partha Ghosh, Arpan Losalka, Michael J. Black
Our model has the form of a variational autoencoder, with a Gaussian mixture prior on the latent vector.