no code implementations • 21 Mar 2021 • Vincent Thomas, Gérémy Hutin, Olivier Buffet
In this article, we discuss how to solve information-gathering problems expressed as rho-POMDPs, an extension of Partially Observable Markov Decision Processes (POMDPs) whose reward rho depends on the belief state.