no code implementations • 3 Jan 2023 • Haolin Ruan, Zhi Chen, Chin Pang Ho
We propose a distributionally robust return-risk model for Markov decision processes (MDPs) under risk and reward ambiguity.