Search Results for author: Joongkyu Lee

Found 2 papers, 0 papers with code

Nearly Minimax Optimal Regret for Multinomial Logistic Bandit

no code implementations16 May 2024 Joongkyu Lee, Min-hwan Oh

To the best of our knowledge, this is the first work in the MNL contextual bandit literature to prove minimax optimality -- for either uniform or non-uniform reward setting -- and to propose a computationally efficient algorithm that achieves this optimality up to logarithmic factors.

Learning Uncertainty-Aware Temporally-Extended Actions

no code implementations8 Feb 2024 Joongkyu Lee, Seung Joon Park, Yunhao Tang, Min-hwan Oh

In reinforcement learning, temporal abstraction in the action space, exemplified by action repetition, is a technique to facilitate policy learning through extended actions.

Cannot find the paper you are looking for? You can Submit a new open access paper.