Search Results for author: Seungki Min

Found 4 papers, 1 papers with code

An Information-Theoretic Analysis of Nonstationary Bandit Learning

no code implementations9 Feb 2023 Seungki Min, Daniel Russo

In nonstationary bandit learning problems, the decision-maker must continually gather information and adapt their action selection as the latent state of the environment evolves.

Risk-Sensitive Optimal Execution via a Conditional Value-at-Risk Objective

no code implementations28 Jan 2022 Seungki Min, Ciamac C. Moallemi, Costis Maglaras

As our problem is a special case of a linear-quadratic-Gaussian control problem with a CVaR objective, these results may be interesting in broader settings.

Policy Gradient Optimization of Thompson Sampling Policies

no code implementations30 Jun 2020 Seungki Min, Ciamac C. Moallemi, Daniel J. Russo

We study the use of policy gradient algorithms to optimize over a class of generalized Thompson sampling policies.

Policy Gradient Methods Thompson Sampling

Thompson Sampling with Information Relaxation Penalties

1 code implementation NeurIPS 2019 Seungki Min, Costis Maglaras, Ciamac C. Moallemi

With this framework, we define an intuitive family of control policies that include Thompson sampling (TS) and the Bayesian optimal policy as endpoints.

Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.