Search Results for author: Seungki Min

Found 4 papers, 1 papers with code

An Information-Theoretic Analysis of Nonstationary Bandit Learning

no code implementations • 9 Feb 2023 • Seungki Min, Daniel Russo

In nonstationary bandit learning problems, the decision-maker must continually gather information and adapt their action selection as the latent state of the environment evolves.

Paper
Add Code

Risk-Sensitive Optimal Execution via a Conditional Value-at-Risk Objective

no code implementations • 28 Jan 2022 • Seungki Min, Ciamac C. Moallemi, Costis Maglaras

As our problem is a special case of a linear-quadratic-Gaussian control problem with a CVaR objective, these results may be interesting in broader settings.

Paper
Add Code

Policy Gradient Optimization of Thompson Sampling Policies

no code implementations • 30 Jun 2020 • Seungki Min, Ciamac C. Moallemi, Daniel J. Russo

We study the use of policy gradient algorithms to optimize over a class of generalized Thompson sampling policies.

Policy Gradient Methods Thompson Sampling

Paper
Add Code

Thompson Sampling with Information Relaxation Penalties

1 code implementation • NeurIPS 2019 • Seungki Min, Costis Maglaras, Ciamac C. Moallemi

With this framework, we define an intuitive family of control policies that include Thompson sampling (TS) and the Bayesian optimal policy as endpoints.

Thompson Sampling

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.