Search Results for author: Simon Shaolei Du

Found 8 papers, 2 papers with code

Offline Multi-task Transfer RL with Representational Penalization

no code implementations • 19 Feb 2024 • Avinandan Bose, Simon Shaolei Du, Maryam Fazel

We study the problem of representation transfer in offline Reinforcement Learning (RL), where a learner has access to episodic data from a number of source tasks collected a priori, and aims to learn a shared representation to be used in finding a good policy for a target task.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

no code implementations • 3 Feb 2024 • Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du

In recent years, data selection has emerged as a core issue for large-scale visual-language model pretraining, especially on noisy web-curated datasets.

Contrastive Learning Experimental Design +1

Paper
Add Code

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

1 code implementation • 30 Oct 2023 • Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du

Off-policy dynamic programming (DP) techniques such as $Q$-learning have proven to be important in sequential decision-making problems.

Decision Making Offline RL +1

Paper
Code

Robust Offline Reinforcement Learning -- Certify the Confidence Interval

no code implementations • 28 Sep 2023 • Jiarui Yao, Simon Shaolei Du

Currently, reinforcement learning (RL), especially deep RL, has received more and more attention in the research area.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

1 code implementation • 16 Jun 2023 • Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Arnav M. Das, Gantavya Bhatt, Yinglun Zhu, Jeffrey Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D Nowak

Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive.

Active Learning Benchmarking +1

Paper
Code

A Benchmark for Low-Switching-Cost Reinforcement Learning

no code implementations • 13 Dec 2021 • Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu

A ubiquitous requirement in many practical reinforcement learning (RL) applications, including medical treatment, recommendation system, education and robotics, is that the deployed policy that actually interacts with the environment cannot change frequently.

Atari Games reinforcement-learning +1

Paper
Add Code

Deep Q-Learning with Low Switching Cost

no code implementations • 1 Jan 2021 • Shusheng Xu, Simon Shaolei Du, Yi Wu

We initiate the study on deep reinforcement learning problems that require low switching cost, i. e., small number of policy switches during training.

Atari Games Q-Learning +2

Paper
Add Code

Hypothesis Transfer Learning via Transformation Functions

no code implementations • NeurIPS 2017 • Simon Shaolei Du, Jayanth Koushik, Aarti Singh, Barnabas Poczos

We consider the Hypothesis Transfer Learning (HTL) problem where one incorporates a hypothesis trained on the source domain into the learning procedure of the target domain.

Transfer Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.