Search Results for author: Wanxin Jin

Found 13 papers, 7 papers with code

How Can LLM Guide RL? A Value-Based Approach

1 code implementation • 25 Feb 2024 • Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang

Specifically, we develop an algorithm named LINVIT that incorporates LLM guidance as a regularization factor in value-based RL, leading to significant reductions in the amount of data needed for learning, particularly when the difference between the ideal policy and the LLM-informed policy is small, which suggests that the initial policy is close to optimal, reducing the need for further exploration.

Decision Making Reinforcement Learning (RL)

Paper
Code

Identifying Reaction-Aware Driving Styles of Stochastic Model Predictive Controlled Vehicles by Inverse Reinforcement Learning

no code implementations • 23 Aug 2023 • Ni Dang, Tao Shi, Zengjie Zhang, Wanxin Jin, Marion Leibold, Martin Buss

Nevertheless, an important indicator of the driving style, i. e., how an AV reacts to its nearby AVs, is not fully incorporated in the feature design of previous ME-IRL methods.

Autonomous Driving Model Predictive Control

Paper
Add Code

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

no code implementations • 29 Sep 2022 • YiXuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu

It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and stochastic environment under hard constraints that require the system state not to reach certain specified unsafe regions.

Reinforcement Learning (RL) Safe Reinforcement Learning

Paper
Add Code

Cooperative Tuning of Multi-Agent Optimal Control Systems

1 code implementation • 24 Sep 2022 • Zehui Lu, Wanxin Jin, Shaoshuai Mou, Brian D. O. Anderson

Different from classical techniques for tuning parameters in a controller, we allow tunable parameters appearing in both the system dynamics and the objective functions of each agent.

Distributed Optimization

Paper
Code

Distributed Differentiable Dynamic Game for Multi-robot Coordination

no code implementations • 18 Jul 2022 • Xuan Wang, Yizhi Zhou, Wanxin Jin

In the inverse problem, where each robot aims to find (learn) its objective (and dynamics) parameters to mimic given coordination demonstrations, D3G proposes a differentiation solver based on Differential Pontryagin's Maximum Principle, which allows each robot to update its parameters in a distributed and coordinated manner.

Paper
Add Code

Learning Linear Complementarity Systems

no code implementations • 25 Dec 2021 • Wanxin Jin, Alp Aydinoglu, Mathew Halm, Michael Posa

This paper investigates the learning, or system identification, of a class of piecewise-affine dynamical systems known as linear complementarity systems (LCSs).

Paper
Add Code

Safe Pontryagin Differentiable Programming

1 code implementation • NeurIPS 2021 • Wanxin Jin, Shaoshuai Mou, George J. Pappas

We propose a Safe Pontryagin Differentiable Programming (Safe PDP) methodology, which establishes a theoretical and algorithmic framework to solve a broad class of safety-critical learning and control tasks -- problems that require the guarantee of safety constraint satisfaction at any stage of the learning and control progress.

Motion Planning

Paper
Code

Learning from Human Directional Corrections

1 code implementation • 30 Nov 2020 • Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou

This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections.

Motion Planning

Paper
Code

Learning Objective Functions Incrementally by Inverse Optimal Control

no code implementations • 28 Oct 2020 • Wanxin Jin, Zihao Liang, Shaoshuai Mou

This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments.

Robotics

Paper
Add Code

Learning from Sparse Demonstrations

2 code implementations • 5 Aug 2020 • Wanxin Jin, Todd D. Murphey, Dana Kulić, Neta Ezer, Shaoshuai Mou

The time stamps of the keyframes can be different from the time of the robot's actual execution.

Motion Planning

150

Paper
Code

Neural Certificates for Safe Control Policies

no code implementations • 15 Jun 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching.

Paper
Add Code

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

1 code implementation • NeurIPS 2020 • Wanxin Jin, Zhaoran Wang, Zhuoran Yang, Shaoshuai Mou

This paper develops a Pontryagin Differentiable Programming (PDP) methodology, which establishes a unified framework to solve a broad class of learning and control tasks.

150

Paper
Code

Inverse Optimal Control with Incomplete Observations

2 code implementations • 21 Mar 2018 • Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche

We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given candidate features.

Robotics Systems and Control

150

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.