Search Results for author: Xiaocheng Tang

Found 14 papers, 3 papers with code

Golfer: Trajectory Prediction with Masked Goal Conditioning MnM Network

no code implementations • 2 Jul 2022 • Xiaocheng Tang, Soheil Sadeghi Eshkevari, Haoyu Chen, Weidan Wu, Wei Qian, Xiaoming Wang

Transformers have enabled breakthroughs in NLP and computer vision, and have recently began to show promising performance in trajectory prediction for Autonomous Vehicle (AV).

motion prediction Trajectory Prediction

Paper
Add Code

Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace

no code implementations • 10 Feb 2022 • Soheil Sadeghi Eshkevari, Xiaocheng Tang, Zhiwei Qin, Jinhan Mei, Cheng Zhang, Qianying Meng, Jia Xu

In this study, a real-time dispatching algorithm based on reinforcement learning is proposed and for the first time, is deployed in large scale.

Causal Inference reinforcement-learning +1

Paper
Add Code

RANK-NOSH: Efficient Predictor-Based Architecture Search via Non-Uniform Successive Halving

no code implementations • ICCV 2021 • Ruochen Wang, Xiangning Chen, Minhao Cheng, Xiaocheng Tang, Cho-Jui Hsieh

Predictor-based algorithms have achieved remarkable performance in the Neural Architecture Search (NAS) tasks.

Learning-To-Rank Neural Architecture Search +1

Paper
Add Code

Rethinking Architecture Selection in Differentiable NAS

1 code implementation • ICLR 2021 • Ruochen Wang, Minhao Cheng, Xiangning Chen, Xiaocheng Tang, Cho-Jui Hsieh

Differentiable Neural Architecture Search is one of the most popular Neural Architecture Search (NAS) methods for its search efficiency and simplicity, accomplished by jointly optimizing the model weight and architecture parameters in a weight-sharing supernet via gradient-based algorithms.

Neural Architecture Search

Paper
Code

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

no code implementations • 8 Jun 2021 • Xiaocheng Tang, Zhiwei Qin, Fan Zhang, Zhaodong Wang, Zhe Xu, Yintai Ma, Hongtu Zhu, Jieping Ye

In this work, we propose a deep reinforcement learning based solution for order dispatching and we conduct large scale online A/B tests on DiDi's ride-dispatching platform to show that the proposed method achieves significant improvement on both total driver income and user experience related metrics.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Value Function is All You Need: A Unified Learning Framework for Ride Hailing Platforms

no code implementations • 18 May 2021 • Xiaocheng Tang, Fan Zhang, Zhiwei Qin, Yansheng Wang, Dingyuan Shi, Bingchen Song, Yongxin Tong, Hongtu Zhu, Jieping Ye

In this paper we propose a unified value-based dynamic learning framework (V1D3) for tackling both tasks.

Paper
Add Code

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

no code implementations • 29 Mar 2021 • Sharada Mohanty, Jyotish Poonganam, Adrien Gaidon, Andrey Kolobov, Blake Wulfe, Dipam Chakraborty, Gražvydas Šemetulskis, João Schapke, Jonas Kubilius, Jurgis Pašukonis, Linas Klimas, Matthew Hausknecht, Patrick MacAlpine, Quang Nhat Tran, Thomas Tumiel, Xiaocheng Tang, Xinwei Chen, Christopher Hesse, Jacob Hilton, William Hebgen Guss, Sahika Genc, John Schulman, Karl Cobbe

We present the design of a centralized benchmark for Reinforcement Learning which can help measure Sample Efficiency and Generalization in Reinforcement Learning by doing end to end evaluation of the training and rollout phases of thousands of user submitted code bases in a scalable way.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning

no code implementations • 8 Mar 2021 • Yan Jiao, Xiaocheng Tang, Zhiwei Qin, Shuaiji Li, Fan Zhang, Hongtu Zhu, Jieping Ye

We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

DrNAS: Dirichlet Neural Architecture Search

1 code implementation • ICLR 2021 • Xiangning Chen, Ruochen Wang, Minhao Cheng, Xiaocheng Tang, Cho-Jui Hsieh

This paper proposes a novel differentiable architecture search method by formulating it into a distribution learning problem.

Ranked #1 on Neural Architecture Search on NAS-Bench-201, CIFAR-100

Neural Architecture Search

Paper
Code

Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem

no code implementations • 25 Nov 2019 • John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

Order dispatching and driver repositioning (also known as fleet management) in the face of spatially and temporally varying supply and demand are central to a ride-sharing platform marketplace.

BIG-bench Machine Learning Decision Making +3

Paper
Add Code

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms

no code implementations • 27 May 2019 • Jiarui Jin, Ming Zhou, Wei-Nan Zhang, Minne Li, Zilong Guo, Zhiwei Qin, Yan Jiao, Xiaocheng Tang, Chenxi Wang, Jun Wang, Guobin Wu, Jieping Ye

How to optimally dispatch orders to vehicles and how to trade off between immediate and future returns are fundamental questions for a typical ride-hailing platform.

Multiagent Systems

Paper
Add Code

HIPAD - A Hybrid Interior-Point Alternating Direction algorithm for knowledge-based SVM and feature selection

no code implementations • 16 Nov 2014 • Zhiwei Qin, Xiaocheng Tang, Ioannis Akrotirianakis, Amit Chakraborty

We consider classification tasks in the regime of scarce labeled training data in high dimensional feature space, where specific expert knowledge is also available.

feature selection General Classification

Paper
Add Code

Practical Inexact Proximal Quasi-Newton Method with Global Complexity Analysis

1 code implementation • 26 Nov 2013 • Katya Scheinberg, Xiaocheng Tang

Recently several methods were proposed for sparse optimization which make careful use of second-order information [10, 28, 16, 3] to improve local convergence rates.

Paper
Code

Efficiently Using Second Order Information in Large l1 Regularization Problems

no code implementations • 27 Mar 2013 • Xiaocheng Tang, Katya Scheinberg

We propose a novel general algorithm LHAC that efficiently uses second-order information to train a class of large-scale l1-regularized problems.

regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.