Search Results for author: Yijie Peng

Found 18 papers, 4 papers with code

Forward Learning for Gradient-based Black-box Saliency Map Generation

no code implementations • 22 Mar 2024 • Zeliang Zhang, Mingqian Feng, Jinyang Jiang, Rongyi Zhu, Yijie Peng, Chenliang Xu

Gradient-based saliency maps are widely used to explain deep neural network decisions.

Paper
Add Code

Approximated Likelihood Ratio: A Forward-Only and Parallel Framework for Boosting Neural Network Training

no code implementations • 18 Mar 2024 • Zeliang Zhang, Jinyang Jiang, Zhuo Liu, Susan Liang, Yijie Peng, Chenliang Xu

In this paper, we introduce an approximation technique for the likelihood ratio (LR) method to alleviate computational and memory demands in gradient estimation.

Paper
Add Code

Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode

no code implementations • 1 Mar 2024 • Jinyang Jiang, Xiaotian Liu, Tao Ren, Qinghao Wang, Yi Zheng, Yufu Du, Yijie Peng, Cheng Zhang

We introduce a deep reinforcement learning (DRL) approach for solving management problems including inventory management, dynamic pricing, and recommendation.

Decision Making Management

Paper
Add Code

RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search

no code implementations • 11 Feb 2024 • Tao Ren, Ruihan Zhou, Jinyang Jiang, Jiafeng Liang, Qinghao Wang, Yijie Peng

Existing alpha mining only employs the neural network agent, unable to utilize the structural information of the solution space.

Paper
Add Code

Sample-Efficient Clustering and Conquer Procedures for Parallel Large-Scale Ranking and Selection

no code implementations • 3 Feb 2024 • Zishi Zhang, Yijie Peng

We propose novel "clustering and conquer" procedures for the parallel large-scale ranking and selection (R&S) problem, which leverage correlation information for clustering to break the bottleneck of sample efficiency.

Clustering Neural Architecture Search

Paper
Add Code

AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems

no code implementations • 1 Feb 2024 • Ruihan Zhou, L. Jeff Hong, Yijie Peng

We introduce AlphaRank, an artificial intelligence approach to address the fixed-budget ranking and selection (R&S) problems.

Paper
Add Code

One Forward is Enough for Neural Network Training via Likelihood Ratio Method

no code implementations • 15 May 2023 • Jinyang Jiang, Zeliang Zhang, Chenliang Xu, Zhaofei Yu, Yijie Peng

While backpropagation (BP) is the mainstream approach for gradient computation in neural network training, its heavy reliance on the chain rule of differentiation constrains the designing flexibility of network architecture and training pipelines.

Paper
Add Code

Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms

1 code implementation • 12 May 2023 • Jinyang Jiang, Jiaqiao Hu, Yijie Peng

Classical reinforcement learning (RL) aims to optimize the expected cumulative reward.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Efficient Learning for Selecting Top-m Context-Dependent Designs

no code implementations • 6 May 2023 • Gongbo Zhang, Sihua Chen, Kuihua Huang, Yijie Peng

We consider a simulation optimization problem for a context-dependent decision-making, which aims to determine the top-m designs for all contexts.

Decision Making

Paper
Add Code

A Novel Noise Injection-based Training Scheme for Better Model Robustness

no code implementations • 17 Feb 2023 • Zeliang Zhang, Jinyang Jiang, Minjie Chen, Zhiyuan Wang, Yijie Peng, Zhaofei Yu

Noise injection-based method has been shown to be able to improve the robustness of artificial neural networks in previous work.

Adversarial Robustness Computational Efficiency

Paper
Add Code

An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search

no code implementations • 26 Apr 2022 • Gongbo Zhang, Yijie Peng, Yilong Xu

We consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of finite-horizon Markov decision process.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Quantile-Based Policy Optimization for Reinforcement Learning

no code implementations • 27 Jan 2022 • Jinyang Jiang, Jiaqiao Hu, Yijie Peng

Classical reinforcement learning (RL) aims to optimize the expected cumulative rewards.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Pathological Image Segmentation with Noisy Labels

no code implementations • 20 Mar 2021 • Li Xiao, Yinhao Li, Luxi Qv, Xinxia Tian, Yijie Peng, S. Kevin Zhou

Segmentation of pathological images is essential for accurate disease diagnosis.

Image Segmentation Segmentation +1

Paper
Add Code

Noise Optimization for Artificial Neural Networks

1 code implementation • 6 Feb 2021 • Li Xiao, Zeliang Zhang, Yijie Peng

Adding noises to artificial neural network(ANN) has been shown to be able to improve robustness in previous work.

Paper
Code

Context-dependent Ranking and Selection under a Bayesian Framework

no code implementations • 10 Dec 2020 • Haidong Li, Henry Lam, Zhe Liang, Yijie Peng

We consider a context-dependent ranking and selection problem.

Methodology

Paper
Add Code

Efficient Learning for Clustering and Optimizing Context-Dependent Designs

1 code implementation • 10 Dec 2020 • Haidong Li, Henry Lam, Yijie Peng

We consider a simulation optimization problem for a context-dependent decision-making.

Decision Making Methodology

Paper
Code

Training Artificial Neural Networks by Generalized Likelihood Ratio Method: Exploring Brain-like Learning to Improve Robustness

1 code implementation • 31 Jan 2019 • Li Xiao, Yijie Peng, Jeff Hong, Zewu Ke, Shuhuai Yang

In this work, we propose a generalized likelihood ratio method capable of training the artificial neural networks with some biological brain-like mechanisms,. e. g., (a) learning by the loss value, (b) learning via neurons with discontinuous activation and loss functions.

General Classification

Paper
Code

Ranking and Selection as Stochastic Control

no code implementations • 7 Oct 2017 • Yijie Peng, Edwin K. P. Chong, Chun-Hung Chen, Michael C. Fu

Under a Bayesian framework, we formulate the fully sequential sampling and selection decision in statistical ranking and selection as a stochastic control problem, and derive the associated Bellman equation.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.