Search Results for author: Zhirong Wang

Found 4 papers, 0 papers with code

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we are concerned with the reward of a set of recommended items, we model online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang

Thus, the global policy of the whole page could be sub-optimal.

Multi-agent Reinforcement Learning, Reinforcement Learning (RL)

Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning

no code implementations • 17 Sep 2018 • Jun Feng, Heng Li, Minlie Huang, Shichen Liu, Wenwu Ou, Zhirong Wang, Xiaoyan Zhu

The first is a lack of collaboration between scenarios: each strategy maximizes its own objective but ignores the goals of other strategies, leading to sub-optimal overall performance.

Multi-agent Reinforcement Learning, Reinforcement Learning (+1 more)

Alternating Multi-bit Quantization for Recurrent Neural Networks

no code implementations • ICLR 2018 • Chen Xu, Jianqiang Yao, Zhouchen Lin, Wenwu Ou, Yuanbin Cao, Zhirong Wang, Hongbin Zha

Recurrent neural networks have achieved excellent performance in many applications.

Image Classification, Quantization
