no code implementations • 22 Mar 2024 • Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu
In this study, we tackle the complex task of generating 3D human-object interactions (HOI) from textual descriptions in a zero-shot text-to-3D manner.
no code implementations • 21 Feb 2024 • Shishun Zhang, Qijin She, Wenhao Li, Chenyang Zhu, Yongjun Wang, Ruizhen Hu, Kai Xu
To achieve the goal, the core idea is to develop an effective object-to-arm task assignment strategy for minimizing the cumulative task execution time and maximizing the dual-arm cooperation efficiency.
no code implementations • 4 Feb 2024 • Ti Wang, Mengyuan Liu, Hong Liu, Bin Ren, Yingxuan You, Wenhao Li, Nicu Sebe, Xia Li
We observe that previous optimization-based methods commonly rely on projection constraint, which only ensures alignment in 2D space, potentially leading to the overfitting problem.
1 code implementation • 7 Jan 2024 • Yuheng Cheng, Ceyao Zhang, Zhengwen Zhang, Xiangrui Meng, Sirui Hong, Wenhao Li, ZiHao Wang, Zekai Wang, Feng Yin, Junhua Zhao, Xiuqiang He
Intelligent agents stand out as a potential path toward artificial general intelligence (AGI).
no code implementations • 5 Jan 2024 • Chuyun Shen, Wenhao Li, Haoqing Chen, Xiaoling Wang, Fengping Zhu, Yuxin Li, Xiangfeng Wang, Bo Jin
CIML adopts the idea of addition and removes inter-modal redundant information through inductive bias-driven task decomposition and message passing-based redundancy filtering.
1 code implementation • 6 Dec 2023 • Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang
The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks?
1 code implementation • 20 Nov 2023 • Wenhao Li, Mengyuan Liu, Hong Liu, Pichao Wang, Jialun Cai, Nicu Sebe
Transformers have been successfully applied in the field of video-based 3D human pose estimation.
no code implementations • 23 Oct 2023 • Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang
Vision-language (VL) understanding tasks evaluate models' comprehension of complex visual scenes through multiple-choice questions.
no code implementations • 30 Sep 2023 • Wenhao Li
We establish the theoretical equivalence between planning in the latent action space and energy-guided sampling with a pretrained diffusion model and incorporate a novel sequence-level exact sampling method.
1 code implementation • ICCV 2023 • Yingxuan You, Hong Liu, Ti Wang, Wenhao Li, Runwei Ding, Xia Li
Despite significant progress in single image-based 3D human mesh recovery, accurately and smoothly recovering 3D human motion from a video remains challenging.
1 code implementation • 13 Aug 2023 • Wenhao Li, Guangyang Wu, Wenyi Wang, Peiran Ren, Xiaohong Liu
Experimental results on benchmark datasets demonstrate that our method achieves the State-Of-The-Art (SOTA) performance in terms of both image quality and inter-frame brightness consistency.
1 code implementation • 15 Jul 2023 • Tianyu Guo, Mengyuan Liu, Hong Liu, Wenhao Li, Jingwen Guo, Tao Wang, Yidi Li
Considering the instance-level discriminative ability, contrastive learning methods, including MoCo and SimCLR, have been adapted from the original image representation learning task to solve the self-supervised skeleton-based action recognition task.
no code implementations • 28 Jun 2023 • Ningyuan Chen, Wenhao Li
We consider a decision maker allocating one unit of renewable and divisible resource in each period on a number of arms.
no code implementations • 15 Jun 2023 • Chuyun Shen, Wenhao Li, Ya zhang, Xiangfeng Wang
The Segmentation Anything Model (SAM) has recently emerged as a foundation model for addressing image segmentation.
no code implementations • 15 Jun 2023 • Li Zhang, Wenhao Li, Haotian Guan, Zhiquan He, Mingjun Cheng, Han Wang
The identification of compound-protein interactions (CPI) plays a critical role in drug screening, drug repurposing, and combination therapy studies.
no code implementations • 8 Jun 2023 • Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang
Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and empirically, but there has been a lack of theoretical understanding of RO, let alone designing provably RO-free methods.
1 code implementation • 26 May 2023 • Yudong Xu, Wenhao Li, Pashootan Vaezipoor, Scott Sanner, Elias B. Khalil
Although the state-of-the-art GPT-4 is unable to "reason" perfectly within non-language domains such as the 1D-ARC or a simple ARC subset, our study reveals that the use of object-based representations can significantly improve its reasoning ability.
no code implementations • 19 May 2023 • Jinyi Hu, Xu Han, Xiaoyuan Yi, Yutong Chen, Wenhao Li, Zhiyuan Liu, Maosong Sun
IAP optimizes only a separate Chinese text encoder with all other parameters fixed to align Chinese semantics space to the English one in CLIP.
no code implementations • 18 May 2023 • Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha
The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved.
1 code implementation • NeurIPS 2023 • Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang
To thrive in those environments, the agent needs to influence other agents so their actions become more helpful and less harmful.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 5 May 2023 • Benjamin Fan, Edward Qiao, Anran Jiao, Zhouzhou Gu, Wenhao Li, Lu Lu
We develop a methodology that utilizes deep learning to simultaneously solve and estimate canonical continuous-time general equilibrium models in financial economics.
1 code implementation • 27 Apr 2023 • Ti Wang, Hong Liu, Runwei Ding, Wenhao Li, Yingxuan You, Xia Li
Despite substantial progress in 3D human pose estimation from a single-view image, prior works rarely explore global and local correlations, leading to insufficient learning of human skeleton representations.
1 code implementation • 10 Mar 2023 • Yingxuan You, Hong Liu, Xia Li, Wenhao Li, Ti Wang, Runwei Ding
3D human mesh recovery from a 2D pose plays an important role in various applications.
Ranked #146 on 3D Human Pose Estimation on Human3.6M
1 code implementation • 10 Mar 2023 • Jie zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu
Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture.
no code implementations • 3 Mar 2023 • Tao Wang, Mengyuan Liu, Hong Liu, Wenhao Li, Miaoju Ban, Tuanyu Guo, Yidi Li
In this paper, different from most previous works that discard the occluded region, we propose a Feature Completion Transformer (FCFormer) to implicitly complement the semantic information of occluded parts in the feature space.
1 code implementation • 23 Feb 2023 • Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha
We propose a simple and effective RL method, Diverse Policy Optimization (DPO), to model the policies in structured action space as the energy-based models (EBM) by following the probabilistic RL framework.
1 code implementation • 20 Feb 2023 • Jialun Cai, Hong Liu, Runwei Ding, Wenhao Li, Jianbing Wu, Miaoju Ban
3D human pose estimation errors would propagate along the human body topology and accumulate at the end joints of limbs.
Ranked #34 on 3D Human Pose Estimation on MPI-INF-3DHP (AUC metric)
no code implementations • 31 Jan 2023 • Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, Hongyuan Zha
Social dilemmas can be considered situations where individual rationality leads to collective irrationality.
no code implementations • 20 Nov 2022 • Ningyuan Chen, Ming Hu, Wenhao Li
In view of such a conflict, we provide a general analytical framework to study the augmentation of algorithmic decisions with human knowledge: the analyst uses the knowledge to set a guardrail by which the algorithmic decision is clipped if the algorithmic output is out of bound, and seems unreasonable.
1 code implementation • 14 Nov 2022 • Wenhao Li, Xiaoyuan Yi, Jinyi Hu, Maosong Sun, Xing Xie
In this work, we dig into the intrinsic mechanism of this problem and found that sparser attention values in Transformer could improve diversity.
no code implementations • 10 Nov 2022 • Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang
Visual commonsense understanding requires Vision Language (VL) models to not only understand image and text but also cross-reference in-between to fully integrate and achieve comprehension of the visual scene described.
no code implementations • 22 Oct 2022 • Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie
We demonstrate that TRACE could enhance the entanglement of each segment and preceding latent variables and deduce a non-zero lower bound of the KL term, providing a theoretical guarantee of generation diversity.
1 code implementation • NAACL 2022 • Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie
The past several years have witnessed Variational Auto-Encoder's superiority in various text generation tasks.
1 code implementation • 13 Jun 2022 • Wenhao Li, Hong Liu, Tianyu Guo, Runwei Ding, Hao Tang
To the best of our knowledge, this is the first MLP-Like architecture for 3D human pose estimation in a single frame and a video sequence.
Ranked #53 on 3D Human Pose Estimation on Human3.6M
no code implementations • 21 Feb 2022 • Chao Lv, Han Zhang, Xinkai Du, Yunhao Zhang, Ying Huang, Wenhao Li, Jia Han, Shanshan Gu
With the success of down streaming task using English pre-trained language model, the pre-trained Chinese language model is also necessary to get a better performance of Chinese NLP task.
1 code implementation • 8 Feb 2022 • Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang
The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy.
Multi-Agent Path Finding Multi-agent Reinforcement Learning +1
no code implementations • 27 Dec 2021 • Yuan YAO, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang, Chaojun Xiao, Xiaozhi Wang, Fanchao Qi, Junwei Bao, Jinran Nie, Zheni Zeng, Yuxian Gu, Kun Zhou, Xuancheng Huang, Wenhao Li, Shuhuai Ren, Jinliang Lu, Chengqiang Xu, Huadong Wang, Guoyang Zeng, Zile Zhou, Jiajun Zhang, Juanzi Li, Minlie Huang, Rui Yan, Xiaodong He, Xiaojun Wan, Xin Zhao, Xu sun, Yang Liu, Zhiyuan Liu, Xianpei Han, Erhong Yang, Zhifang Sui, Maosong Sun
We argue that for general-purpose language intelligence evaluation, the benchmark itself needs to be comprehensive and systematic.
2 code implementations • 9 Dec 2021 • Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang
A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.
1 code implementation • CVPR 2022 • Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc van Gool
Estimating 3D human poses from monocular videos is a challenging task due to depth ambiguity and self-occlusion.
Ranked #22 on 3D Human Pose Estimation on MPI-INF-3DHP
no code implementations • 15 Nov 2021 • Wenhao Li, Qisen Xu, Chuyun Shen, Bin Hu, Fengping Zhu, Yuxin Li, Bo Jin, Xiangfeng Wang
Based on the confidential information, a self-adaptive reward function is designed to provide more detailed feedback, and a simulated label generation mechanism is proposed on unsupervised data to reduce over-reliance on labeled data.
1 code implementation • 3 Jun 2021 • Wenhao Li, Fanchao Qi, Maosong Sun, Xiaoyuan Yi, Jiarui Zhang
We hope this dataset can further enhance the study on incorporating deep semantics into the understanding and generation system of Chinese classical poetry.
no code implementations • 23 May 2021 • Guoliang Hua, Hong Liu, Wenhao Li, Qian Zhang, Runwei Ding, Xin Xu
Instead, exploiting multi-view information is a practical way to achieve absolute 3D human pose estimation.
Monocular 3D Human Pose Estimation Weakly-supervised 3D Human Pose Estimation +1
1 code implementation • 26 Mar 2021 • Wenhao Li, Hong Liu, Runwei Ding, Mengyuan Liu, Pichao Wang, Wenming Yang
The modified VTE is termed as Strided Transformer Encoder (STE), which is built upon the outputs of VTE.
Ranked #2 on 3D Human Pose Estimation on HumanEva-I
no code implementations • ICLR 2022 • Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Hongyuan Zha
In this paper, we introduce a novel notion, the $\delta$-measurement, to explicitly measure the non-stationarity of a policy sequence, which can be further proved to be bounded by the KL-divergence of consecutive joint policies.
no code implementations • 9 Feb 2021 • Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha
In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named {\sc{Rochico}} based on reinforced organization control and hierarchical consensus learning.
no code implementations • 26 Oct 2020 • Wenhao Li, Bincheng Zhang, Jiajie Zhang
Detecting and intercepting malicious requests are one of the most widely used ways against attacks in the network security.
no code implementations • 11 Oct 2020 • Wenhao Li
I characterize the consumer-optimal market segmentation in competitive markets where multiple firms selling differentiated products to consumers with unit demand.
no code implementations • 17 Sep 2020 • Wenhao Li, Ningyuan Chen, L. Jeff Hong
Our algorithm achieves the regret $\tilde{O}(T^{(d_x^*+d_y+1)/(d_x^*+d_y+2)})$, where $d_x^*$ is the effective covariate dimension.
no code implementations • 17 Apr 2020 • Wenhao Li, Bo Jin, Xiangfeng Wang, Junchi Yan, Hongyuan Zha
Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes unpractical in complicated applications, due to non-interactivity between agents, curse of dimensionality and computation complexity.
Multi-agent Reinforcement Learning Reinforcement Learning (RL) +2
no code implementations • 13 Mar 2020 • Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun
Though recent neural models make prominent progress in some criteria of poetry quality, generated poems still suffer from the problem of poor diversity.
no code implementations • 11 Feb 2020 • Junjie Sheng, Xiangfeng Wang, Bo Jin, Junchi Yan, Wenhao Li, Tsung-Hui Chang, Jun Wang, Hongyuan Zha
This work explores the large-scale multi-agent communication mechanism under a multi-agent reinforcement learning (MARL) setting.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 11 Feb 2020 • Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha
In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward.
no code implementations • CVPR 2020 • Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Ya zhang, Yan-Feng Wang
We here propose to model the dynamic process of iterative interactive image segmentation as a Markov decision process (MDP) and solve it with reinforcement learning (RL).
no code implementations • 15 Jul 2019 • Wenhao Li, Ningyuan Chen, L. Jeff Hong
The literature has shown that for Lipschitz-continuous functions, the optimal regret is $\tilde{O}(T^{\frac{d_x+d_y+1}{d_x+d_y+2}})$, where $d_x$ and $d_y$ are the dimensions of contexts and arms, and thus suffers from the curse of dimensionality.
no code implementations • ACL 2019 • Guo Zhipeng, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Jiannan Liang, Huimin Chen, Yuhui Zhang, Ruoyu Li
By exposing the options of poetry genres, styles and revision modes, Jiuge, acting as a professional assistant, allows constant and active participation of users in poetic creation.
no code implementations • EMNLP 2018 • Xiaoyuan Yi, Maosong Sun, Ruoyu Li, Wenhao Li
Human experts evaluate poetry in terms of some specific criteria, instead of word-level likelihood.
no code implementations • EMNLP 2018 • Cheng Yang, Maosong Sun, Xiaoyuan Yi, Wenhao Li
The ability to write diverse poems in different styles under the same poetic imagery is an important characteristic of human poetry writing.