no code implementations • 19 Feb 2024 • Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu
Large language models internalize enormous parametric knowledge during pre-training.
no code implementations • 1 Feb 2024 • Yu Guo, Yangbo Wu, Zhao Yang, Lei Xue, Ning Liang, Yang Ren, Zhengrui Tu, Jia Feng, Qunbi Zhuge
Intensity-modulation and direct-detection (IM/DD) transmission is widely adopted for high-speed optical transmission scenarios due to its cost-effectiveness and simplicity.
no code implementations • 21 Dec 2023 • Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu
Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks.
1 code implementation • 17 Nov 2023 • Thomas M. Moerland, Matthias Müller-Brockhausen, Zhao Yang, Andrius Bernatavicius, Koen Ponse, Tom Kouwenhoven, Andreas Sauter, Michiel van der Meer, Bram Renting, Aske Plaat
To solve this issue we introduce EduGym, a set of educational reinforcement learning environments and associated interactive notebooks tailored for education.
1 code implementation • 3 Aug 2023 • Zhao Yang, Bing Su, Ji-Rong Wen
Firstly, they cannot directly generate coherent motions and require additional operations such as interpolation to process the generated actions.
no code implementations • 20 Apr 2023 • Zhao Yang, Thomas. M. Moerland, Mike Preuss, Aske Plaat
While deep reinforcement learning has shown important empirical success, it tends to learn relatively slow due to slow propagation of rewards information and slow update of parametric neural networks.
no code implementations • 11 Mar 2023 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr
Referring image segmentation segments an image from a language expression.
no code implementations • 6 Dec 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat
In this paper, we present a clear ablation study of post-exploration in a general intrinsically motivated goal exploration process (IMGEP) framework, that the Go-Explore paper did not show.
no code implementations • 28 Nov 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat
Therefore, this paper introduces Continuous Episodic Control (CEC), a novel non-parametric episodic memory algorithm for sequential decision making in problems with a continuous action space.
no code implementations • 25 Oct 2022 • Zhao Yang, Fengyang Deng, Linxi Han
Finally, a flexible Android malware detection model based on GANs with code tensor (MTFD-GANs) is proposed.
4 code implementations • 12 Sep 2022 • Bing Su, Dazhao Du, Zhao Yang, Yujie Zhou, Jiangmeng Li, Anyi Rao, Hao Sun, Zhiwu Lu, Ji-Rong Wen
Although artificial intelligence (AI) has made significant progress in understanding molecules in a wide range of fields, existing models generally acquire the single cognitive ability from the single molecular modality.
Ranked #7 on Molecule Captioning on ChEBI-20
no code implementations • 29 Mar 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat
Go-Explore achieved breakthrough performance on challenging reinforcement learning (RL) tasks with sparse rewards.
no code implementations • 26 Jan 2022 • Zhao Yang, Dianwen Ng, Xiao Fu, Liping Han, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao
Based on the above intuition, we first investigate types of end-to-end encoder-decoder based models in the single-input dual-output (SIDO) multi-task framework, after which a novel asynchronous decoding with fuzzy Pinyin sampling method is proposed according to the one-to-one correspondence characteristics between Pinyin and Character.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 10 Dec 2021 • Zhigang Chang, Zhao Yang, Yongbiao Chen, Qin Zhou, Shibao Zheng
Validations on the gait recognition metric CASIA-B dataset further demonstrated the capability of our hybrid model.
1 code implementation • CVPR 2022 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr
Referring image segmentation is a fundamental vision-language task that aims to segment out an object referred to by a natural language expression from an image.
no code implementations • British Machine Vision Conference 2021 • Zhao Yang, Yansong Tang, Luca Bertinetto, Hengshuang Zhao, Philip Torr
In this paper, we investigate the problem of video object segmentation from referring expressions (VOSRE).
Ranked #1 on Referring Expression Segmentation on J-HMDB (Precision@0.9 metric)
Optical Flow Estimation Referring Expression Segmentation +3
no code implementations • ACL 2022 • Yiming Ju, Yuanzhe Zhang, Zhao Yang, Zhongtao Jiang, Kang Liu, Jun Zhao
Meanwhile, since the reasoning process of deep models is inaccessible, researchers design various evaluation methods to demonstrate their arguments.
no code implementations • 10 Sep 2021 • Zhao Yang, Mike Preuss, Aske Plaat
While previous work has investigated the use of expert knowledge to generate potential functions, in this work, we study whether we can use a search algorithm(A*) to automatically generate a potential function for reward shaping in Sokoban, a well-known planning task.
1 code implementation • 13 Aug 2021 • Haoming Li, Feiyang Pan, Xiang Ao, Zhao Yang, Min Lu, Junwei Pan, Dapeng Liu, Lei Xiao, Qing He
The delayed feedback problem is one of the imperative challenges in online advertising, which is caused by the highly diversified feedback delay of a conversion varying from a few minutes to several days.
no code implementations • ACL 2021 • Zhongtao Jiang, Yuanzhe Zhang, Zhao Yang, Jun Zhao, Kang Liu
Deep learning models have achieved great success on the task of Natural Language Inference (NLI), though only a few attempts try to explain their behaviors.
no code implementations • 25 May 2021 • Zhao Yang, Mike Preuss, Aske Plaat
In reinforcement learning, learning actions for a behavior policy that can be applied to new environments is still a challenge, especially for tasks that involve much planning.
no code implementations • 28 Feb 2021 • Xu Xie, Fei Sun, Xiaoyong Yang, Zhao Yang, Jinyang Gao, Wenwu Ou, Bin Cui
On the one hand, it utilizes UI relations and user neighborhood to capture both global and local information.
no code implementations • 3 Dec 2019 • Zirui Xu, Zhao Yang, JinJun Xiong, Jianlei Yang, Xiang Chen
In this paper, we propose Helios, a heterogeneity-aware FL framework to tackle the straggler issue.
Distributed, Parallel, and Cluster Computing
1 code implementation • ICCV 2019 • Zhao Yang, Qiang Wang, Luca Bertinetto, Weiming Hu, Song Bai, Philip H. S. Torr
Unsupervised video object segmentation has often been tackled by methods based on recurrent neural networks and optical flow.
Ranked #19 on Unsupervised Video Object Segmentation on DAVIS 2016 val
no code implementations • 22 Oct 2019 • Yuanxin Zhu, Zhao Yang, Li Wang, Sai Zhao, Xiao Hu, Dapeng Tao
With the joint supervision of Cross-Entropy (CE) loss and HC loss, the network is trained to achieve two vital objectives, inter-class discrepancy and intra-class cross-modality similarity as much as possible.
Cross-Modality Person Re-identification Person Re-Identification
no code implementations • 25 Jun 2019 • Guorui Zhou, Kailun Wu, Weijie Bian, Zhao Yang, Xiaoqiang Zhu, Kun Gai
In this paper, we model user behavior using an interest delay model, study carefully the embedding mechanism, and obtain two important results: (i) We theoretically prove that small aggregation radius of embedding vectors of items which belongs to a same user interest domain will result in good generalization performance of deep CTR model.
1 code implementation • 29 Dec 2018 • Zhao Yang, Song Bai, Li Zhang, Philip H. S. Torr
Deep reinforcement learning (DeepRL) agents surpass human-level performance in many tasks.