Search Results for author: Zhao Yang

Found 27 papers, 7 papers with code

Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint

no code implementations • 19 Feb 2024 • Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu

Large language models internalize enormous parametric knowledge during pre-training.

Paper
Add Code

Dual-Tap Optical-Digital Feedforward Equalization Enabling High-Speed Optical Transmission in IM/DD Systems

no code implementations • 1 Feb 2024 • Yu Guo, Yangbo Wu, Zhao Yang, Lei Xue, Ning Liang, Yang Ren, Zhengrui Tu, Jia Feng, Qunbi Zhuge

Intensity-modulation and direct-detection (IM/DD) transmission is widely adopted for high-speed optical transmission scenarios due to its cost-effectiveness and simplicity.

Paper
Add Code

AppAgent: Multimodal Agents as Smartphone Users

no code implementations • 21 Dec 2023 • Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu

Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks.

Navigate

Paper
Add Code

EduGym: An Environment and Notebook Suite for Reinforcement Learning Education

1 code implementation • 17 Nov 2023 • Thomas M. Moerland, Matthias Müller-Brockhausen, Zhao Yang, Andrius Bernatavicius, Koen Ponse, Tom Kouwenhoven, Andreas Sauter, Michiel van der Meer, Bram Renting, Aske Plaat

To solve this issue we introduce EduGym, a set of educational reinforcement learning environments and associated interactive notebooks tailored for education.

reinforcement-learning

Paper
Code

Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling

1 code implementation • 3 Aug 2023 • Zhao Yang, Bing Su, Ji-Rong Wen

Firstly, they cannot directly generate coherent motions and require additional operations such as interpolation to process the generated actions.

Sentence

Paper
Code

Two-Memory Reinforcement Learning

no code implementations • 20 Apr 2023 • Zhao Yang, Thomas. M. Moerland, Mike Preuss, Aske Plaat

While deep reinforcement learning has shown important empirical success, it tends to learn relatively slow due to slow propagation of rewards information and slow update of parametric neural networks.

reinforcement-learning Representation Learning +1

Paper
Add Code

Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation

no code implementations • 11 Mar 2023 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr

Referring image segmentation segments an image from a language expression.

Image Segmentation Object +1

Paper
Add Code

First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation

no code implementations • 6 Dec 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

In this paper, we present a clear ablation study of post-exploration in a general intrinsically motivated goal exploration process (IMGEP) framework, that the Go-Explore paper did not show.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

Continuous Episodic Control

no code implementations • 28 Nov 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Therefore, this paper introduces Continuous Episodic Control (CEC), a novel non-parametric episodic memory algorithm for sequential decision making in problems with a continuous action space.

Continuous Control Decision Making +2

Paper
Add Code

Flexible Android Malware Detection Model based on Generative Adversarial Networks with Code Tensor

no code implementations • 25 Oct 2022 • Zhao Yang, Fengyang Deng, Linxi Han

Finally, a flexible Android malware detection model based on GANs with code tensor (MTFD-GANs) is proposed.

Android Malware Detection Malware Detection

Paper
Add Code

A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language

4 code implementations • 12 Sep 2022 • Bing Su, Dazhao Du, Zhao Yang, Yujie Zhou, Jiangmeng Li, Anyi Rao, Hao Sun, Zhiwu Lu, Ji-Rong Wen

Although artificial intelligence (AI) has made significant progress in understanding molecules in a wide range of fields, existing models generally acquire the single cognitive ability from the single molecular modality.

Ranked #7 on Molecule Captioning on ChEBI-20

Contrastive Learning Cross-Modal Retrieval +4

Paper
Code

When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation

no code implementations • 29 Mar 2022 • Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Go-Explore achieved breakthrough performance on challenging reinforcement learning (RL) tasks with sparse rewards.

Reinforcement Learning (RL)

Paper
Add Code

On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR

no code implementations • 26 Jan 2022 • Zhao Yang, Dianwen Ng, Xiao Fu, Liping Han, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao

Based on the above intuition, we first investigate types of end-to-end encoder-decoder based models in the single-input dual-output (SIDO) multi-task framework, after which a novel asynchronous decoding with fuzzy Pinyin sampling method is proposed according to the one-to-one correspondence characteristics between Pinyin and Character.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Seq-Masks: Bridging the gap between appearance and gait modeling for video-based person re-identification

no code implementations • 10 Dec 2021 • Zhigang Chang, Zhao Yang, Yongbiao Chen, Qin Zhou, Shibao Zheng

Validations on the gait recognition metric CASIA-B dataset further demonstrated the capability of our hybrid model.

Gait Recognition Video-Based Person Re-Identification

Paper
Add Code

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

1 code implementation • CVPR 2022 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr

Referring image segmentation is a fundamental vision-language task that aims to segment out an object referred to by a natural language expression from an image.

Ranked #3 on Generalized Referring Expression Segmentation on gRefCOCO

Decoder Generalized Referring Expression Segmentation +3

170

Paper
Code

Hierarchical interaction network for video object segmentation from referring expressions

no code implementations • British Machine Vision Conference 2021 • Zhao Yang, Yansong Tang, Luca Bertinetto, Hengshuang Zhao, Philip Torr

In this paper, we investigate the problem of video object segmentation from referring expressions (VOSRE).

Ranked #1 on Referring Expression Segmentation on J-HMDB (Precision@0.9 metric)

Optical Flow Estimation Referring Expression Segmentation +3

Paper
Add Code

Logic Traps in Evaluating Attribution Scores

no code implementations • ACL 2022 • Yiming Ju, Yuanzhe Zhang, Zhao Yang, Zhongtao Jiang, Kang Liu, Jun Zhao

Meanwhile, since the reasoning process of deep models is inaccessible, researchers design various evaluation methods to demonstrate their arguments.

Paper
Add Code

Potential-based Reward Shaping in Sokoban

no code implementations • 10 Sep 2021 • Zhao Yang, Mike Preuss, Aske Plaat

While previous work has investigated the use of expert knowledge to generate potential functions, in this work, we study whether we can use a search algorithm(A*) to automatically generate a potential function for reward shaping in Sokoban, a well-known planning task.

Paper
Add Code

Follow the Prophet: Accurate Online Conversion Rate Prediction in the Face of Delayed Feedback

1 code implementation • 13 Aug 2021 • Haoming Li, Feiyang Pan, Xiang Ao, Zhao Yang, Min Lu, Junwei Pan, Dapeng Liu, Lei Xiao, Qing He

The delayed feedback problem is one of the imperative challenges in online advertising, which is caused by the highly diversified feedback delay of a conversion varying from a few minutes to several days.

Paper
Code

Alignment Rationale for Natural Language Inference

no code implementations • ACL 2021 • Zhongtao Jiang, Yuanzhe Zhang, Zhao Yang, Jun Zhao, Kang Liu

Deep learning models have achieved great success on the task of Natural Language Inference (NLI), though only a few attempts try to explain their behaviors.

feature selection Natural Language Inference

Paper
Add Code

Transfer Learning and Curriculum Learning in Sokoban

no code implementations • 25 May 2021 • Zhao Yang, Mike Preuss, Aske Plaat

In reinforcement learning, learning actions for a behavior policy that can be applied to new environments is still a challenge, especially for tasks that involve much planning.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Explore User Neighborhood for Real-time E-commerce Recommendation

no code implementations • 28 Feb 2021 • Xu Xie, Fei Sun, Xiaoyong Yang, Zhao Yang, Jinyang Gao, Wenwu Ou, Bin Cui

On the one hand, it utilizes UI relations and user neighborhood to capture both global and local information.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Helios: Heterogeneity-Aware Federated Learning with Dynamically Balanced Collaboration

no code implementations • 3 Dec 2019 • Zirui Xu, Zhao Yang, JinJun Xiong, Jianlei Yang, Xiang Chen

In this paper, we propose Helios, a heterogeneity-aware FL framework to tackle the straggler issue.

Distributed, Parallel, and Cluster Computing

Paper
Add Code

Anchor Diffusion for Unsupervised Video Object Segmentation

1 code implementation • ICCV 2019 • Zhao Yang, Qiang Wang, Luca Bertinetto, Weiming Hu, Song Bai, Philip H. S. Torr

Unsupervised video object segmentation has often been tackled by methods based on recurrent neural networks and optical flow.

Ranked #19 on Unsupervised Video Object Segmentation on DAVIS 2016 val

Image Segmentation Object +4

116

Paper
Code

Hetero-Center Loss for Cross-Modality Person Re-Identification

no code implementations • 22 Oct 2019 • Yuanxin Zhu, Zhao Yang, Li Wang, Sai Zhao, Xiao Hu, Dapeng Tao

With the joint supervision of Cross-Entropy (CE) loss and HC loss, the network is trained to achieve two vital objectives, inter-class discrepancy and intra-class cross-modality similarity as much as possible.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Res-embedding for Deep Learning Based Click-Through Rate Prediction Modeling

no code implementations • 25 Jun 2019 • Guorui Zhou, Kailun Wu, Weijie Bian, Zhao Yang, Xiaoqiang Zhu, Kun Gai

In this paper, we model user behavior using an interest delay model, study carefully the embedding mechanism, and obtain two important results: (i) We theoretically prove that small aggregation radius of embedding vectors of items which belongs to a same user interest domain will result in good generalization performance of deep CTR model.

Click-Through Rate Prediction

Paper
Add Code

Learn to Interpret Atari Agents

1 code implementation • 29 Dec 2018 • Zhao Yang, Song Bai, Li Zhang, Philip H. S. Torr

Deep reinforcement learning (DeepRL) agents surpass human-level performance in many tasks.

Decision Making

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.