1 code implementation • 20 Dec 2023 • Dan Shi, Chaobin You, Jiantao Huang, Taihao Li, Deyi Xiong
With these pre-defined domains and slots, we collect 76, 787 commonsense knowledge annotations from 19, 700 dialogues through crowdsourcing.
no code implementations • 16 Dec 2023 • Jun Sun, Xinxin Zhang, Shoukang Han, Yu-Ping Ruan, Taihao Li
Multimodal learning is susceptible to modality missing, which poses a major obstacle for its practical applications and, thus, invigorates increasing research interest.
1 code implementation • 29 Nov 2023 • Fukun Yin, Xin Chen, Chi Zhang, Biao Jiang, Zibo Zhao, Jiayuan Fan, Gang Yu, Taihao Li, Tao Chen
The advent of large language models, enabling flexibility through instruction-driven approaches, has revolutionized many traditional generative tasks, but large models for 3D data, particularly in comprehensively handling 3D shapes with other modalities, are still under-explored.
1 code implementation • 6 Sep 2023 • Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen
Moreover, we argue that object localization and description generation require different levels of scene understanding, which could be challenging for a shared set of queries to capture.
no code implementations • 14 Dec 2022 • Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter
Human speech can be characterized by different components, including semantic content, speaker identity and prosodic information.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 16 Nov 2022 • Wang Qi, Yu-Ping Ruan, Yuan Zuo, Taihao Li
Conventional fine-tuning encounters increasing difficulties given the size of current Pre-trained Language Models, which makes parameter-efficient tuning become the focal point of frontier research.
no code implementations • 16 Nov 2022 • Leyuan Qu, Wei Wang, Cornelius Weber, Pengcheng Yue, Taihao Li, Stefan Wermter
Once training is completed, EmoAug enriches expressions of emotional speech with different prosodic attributes, such as stress, rhythm and intensity, by feeding different styles into the paralinguistic encoder.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
2 code implementations • 21 Oct 2022 • Yunfan Li, Mouxing Yang, Dezhong Peng, Taihao Li, Jiantao Huang, Xi Peng
Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively.
Ranked #1 on Short Text Clustering on Biomedical
1 code implementation • CVPR 2022 • Mouxing Yang, Zhenyu Huang, Peng Hu, Taihao Li, Jiancheng Lv, Xi Peng
To solve the TNL problem, we propose a novel method for robust VI-ReID, termed DuAlly Robust Training (DART).
no code implementations • 6 Oct 2021 • Fen Wang, Gene Cheung, Taihao Li, Ying Du, Yu-Ping Ruan
Sensor placement for linear inverse problems is the selection of locations to assign sensors so that the entire physical signal can be well recovered from partial observations.