1 code implementation • Findings (EMNLP) 2021 • Boer Lyu, Lu Chen, Kai Yu
Sememes are defined as the atomic units to describe the semantic meaning of concepts.
1 code implementation • 4 May 2024 • Hanchong Zhang, Ruisheng Cao, Hongshen Xu, Lu Chen, Kai Yu
Recently, Large Language Models (LLMs) have been demonstrated to possess impressive capabilities in a variety of domains and tasks.
no code implementations • 7 Apr 2024 • Kunyao Lan, Cong Ming, Binwei Yao, Lu Chen, Mengyue Wu
Nevertheless, the blend of task-oriented and chit-chat in diagnosis-related dialogues necessitates professional expertise and empathy.
2 code implementations • 6 Apr 2024 • Hongchuan Zeng, Hongshen Xu, Lu Chen, Kai Yu
MBS overcomes the English-centric limitations of existing methods by sampling calibration data from various languages proportionally to the language distribution of the model training datasets.
no code implementations • 27 Mar 2024 • Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu
Large Language Models (LLMs) often generate erroneous outputs, known as hallucinations, due to their limitations in discerning questions beyond their knowledge scope.
no code implementations • 5 Mar 2024 • Yutong Li, Lu Chen, Aiwei Liu, Kai Yu, Lijie Wen
In this work, we firstly focus on the independent literature summarization step and introduce ChatCite, an LLM agent with human workflow guidance for comparative literature summary.
1 code implementation • 28 Feb 2024 • Hongshen Xu, Ruisheng Cao, Su Zhu, Sheng Jiang, Hanchong Zhang, Lu Chen, Kai Yu
Previous work on spoken language understanding (SLU) mainly focuses on single-intent settings, where each input utterance merely contains one user intent.
1 code implementation • 28 Feb 2024 • Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu
Additionally, we propose several pre-training tasks to model the interaction among text, structure, and image modalities effectively.
no code implementations • 18 Feb 2024 • Nuo Xu, Jun Zhao, Can Zu, Sixian Li, Lu Chen, Zhihao Zhang, Rui Zheng, Shihan Dou, Wenjuan Qin, Tao Gui, Qi Zhang, Xuanjing Huang
To address this issue, we propose a cost-effective preference learning strategy, optimizing reward models by distinguishing between human and machine translations.
no code implementations • 9 Feb 2024 • Lu Chen, Wei Huang, Ruqing Zhang, Wei Chen, Jiafeng Guo, Xueqi Cheng
The key idea is to learn task-required causal factors and only use those to make predictions for a given task.
no code implementations • 5 Feb 2024 • Zichen Zhu, Yang Xu, Lu Chen, Jingkai Yang, Yichuan Ma, Yiming Sun, Hailin Wen, Jiaqi Liu, Jinyu Cai, Yingzi Ma, Situo Zhang, Zihan Zhao, Liangtai Sun, Kai Yu
Rapid progress in multimodal large language models (MLLMs) highlights the need to introduce challenging yet realistic benchmarks to the academic community, while existing benchmarks primarily focus on understanding simple natural images and short context.
no code implementations • 3 Feb 2024 • Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo
In the context of deep neural networks, we expose the existence of a harmless perturbation space, where perturbations leave the network output entirely unaltered.
1 code implementation • 2 Feb 2024 • Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, WangMeng Zuo, Junjun Jiang, Xianming Liu
Recent years have witnessed remarkable advances in artificial intelligence generated content(AIGC), with diverse input modalities, e. g., text, image, video, audio and 3D.
1 code implementation • 30 Jan 2024 • Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang
This technique introduces a fusion network to unify the processing of outputs from different visual experts, while bridging the gap between image encoders and pre-trained LLMs.
Ranked #43 on Visual Question Answering on MM-Vet
1 code implementation • 29 Jan 2024 • Lu Chen, Siyu Lou, Benhao Huang, Quanshi Zhang
Faithfully summarizing the knowledge encoded by a deep neural network (DNN) into a few symbolic primitive patterns without losing much information represents a core challenge in explainable AI.
no code implementations • 26 Jan 2024 • Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Xin Chen, Kai Yu
To this end, we develop ChemDFM, the first LLM towards CGI.
no code implementations • 19 Jan 2024 • Chao Zhang, YUREN MAO, Yijiang Fan, Yu Mi, Yunjun Gao, Lu Chen, Dongfang Lou, Jinshu Lin
Text-to-SQL, which provides zero-code interface for operating relational databases, has gained much attention in financial analysis; because, financial professionals may not well-skilled in SQL programming.
1 code implementation • 11 Jan 2024 • Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang
We introduce a series of novel methods to mitigate the influence of incorrect and ambiguous preferences in the dataset and fully leverage high-quality preference data.
no code implementations • 7 Jan 2024 • Siyu Li, Lu Chen, Chenwei Song, Xinyi Liu
To resolve the semantic ambiguity in texts, we propose a model, which innovatively combines a knowledge graph with an improved attention mechanism.
1 code implementation • 11 Dec 2023 • Mengzhao Wang, Xiangyu Ke, Xiaoliang Xu, Lu Chen, Yunjun Gao, Pinpin Huang, Runkai Zhu
We investigate the problem of multimodal search of target modality, where the task involves enhancing a query in a specific target modality by integrating information from auxiliary modalities.
no code implementations • 22 Nov 2023 • Weiwei Li, Xing Liu, Wei Wang, Lu Chen, Sizhe Li, Hui Fan
To address the challenge of identifying hidden danger in substations from unstructured text, a novel dynamic analysis method is proposed.
no code implementations • 28 Oct 2023 • Ruisheng Cao, Hanchong Zhang, Hongshen Xu, Jieyu Li, Da Ma, Lu Chen, Kai Yu
Text-to-SQL aims to generate an executable SQL program given the user utterance and the corresponding database schema.
1 code implementation • 26 Oct 2023 • Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu
Recently Large Language Models (LLMs) have been proven to have strong abilities in various domains and tasks.
1 code implementation • 25 Aug 2023 • Liangtai Sun, Yang Han, Zihan Zhao, Da Ma, Zhennan Shen, Baocai Chen, Lu Chen, Kai Yu
This design suffers from data leakage problem and lacks the evaluation of subjective Q/A ability.
1 code implementation • 24 Aug 2023 • Lu Chen, Ruqing Zhang, Wei Huang, Wei Chen, Jiafeng Guo, Xueqi Cheng
The key idea is to reformulate the Variational Auto-encoder (VAE) to fit the joint distribution of the document and summary variables from the training corpus.
1 code implementation • 2 Aug 2023 • Xiaocan Zeng, Pengfei Wang, YUREN MAO, Lu Chen, Xiaoze Liu, Yunjun Gao
Traditional unsupervised EM assumes that all entities come from two tables; however, it is more common to match entities from multiple tables in practical applications, that is, multi-table entity matching (multi-table EM).
1 code implementation • 14 Jul 2023 • XueMei Dong, Chao Zhang, Yuhang Ge, YUREN MAO, Yunjun Gao, Lu Chen, Jinshu Lin, Dongfang Lou
This paper proposes a ChatGPT-based zero-shot Text-to-SQL method, dubbed C3, which achieves 82. 3\% in terms of execution accuracy on the holdout test set of Spider and becomes the state-of-the-art zero-shot Text-to-SQL method on the Spider Challenge.
Ranked #4 on Text-To-SQL on spider
1 code implementation • 11 Jul 2023 • Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang
Therefore, we explore the PPO-max, an advanced version of PPO algorithm, to efficiently improve the training stability of the policy model.
no code implementations • 5 Jul 2023 • Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao
This has led to the increasing volume of database workloads, which provides the opportunity for pattern analysis.
no code implementations • 19 Jun 2023 • Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang
Generating realistic human motion from given action descriptions has experienced significant advancements because of the emerging requirement of digital humans.
1 code implementation • NeurIPS 2023 • Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu
By equipping the LLM with a long-term experience memory, REMEMBERER is capable of exploiting the experiences from the past episodes even for different task goals, which excels an LLM-based agent with fixed exemplars or equipped with a transient working memory.
1 code implementation • 25 May 2023 • Hanchong Zhang, Jieyu Li, Lu Chen, Ruisheng Cao, Yunyan Zhang, Yu Huang, Yefeng Zheng, Kai Yu
Furthermore, we present CSS, a large-scale CrosS-Schema Chinese text-to-SQL dataset, to carry on corresponding studies.
1 code implementation • 14 May 2023 • Danyang Zhang, Hongshen Xu, Zihan Zhao, Lu Chen, Ruisheng Cao, Kai Yu
A GUI task set based on WikiHow app is collected on Mobile-Env to form a benchmark covering a range of GUI interaction capabilities.
1 code implementation • 28 Apr 2023 • Xinjun Zhu, Yuntao Du, YUREN MAO, Lu Chen, Yujia Hu, Yunjun Gao
Knowledge graph (KG), which contains rich side information, becomes an essential part to boost the recommendation performance and improve its explainability.
1 code implementation • 27 Apr 2023 • Yuntao Du, Jianxun Lian, Jing Yao, Xiting Wang, Mingqi Wu, Lu Chen, Yunjun Gao, Xing Xie
In recent decades, there have been significant advancements in latent embedding-based CF methods for improved accuracy, such as matrix factorization, neural collaborative filtering, and LightGCN.
1 code implementation • 14 Apr 2023 • Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao, Ziheng Wei
To enhance the usability of GNN-based EA models in real-world applications, we present SEA, a scalable entity alignment system that enables to (i) train large-scale GNNs for EA, (ii) speed up the normalization and the evaluation process, and (iii) report clear results for users to estimate different models and parameter settings.
1 code implementation • 4 Apr 2023 • Lu Chen, Siyu Lou, Keyan Zhang, Jin Huang, Quanshi Zhang
The HarsanyiNet is designed on the theoretical foundation that the Shapley value can be reformulated as the redistribution of Harsanyi interactions encoded by the network.
no code implementations • 3 Apr 2023 • Minjun Zhao, Yichen Yin, YUREN MAO, Qing Liu, Lu Chen, Yunjun Gao
Recently, a few methods have been put forward to handle the SGA dilemma.
no code implementations • 3 Feb 2023 • Ronghao Dang, Lu Chen, Liuyi Wang, Zongtao He, Chengju Liu, Qijun Chen
We propose a meta-ability decoupling (MAD) paradigm, which brings together various object navigation methods in an architecture system, allowing them to mutually enhance each other and evolve together.
1 code implementation • 1 Feb 2023 • Xiaoze Liu, Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao
State-of-the-art time-aware EA studies have suggested that the temporal information of TKGs facilitates the performance of EA.
Ranked #1 on Entity Alignment on YAGO-WIKI50K
no code implementations • 12 Jan 2023 • Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, Kai Yu
Exploring the generalization of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases.
no code implementations • 11 Dec 2022 • Danlei Hu, Ziquan Fang, Hanxi Fang, Tianyi Li, Chunhui Shen, Lu Chen, Yunjun Gao
Transportation mode classification, the process of predicting the class labels of moving objects transportation modes, has been widely applied to a variety of real world applications, such as traffic management, urban computing, and behavior study.
no code implementations • 10 Sep 2022 • Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu
The second phase is to fine-tune the pretrained model on the TOD data.
1 code implementation • 15 Aug 2022 • Bing Wang, Lu Chen, Bo Yang
In this paper, we study the problem of 3D scene geometry decomposition and manipulation from 2D views.
no code implementations • 25 May 2022 • Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Xin Dong, Fujiang Ge, Qingliang Miao, Jian-Guang Lou, Kai Yu
In this work, we aim to build a unified dialogue foundation model (DFM) which can be used to solve massive diverse dialogue tasks.
no code implementations • 24 May 2022 • Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu
In a depression-diagnosis-directed clinical session, doctors initiate a conversation with ample emotional support that guides the patients to expose their symptoms based on clinical diagnosis criteria.
no code implementations • 23 May 2022 • Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu
However, this API-based architecture greatly limits the information-searching capability of intelligent assistants and may even lead to task failure if TOD-specific APIs are not available or the task is too complicated to be executed by the provided APIs.
2 code implementations • 20 May 2022 • Yunjun Gao, Xiaoze Liu, Junyang Wu, Tianyi Li, Pengfei Wang, Lu Chen
To tackle this challenge, we present ClusterEA, a general framework that is capable of scaling up EA models and enhancing their results by leveraging normalization methods on mini-batches with a high entity equivalent rate.
Ranked #2 on Entity Alignment on DBP1M DE-EN
1 code implementation • NAACL 2022 • Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests.
2 code implementations • 14 Apr 2022 • Yunjun Gao, Yuntao Du, Yujia Hu, Lu Chen, Xinjun Zhu, Ziquan Fang, Baihua Zheng
Besides, our method can automatically switch its learning phase at the memorization point from memorization to self-guided learning, and select clean and informative memorized data via a novel adaptive denoising scheduler to improve the robustness.
1 code implementation • 11 Apr 2022 • Yuntao Du, Xinjun Zhu, Lu Chen, Baihua Zheng, Yunjun Gao
Furthermore, we propose a dual item embeddings design to represent and propagate collaborative signals and knowledge associations separately, and leverage the gated aggregation to distill discriminative information for better capturing user behavior patterns.
Ranked #1 on Recommendation Systems on Alibaba-iFashion
no code implementations • SIGDIAL (ACL) 2022 • Zhi Chen, Lu Chen, Bei Chen, Libo Qin, Yuncong Liu, Su Zhu, Jian-Guang Lou, Kai Yu
With the development of pre-trained language models, remarkable success has been witnessed in dialogue understanding (DU).
1 code implementation • 8 Feb 2022 • Yuntao Du, Xinjun Zhu, Lu Chen, Ziquan Fang, Yunjun Gao
Inspired by the success of meta-learning on scarce training samples, we propose a novel meta-learning based framework called MetaKG, which encompasses a collaborative-aware meta learner and a knowledge-aware meta learner, to capture meta users' preference and entities' knowledge for cold-start recommendations.
no code implementations • 22 Jan 2022 • Keqi Wang, Ziteng Cui, Jieru Jia, Hao Xu, Ge Wu, Yin Zhuang, Lu Chen, Zhiguo Hu, Yuhua Qian
However, the convolution operation is based on a local sliding window mechanism, which is difficult to construct the long-range dependencies of the feature maps.
1 code implementation • 3 Jan 2022 • Zhuo Deng, Yuanhao Cai, Lu Chen, Zheng Gong, Qiqi Bao, Xue Yao, Dong Fang, Shaochong Zhang, Lan Ma
In this paper, we investigate the real clinical fundus image restoration problem.
1 code implementation • 17 Dec 2021 • Ziquan Fang, Yuntao Du, Xinjun Zhu, Lu Chen, Yunjun Gao, Christian S. Jensen
Trajectory similarity computation has drawn massive attention, as it is core functionality in a wide range of applications such as ride-sharing, traffic analysis, and social recommendation.
no code implementations • 9 Dec 2021 • Su Zhu, Lu Chen, Ruisheng Cao, Zhi Chen, Qingliang Miao, Kai Yu
In this paper, we propose to improve prototypical networks with vector projection distance and abstract triangular Conditional Random Field (CRF) for the few-shot NLU.
no code implementations • 8 Dec 2021 • Keyu Yang, Lu Chen, Zhihao Zeng, Yunjun Gao
Distributed ML models trained by SGD involve large amounts of gradient communication, which limits the scalability of distributed ML.
1 code implementation • NeurIPS 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang
This paper provides a unified view to explain different adversarial attacks and defense methods, i. e. the view of multi-order interactions between input variables of DNNs.
no code implementations • 19 Nov 2021 • Kamil Oster, Stefan Güttel, Lu Chen, Jonathan L. Shapiro, Megan Jobson
Firstly, it is important to enhance the quality of both sets of data (laboratory measurements and physical sensors) in a data pre-processing stage (as described in Methodology section).
1 code implementation • 5 Nov 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang
This paper provides a unified view to explain different adversarial attacks and defense methods, \emph{i. e.} the view of multi-order interactions between input variables of DNNs.
1 code implementation • 13 Oct 2021 • Minjun Zhao, Lu Chen, Keyu Yang, Yuntao Du, Yunjun Gao
It uses a Gaussian mixture-based metric called separation degree to rank materialized models.
no code implementations • 29 Sep 2021 • Lu Chen, Renjie Chen, Hang Guo, Yuan Luo, Quanshi Zhang, Yisen Wang
Adversarial examples have attracted significant attention over the years, yet a sufficient understanding is in lack, especially when analyzing their performances in combination with adversarial training.
no code implementations • 9 Sep 2021 • Shilin Lu, Xinghong Hu, Chengyou Wang, Lu Chen, Shulu Han, Yuejia Han
The aim of this paper is to improve the accuracy of copy-move forgery detection (CMFD) in image forensics by proposing a novel scheme and the main contribution is evolving circular domains coverage (ECDC) algorithm.
no code implementations • 17 Jun 2021 • Kamil Oster, Stefan Güttel, Jonathan L. Shapiro, Lu Chen, Megan Jobson
In this case, we used principal component analysis (PCA) with Hotelling's $T^2$ statistics to identify the long-term outliers.
no code implementations • Findings (ACL) 2021 • Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu
A dual learning approach is also proposed for the utterance rewrite model to address the data sparsity problem.
1 code implementation • ACL 2021 • Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu, Kai Yu
This work aims to tackle the challenging heterogeneous graph encoding problem in the text-to-SQL task.
no code implementations • NAACL 2021 • Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu
Given a database schema, Text-to-SQL aims to translate a natural language question into the corresponding SQL query.
1 code implementation • 12 Mar 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang
This paper provides a unified view to explain different adversarial attacks and defense methods, i. e. the view of multi-order interactions between input variables of DNNs.
1 code implementation • 25 Feb 2021 • Boer Lyu, Lu Chen, Su Zhu, Kai Yu
Additionally, we adopt the word lattice graph as input to maintain multi-granularity information.
1 code implementation • EMNLP 2021 • Xingyu Chen, Zihan Zhao, Lu Chen, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu
In this paper, we introduce the task of structural reading comprehension (SRC) on web.
no code implementations • 21 Jan 2021 • Yue Hong Gao, Xiao Hong, Hao Tian Yang, Lu Chen, Xiao Nan Zhang
The test results are compared with the theoretical results, analyzed and corrected, in order to verify the feasibility of our analysis model for the performance analysis of the actual network.
Networking and Internet Architecture Performance
no code implementations • 4 Jan 2021 • Aman Abidi, Lu Chen, Rui Zhou, Chengfei Liu
By exploiting the discoveries, we propose novel algorithms for maintaining the two indices, which substantially reduces the cost of maintenance.
1 code implementation • 15 Dec 2020 • Lu Chen, Jiao Sun, Wei Xu
In both letter-level and word-level attacks, our experiments show that in addition to natural appearance, FAWA achieves a 100% attack success rate with 60% less perturbations and 78% fewer iterations on average.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 14 Oct 2020 • Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu
Recently, pre-trained language models like BERT have shown promising performance on multiple natural language processing tasks.
no code implementations • 24 Sep 2020 • Bolong Zheng, Qi Hu, Lingfeng Ming, Jilin Hu, Lu Chen, Kai Zheng, Christian S. Jensen
In this setting, an assignment authority is to assign agents to requests such that the average idle time of the agents is minimized.
Databases Signal Processing
no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Xiang Zhou, Kai Yu
To the best of our knowledge, this is the first effort to optimize the DST module within DRL framework for on-line task-oriented spoken dialogue systems.
no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Xiaoyuan Liu, Kai Yu
The task-oriented spoken dialogue system (SDS) aims to assist a human user in accomplishing a specific task (e. g., hotel booking).
no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Yanbin Zhao, Su Zhu, Kai Yu
In task-oriented multi-turn dialogue systems, dialogue state refers to a compact representation of the user goal in the context of dialogue history.
no code implementations • 22 Sep 2020 • Zhi Chen, Xiaoyuan Liu, Lu Chen, Kai Yu
A novel ComNet is proposed to model the structure of a hierarchical agent.
no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu
In dialogue systems, a dialogue state tracker aims to accurately find a compact representation of the current dialogue status, based on the entire dialogue history.
1 code implementation • 21 Sep 2020 • Su Zhu, Ruisheng Cao, Lu Chen, Kai Yu
Few-shot slot tagging becomes appealing for rapid domain transfer and adaptation, motivated by the tremendous development of conversational dialogue systems.
no code implementations • 7 Sep 2020 • Chen Liu, Su Zhu, Lu Chen, Kai Yu
The framework consists of a slot tagging model and a rule-based value error recovery module.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • ACL 2020 • Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao, Su Zhu, Kai Yu
We also adopt graph attention networks with higher-order neighborhood information to encode the rich structure in AMR graphs.
no code implementations • ACL 2020 • Lu Chen, Yanbin Zhao, Boer Lyu, Lesheng Jin, Zhi Chen, Su Zhu, Kai Yu
Chinese short text matching usually employs word sequences rather than character sequences to get better performance.
1 code implementation • ACL 2020 • Ruisheng Cao, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen, Kai Yu
One daunting problem for semantic parsing is the scarcity of annotation.
1 code implementation • 24 May 2020 • Chen Liu, Su Zhu, Zijian Zhao, Ruisheng Cao, Lu Chen, Kai Yu
In this paper, a novel BERT based SLU model (WCN-BERT SLU) is proposed to encode WCNs and the dialogue context jointly.
no code implementations • 30 Apr 2020 • Yanbin Zhao, Lu Chen, Zhi Chen, Kai Yu
When modeling simple and complex sentences with autoencoders, we introduce different types of noise into the training process.
no code implementations • 26 Apr 2020 • Keyu Yang, Yunjun Gao, Lei Liang, Song Bian, Lu Chen, Baihua Zheng
We propose Crowd-based neural networks for Text Sentiment Classification (CrowdTSC for short).
no code implementations • Findings of the Association for Computational Linguistics 2020 • Su Zhu, Jieyu Li, Lu Chen, Kai Yu
In this paper, a novel context and schema fusion network is proposed to encode the dialogue context and schema graph by using internal and external attention mechanisms.
Ranked #8 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.0
Dialogue State Tracking Multi-domain Dialogue State Tracking
no code implementations • 3 Apr 2020 • Lu Chen, Boer Lv, Chi Wang, Su Zhu, Bowen Tan, Kai Yu
For multi-domain DST, the data sparsity problem is also a major obstacle due to the increased number of state candidates.
Ranked #12 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1
no code implementations • 8 Feb 2020 • Zafaryab Rasool, Rui Zhou, Lu Chen, Chengfei Liu, Jiajie Xu
Efficient query algorithms are proposed for these indices which significantly avoids irrelevant comparisons at the cost of space.
no code implementations • 8 Feb 2020 • Lu Chen, Wei Xu
Optical character recognition (OCR) is widely applied in real applications serving as a key preprocessing tool.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 27 May 2019 • Lu Chen, Zhi Chen, Bowen Tan, Sishan Long, Milica Gasic, Kai Yu
Experiments show that AgentGraph models significantly outperform traditional reinforcement learning approaches on most of the 18 tasks of the PyDial benchmark.
no code implementations • 13 Nov 2018 • Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen, Lu Chen
Origin-destination (OD) matrices are often used in urban planning, where a city is partitioned into regions and an element (i, j) in an OD matrix records the cost (e. g., travel time, fuel consumption, or travel speed) from region i to region j.
1 code implementation • ACL 2019 • Shun Zheng, Xu Han, Yankai Lin, Peilin Yu, Lu Chen, Ling Huang, Zhiyuan Liu, Wei Xu
To demonstrate the effectiveness of DIAG-NRE, we apply it to two real-world datasets and present both significant and interpretable improvements over state-of-the-art methods.
1 code implementation • EMNLP 2018 • Liliang Ren, Kaige Xie, Lu Chen, Kai Yu
Dialogue state tracking is the core part of a spoken dialogue system.
no code implementations • COLING 2018 • Lu Chen, Bowen Tan, Sishan Long, Kai Yu
The proposed structured deep reinforcement learning is based on graph neural networks (GNN), which consists of some sub-networks, each one for a node on a directed graph.
no code implementations • WS 2018 • Kaige Xie, Cheng Chang, Liliang Ren, Lu Chen, Kai Yu
Dialogue state tracking (DST), when formulated as a supervised learning problem, relies on labelled data.
no code implementations • EMNLP 2017 • Cheng Chang, Runzhe Yang, Lu Chen, Xiang Zhou, Kai Yu
The key to building an evolvable dialogue system in real-world scenarios is to ensure an affordable on-line dialogue policy learning, which requires the on-line learning process to be safe, efficient and economical.
no code implementations • EMNLP 2017 • Lu Chen, Xiang Zhou, Cheng Chang, Runzhe Yang, Kai Yu
Hand-crafted rules and reinforcement learning (RL) are two popular choices to obtain dialogue policy.
no code implementations • EACL 2017 • Lu Chen, Runzhe Yang, Cheng Chang, Zihao Ye, Xiang Zhou, Kai Yu
On-line dialogue policy learning is the key for building evolvable conversational agent in real world scenarios.