no code implementations • CCL 2020 • Kunli Zhang, Xu Zhao, Lei Zhuang, Qi Xie, Hongying Zan
In this paper, we treat the diagnosis assistant as a multi-label classification task and propose a Knowledge-Enabled Diagnosis Assistant (KEDA) model for the obstetric diagnosis assistant.
no code implementations • 27 Apr 2024 • Yiming Bao, Xu Zhao, Dahong Qian
On Total Capture dataset, the pose estimation error is significantly decreased compared to the baseline method.
no code implementations • 30 Jan 2024 • Yesheng Zhang, Xu Zhao
However, the pervasive presence of matching redundancy between images gives rise to unnecessary and error-prone computations in these methods, imposing limitations on their accuracy.
no code implementations • 30 Jan 2024 • Baoxing Li, Yong Deng, Yehui Yang, Xu Zhao
In recent years, a combination of parametric body models (such as SMPL) that capture body pose and shape prior, and neural implicit functions that learn flexible clothing details, has been used to integrate the advantages of both approaches.
1 code implementation • 19 Dec 2023 • Yongqi An, Xu Zhao, Tao Yu, Ming Tang, Jinqiao Wang
Retraining-free is important for LLMs' pruning methods.
no code implementations • 24 Nov 2023 • Xiaoyue Wan, Zhuo Chen, Yiming Bao, Xu Zhao
This perception is injected by the Pose Transformer network and learned through a pre-training task that recovers iterative masked joints.
1 code implementation • 31 Oct 2023 • Kaixin Li, Qisheng Hu, Xu Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Qizhe Xie, Junxian He
In this work, we explore the use of Large Language Models (LLMs) to edit code based on user instructions.
no code implementations • 5 Aug 2023 • Yi Ren, Xu Zhao, Hongyan Tang, Shuai Li
In this paper, we propose a structural causal model-based method to address the popularity bias issue for sequential recommendation model learning.
1 code implementation • 21 Jun 2023 • Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang, Jinqiao Wang
In this paper, we propose a speed-up alternative method for this fundamental task with comparable performance.
Ranked #4 on Zero-Shot Instance Segmentation on LVIS v1.0 val
1 code implementation • Github 2023 • Qisheng Hu*, Kaixin Li*, Xu Zhao, Yuxi Xie, Tiedong Liu, Hui Chen, Qizhe Xie, Junxian He
In this work, we explore the use of large language models (LLMs) to edit code based on user instructions, covering a broad range of implicit tasks such as comment insertion, code optimization, and code refactoring.
no code implementations • NeurIPS 2023 • Yuxi Xie, Kenji Kawaguchi, Yiran Zhao, Xu Zhao, Min-Yen Kan, Junxian He, Qizhe Xie
Stochastic beam search balances exploitation and exploration of the search space with temperature-controlled randomness.
1 code implementation • 29 Apr 2023 • Yesheng Zhang, Xu Zhao, Dahong Qian
This paper, thus, pays attention to the search space and proposes to set the initial search space for point matching as the matched image areas containing prominent semantic, named semantic area matches.
1 code implementation • 27 Apr 2023 • Yueming Hao, Xu Zhao, Bin Bao, David Berard, Will Constable, Adnan Aziz, Xu Liu
TorchBench is able to comprehensively characterize the performance of the PyTorch software stack, guiding the performance optimization across models, PyTorch framework, and GPU libraries.
no code implementations • 10 Apr 2023 • Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxin Gao, Yuting Yang, Ming Tang, Jinqiao Wang
In the high-frequency branch, a derivative-filter-like architecture is designed to extract the high-frequency information while a light extractor is employed in the low-frequency branch because the low-frequency information is usually redundant.
1 code implementation • CVPR 2023 • Yongqi An, Xu Zhao, Tao Yu, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang
However, previous unsupervised deep learning BGS algorithms perform poorly in sophisticated scenarios such as shadows or night lights, and they cannot detect objects outside the pre-defined categories.
no code implementations • 28 Feb 2023 • Shenzheng Zhang, Qi Tan, Xinzhi Zheng, Yi Ren, Xu Zhao
The gap between the randomly initialized item ID embedding and the well-trained warm item ID embedding makes the cold items hard to suit the recommendation system, which is trained on the data of historical warm items.
1 code implementation • 24 Feb 2023 • Yi Ren, Xiao Han, Xu Zhao, Shenzheng Zhang, Yan Zhang
Therefore, the ranking stage is still essential for most applications to provide high-quality candidate set for the re-ranking stage.
no code implementations • 22 Feb 2023 • Xiaoyue Wan, Zhuo Chen, Xu Zhao
The rapid development of multi-view 3D human pose estimation (HPE) is attributed to the maturation of monocular 2D HPE and the geometry of 3D reconstruction.
1 code implementation • 14 Feb 2023 • Wenke Xia, Xu Zhao, Xincheng Pang, Changqing Zhang, Di Hu
We surprisingly find that: the multimodal models with existing imbalance algorithms consistently perform worse than the unimodal one on specific subsets, in accordance with the modality bias.
no code implementations • ICCV 2023 • Zixuan Zhao, Dongqi Wang, Xu Zhao
First, the submergence of movement feature, i. e. the movement information in a snippet is covered by the scene information.
no code implementations • 11 Nov 2022 • Ke Liao, Wei Wang, Armagan Elibol, Lingzhong Meng, Xu Zhao, Nak Young Chong
In this paper, we systematically examine the performance of machine learning models for the clinical prediction task based on the EHR, especially physiological time series.
no code implementations • 31 Aug 2022 • Zhaowen Li, Xu Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang
Previous unsupervised domain adaptation methods did not handle the cross-domain problem from the perspective of frequency for computer vision.
no code implementations • 25 Aug 2022 • Yiming Bao, Xu Zhao, Dahong Qian
On Total Capture dataset, KineFuse surpasses previous state-of-the-art which uses IMU only for testing by 8. 6\%.
Ranked #2 on 3D Human Pose Estimation on Total Capture
1 code implementation • 27 May 2022 • Xu Zhao, Yi Ren, Ying Du, Shenzheng Zhang, Nian Wang
This paper attempts to tackle the item cold-start problem by generating enhanced warmed-up ID embeddings for cold items with historical data and limited interaction records.
1 code implementation • 14 May 2022 • Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem
We propose to sequentially forward the snippet frame through the video encoder, and backward only a small necessary portion of gradients to update the encoder.
1 code implementation • 1 Feb 2022 • Yesheng Zhang, Xu Zhao, Dahong Qian
Therefore, in this paper, we propose a hybrid camera calibration framework which combines learning-based approaches with traditional methods to handle these bottlenecks.
1 code implementation • 18 Jan 2022 • Nanfei Jiang, Xu Zhao, Chaoyang Zhao, Yongqi An, Ming Tang, Jinqiao Wang
MaskSparsity imposes the fine-grained sparse regularization on the specific filters selected by a pruning mask, rather than all the filters of the model.
no code implementations • 11 Jan 2021 • Hansen Zhao, Xu Zhao, Huan Yao, Jiaxin Feng, Sichun Zhang, Xinrong Zhang
Metabolite structure identification has become the major bottleneck of the mass spectrometry based metabolomics research.
no code implementations • 1 Jan 2021 • ZiHao Wang, Xu Zhao, Tam Le, Hao Wu, Yong Zhang, Makoto Yamada
In this work, we consider OT over tree metrics, which is more general than the sliced Wasserstein and includes the sliced Wasserstein as a special case, and we propose a fast minimization algorithm in $O(n)$ for the optimal Wasserstein-1 transport plan between two distributions in the tree structure.
no code implementations • 29 Oct 2020 • Yesheng Zhang, Xu Zhao, Dahong Qian
In this paper, we present a novel end-to-end network architecture to estimate fundamental matrix directly from stereo images.
1 code implementation • 14 Oct 2020 • Xiaoqing Liang, Xu Zhao, Chaoyang Zhao, Nanfei Jiang, Ming Tang, Jinqiao Wang
This method decouples the distillation task of face detection into two subtasks, i. e., the classification distillation subtask and the regression distillation subtask.
1 code implementation • EMNLP 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang
In this paper, we propose a new semi-supervised BLI framework to encourage the interaction between the supervised signal and unsupervised alignment.
no code implementations • ACL 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang
Recently unsupervised Bilingual Lexicon Induction (BLI) without any parallel corpus has attracted much research interest.
no code implementations • 31 Oct 2019 • Xu Zhao
Auto Composing is an active and appealing research area in the past few years, and lots of efforts have been put into inventing more robust models to solve this problem.
no code implementations • 29 Jul 2019 • Haisheng Su, Xu Zhao, Shuming Liu
This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2019 Task 1 (\textbf{temporal action proposal generation}) and Task 2 (\textbf{temporal action localization/detection}).
no code implementations • 5 Mar 2019 • Xiao Song, Xu Zhao, Liangji Fang, Hanwen Hu
EdgeStereo also achieves comparable generalization performance for disparity estimation because of the incorporation of edge cues.
no code implementations • 4 Feb 2019 • Xu Zhao, Zongli Jiang
TDPM uses tangent distance instead of geodesic distance, and then applies MDS to the tangent distance matrix to map the manifold into a low dimensional space in which we can get its nonlinear structure.
no code implementations • 28 Oct 2018 • Haisheng Su, Xu Zhao, Tianwei Lin
Weakly supervised temporal action localization, which aims at temporally locating action instances in untrimmed videos using only video-level class labels during training, is an important yet challenging problem in video analysis.
no code implementations • 27 Aug 2018 • Xiao Song, Xu Zhao, Liangji Fang, Tianwei Lin
Secondly we utilize the SSD, which is a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection.
17 code implementations • ECCV 2018 • Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang
Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and high proportion irrelevant content.
Ranked #3 on Temporal Action Proposal Generation on THUMOS' 14
no code implementations • 14 Mar 2018 • Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang
Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task.
no code implementations • 13 Mar 2018 • Xiao Song, Xu Zhao, Tianwei Lin
The second one is a high-level micro-texture based feature called Spatial Pyramid Coding Micro-Texture (SPMT) feature.
2 code implementations • 17 Oct 2017 • Tianwei Lin, Xu Zhao, Zheng Shou
The main drawback of this framework is that the boundaries of action instance proposals have been fixed during the classification step.
3 code implementations • ICCV 2017 • Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, Yi Wu, Hanqing Lu
To fully explore the local and global properties, in this paper, we propose a novel fully convolutional network, named as CoupleNet, to couple the global structure with local parts for object detection.
Ranked #5 on Object Detection on PASCAL VOC 2007
no code implementations • 24 Jul 2017 • Xu Zhao, Yingying Chen, Ming Tang, Jinqiao Wang
In the first stage, a convolutional encoder-decoder sub-network is employed to reconstruct the background images and encode rich prior knowledge of background scenes.
no code implementations • 21 Jul 2017 • Tianwei Lin, Xu Zhao, Zheng Shou
Our approach achieves the state-of-the-art performances on both temporal action proposal task and temporal action localization task.
Ranked #11 on Temporal Action Proposal Generation on ActivityNet-1.3