Search Results for author: Xu Zhao

Found 46 papers, 18 papers with code

Konwledge-Enabled Diagnosis Assistant Based on Obstetric EMRs and Knowledge Graph

no code implementations • CCL 2020 • Kunli Zhang, Xu Zhao, Lei Zhuang, Qi Xie, Hongying Zan

In this paper, we treat the diagnosis assistant as a multi-label classification task and propose a Knowledge-Enabled Diagnosis Assistant (KEDA) model for the obstetric diagnosis assistant.

Disease Prediction Multi-Label Classification

Paper
Add Code

Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs

no code implementations • 27 Apr 2024 • Yiming Bao, Xu Zhao, Dahong Qian

On Total Capture dataset, the pose estimation error is significantly decreased compared to the baseline method.

3D Human Pose Estimation

Paper
Add Code

MESA: Matching Everything by Segmenting Anything

no code implementations • 30 Jan 2024 • Yesheng Zhang, Xu Zhao

However, the pervasive presence of matching redundancy between images gives rise to unnecessary and error-prone computations in these methods, imposing limitations on their accuracy.

Image Segmentation Pose Estimation +1

Paper
Add Code

An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction

no code implementations • 30 Jan 2024 • Baoxing Li, Yong Deng, Yehui Yang, Xu Zhao

In recent years, a combination of parametric body models (such as SMPL) that capture body pose and shape prior, and neural implicit functions that learn flexible clothing details, has been used to integrate the advantages of both approaches.

Surface Reconstruction

Paper
Add Code

Fluctuation-based Adaptive Structured Pruning for Large Language Models

1 code implementation • 19 Dec 2023 • Yongqi An, Xu Zhao, Tao Yu, Ming Tang, Jinqiao Wang

Retraining-free is important for LLMs' pruning methods.

Network Pruning

Paper
Code

RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling

no code implementations • 24 Nov 2023 • Xiaoyue Wan, Zhuo Chen, Yiming Bao, Xu Zhao

This perception is injected by the Pose Transformer network and learned through a pre-training task that recovers iterative masked joints.

3D Human Pose Estimation 3D Reconstruction +1

Paper
Add Code

InstructCoder: Instruction Tuning Large Language Models for Code Editing

1 code implementation • 31 Oct 2023 • Kaixin Li, Qisheng Hu, Xu Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Qizhe Xie, Junxian He

In this work, we explore the use of Large Language Models (LLMs) to edit code based on user instructions.

Paper
Code

Disentangled Counterfactual Reasoning for Unbiased Sequential Recommendation

no code implementations • 5 Aug 2023 • Yi Ren, Xu Zhao, Hongyan Tang, Shuai Li

In this paper, we propose a structural causal model-based method to address the popularity bias issue for sequential recommendation model learning.

counterfactual Counterfactual Reasoning +1

Paper
Add Code

Fast Segment Anything

1 code implementation • 21 Jun 2023 • Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang, Jinqiao Wang

In this paper, we propose a speed-up alternative method for this fundamental task with comparable performance.

Ranked #4 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Edge Detection Image Segmentation +6

6,941

Paper
Code

CodeInstruct: Empowering Language Models to Edit Code

1 code implementation • Github 2023 • Qisheng Hu*, Kaixin Li*, Xu Zhao, Yuxi Xie, Tiedong Liu, Hui Chen, Qizhe Xie, Junxian He

In this work, we explore the use of large language models (LLMs) to edit code based on user instructions, covering a broad range of implicit tasks such as comment insertion, code optimization, and code refactoring.

Paper
Code

Self-Evaluation Guided Beam Search for Reasoning

no code implementations • NeurIPS 2023 • Yuxi Xie, Kenji Kawaguchi, Yiran Zhao, Xu Zhao, Min-Yen Kan, Junxian He, Qizhe Xie

Stochastic beam search balances exploitation and exploration of the search space with temperature-controlled randomness.

Arithmetic Reasoning GSM8K +3

Paper
Add Code

Searching from Area to Point: A Hierarchical Framework for Semantic-Geometric Combined Feature Matching

1 code implementation • 29 Apr 2023 • Yesheng Zhang, Xu Zhao, Dahong Qian

This paper, thus, pays attention to the search space and proposes to set the initial search space for point matching as the matched image areas containing prominent semantic, named semantic area matches.

Pose Estimation

Paper
Code

TorchBench: Benchmarking PyTorch with High API Surface Coverage

1 code implementation • 27 Apr 2023 • Yueming Hao, Xu Zhao, Bin Bao, David Berard, Will Constable, Adnan Aziz, Xu Liu

TorchBench is able to comprehensively characterize the performance of the PyTorch software stack, guiding the performance optimization across models, PyTorch framework, and GPU libraries.

Benchmarking Vocal Bursts Intensity Prediction

791

Paper
Code

FreConv: Frequency Branch-and-Integration Convolutional Networks

no code implementations • 10 Apr 2023 • Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxin Gao, Yuting Yang, Ming Tang, Jinqiao Wang

In the high-frequency branch, a derivative-filter-like architecture is designed to extract the high-frequency information while a light extractor is employed in the low-frequency branch because the low-frequency information is usually redundant.

Paper
Add Code

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection

1 code implementation • CVPR 2023 • Yongqi An, Xu Zhao, Tao Yu, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang

However, previous unsupervised deep learning BGS algorithms perform poorly in sophisticated scenarios such as shadows or night lights, and they cannot detect objects outside the pre-defined categories.

Foreground Segmentation Object +2

Paper
Code

Item Cold Start Recommendation via Adversarial Variational Auto-encoder Warm-up

no code implementations • 28 Feb 2023 • Shenzheng Zhang, Qi Tan, Xinzhi Zheng, Yi Ren, Xu Zhao

The gap between the randomly initialized item ID embedding and the well-trained warm item ID embedding makes the cold items hard to suit the recommendation system, which is trained on the data of historical warm items.

News Recommendation

Paper
Add Code

Slate-Aware Ranking for Recommendation

1 code implementation • 24 Feb 2023 • Yi Ren, Xiao Han, Xu Zhao, Shenzheng Zhang, Yan Zhang

Therefore, the ranking stage is still essential for most applications to provide high-quality candidate set for the re-ranking stage.

Recommendation Systems Re-Ranking

Paper
Code

View Consistency Aware Holistic Triangulation for 3D Human Pose Estimation

no code implementations • 22 Feb 2023 • Xiaoyue Wan, Zhuo Chen, Xu Zhao

The rapid development of multi-view 3D human pose estimation (HPE) is attributed to the maturation of monocular 2D HPE and the geometry of 3D reconstruction.

3D Human Pose Estimation 3D Reconstruction +1

Paper
Add Code

Balanced Audiovisual Dataset for Imbalance Analysis

1 code implementation • 14 Feb 2023 • Wenke Xia, Xu Zhao, Xincheng Pang, Changqing Zhang, Di Hu

We surprisingly find that: the multimodal models with existing imbalance algorithms consistently perform worse than the unimodal one on specific subsets, in accordance with the modality bias.

201

Paper
Code

Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection

no code implementations • ICCV 2023 • Zixuan Zhao, Dongqi Wang, Xu Zhao

First, the submergence of movement feature, i. e. the movement information in a snippet is covered by the scene information.

Action Detection

Paper
Add Code

Does Deep Learning REALLY Outperform Non-deep Machine Learning for Clinical Prediction on Physiological Time Series?

no code implementations • 11 Nov 2022 • Ke Liao, Wei Wang, Armagan Elibol, Lingzhong Meng, Xu Zhao, Nak Young Chong

In this paper, we systematically examine the performance of machine learning models for the clinical prediction task based on the EHR, especially physiological time series.

Time Series Time Series Analysis

Paper
Add Code

Transfering Low-Frequency Features for Domain Adaptation

no code implementations • 31 Aug 2022 • Zhaowen Li, Xu Zhao, Chaoyang Zhao, Ming Tang, Jinqiao Wang

Previous unsupervised domain adaptation methods did not handle the cross-domain problem from the perspective of frequency for computer vision.

Image Classification object-detection +2

Paper
Add Code

FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation

no code implementations • 25 Aug 2022 • Yiming Bao, Xu Zhao, Dahong Qian

On Total Capture dataset, KineFuse surpasses previous state-of-the-art which uses IMU only for testing by 8. 6\%.

Ranked #2 on 3D Human Pose Estimation on Total Capture

3D Human Pose Estimation Sensor Fusion

Paper
Add Code

Improving Item Cold-start Recommendation via Model-agnostic Conditional Variational Autoencoder

1 code implementation • 27 May 2022 • Xu Zhao, Yi Ren, Ying Du, Shenzheng Zhang, Nian Wang

This paper attempts to tackle the item cold-start problem by generating enhanced warmed-up ID embeddings for cold items with historical data and limited interaction records.

Decoder News Recommendation +1

Paper
Code

ETAD: Training Action Detection End to End on a Laptop

1 code implementation • 14 May 2022 • Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem

We propose to sequentially forward the snippet frame through the video encoder, and backward only a small necessary portion of gradients to update the encoder.

Action Detection Video Understanding

Paper
Code

Learning-Based Framework for Camera Calibration with Distortion Correction and High Precision Feature Detection

1 code implementation • 1 Feb 2022 • Yesheng Zhang, Xu Zhao, Dahong Qian

Therefore, in this paper, we propose a hybrid camera calibration framework which combines learning-based approaches with traditional methods to handle these bottlenecks.

Camera Calibration

Paper
Code

Pruning-aware Sparse Regularization for Network Pruning

1 code implementation • 18 Jan 2022 • Nanfei Jiang, Xu Zhao, Chaoyang Zhao, Yongqi An, Ming Tang, Jinqiao Wang

MaskSparsity imposes the fine-grained sparse regularization on the specific filters selected by a pruning mask, rather than all the filters of the model.

Network Pruning

Paper
Code

Estimate Metabolite Taxonomy and Structure with a Fragment-Centered Database and Fragment Network

no code implementations • 11 Jan 2021 • Hansen Zhao, Xu Zhao, Huan Yao, Jiaxin Feng, Sichun Zhang, Xinrong Zhang

Metabolite structure identification has become the major bottleneck of the mass spectrometry based metabolomics research.

Paper
Add Code

Adaptive Tree Wasserstein Minimization for Hierarchical Generative Modeling

no code implementations • 1 Jan 2021 • ZiHao Wang, Xu Zhao, Tam Le, Hao Wu, Yong Zhang, Makoto Yamada

In this work, we consider OT over tree metrics, which is more general than the sliced Wasserstein and includes the sliced Wasserstein as a special case, and we propose a fast minimization algorithm in $O(n)$ for the optimal Wasserstein-1 transport plan between two distributions in the tree structure.

Unsupervised Domain Adaptation

Paper
Add Code

An End to End Network Architecture for Fundamental Matrix Estimation

no code implementations • 29 Oct 2020 • Yesheng Zhang, Xu Zhao, Dahong Qian

In this paper, we present a novel end-to-end network architecture to estimate fundamental matrix directly from stereo images.

Paper
Add Code

Task Decoupled Knowledge Distillation For Lightweight Face Detectors

1 code implementation • 14 Oct 2020 • Xiaoqing Liang, Xu Zhao, Chaoyang Zhao, Nanfei Jiang, Ming Tang, Jinqiao Wang

This method decouples the distillation task of face detection into two subtasks, i. e., the classification distillation subtask and the regression distillation subtask.

Face Detection Knowledge Distillation +1

Paper
Code

Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction

1 code implementation • EMNLP 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang

In this paper, we propose a new semi-supervised BLI framework to encourage the interaction between the supervised signal and unsupervised alignment.

Bilingual Lexicon Induction Vocal Bursts Valence Prediction

Paper
Code

A Relaxed Matching Procedure for Unsupervised BLI

no code implementations • ACL 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang

Recently unsupervised Bilingual Lexicon Induction (BLI) without any parallel corpus has attracted much research interest.

Bilingual Lexicon Induction Translation

Paper
Add Code

Parameter Sharing Decoder Pair for Auto Composing

no code implementations • 31 Oct 2019 • Xu Zhao

Auto Composing is an active and appealing research area in the past few years, and lots of efforts have been put into inventing more robust models to solve this problem.

Decoder

Paper
Add Code

Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2

no code implementations • 29 Jul 2019 • Haisheng Su, Xu Zhao, Shuming Liu

This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2019 Task 1 (\textbf{temporal action proposal generation}) and Task 2 (\textbf{temporal action localization/detection}).

Re-Ranking Task 2 +1

Paper
Add Code

EdgeStereo: An Effective Multi-Task Learning Network for Stereo Matching and Edge Detection

no code implementations • 5 Mar 2019 • Xiao Song, Xu Zhao, Liangji Fang, Hanwen Hu

EdgeStereo also achieves comparable generalization performance for disparity estimation because of the incorporation of edge cues.

Disparity Estimation Edge Detection +3

Paper
Add Code

A Tangent Distance Preserving Dimensionality Reduction Algorithm

no code implementations • 4 Feb 2019 • Xu Zhao, Zongli Jiang

TDPM uses tangent distance instead of geodesic distance, and then applies MDS to the tangent distance matrix to map the manifold into a low dimensional space in which we can get its nonlinear structure.

Dimensionality Reduction

Paper
Add Code

Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization

no code implementations • 28 Oct 2018 • Haisheng Su, Xu Zhao, Tianwei Lin

Weakly supervised temporal action localization, which aims at temporally locating action instances in untrimmed videos using only video-level class labels during training, is an important yet challenging problem in video analysis.

General Classification Video Classification +2

Paper
Add Code

Discriminative Representation Combinations for Accurate Face Spoofing Detection

no code implementations • 27 Aug 2018 • Xiao Song, Xu Zhao, Liangji Fang, Tianwei Lin

Secondly we utilize the SSD, which is a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection.

Face Presentation Attack Detection

Paper
Add Code

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

17 code implementations • ECCV 2018 • Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang

Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and high proportion irrelevant content.

Ranked #3 on Temporal Action Proposal Generation on THUMOS' 14

Action Detection Temporal Action Proposal Generation

6,703

Paper
Code

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

no code implementations • 14 Mar 2018 • Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang

Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task.

Disparity Estimation Edge Detection +2

Paper
Add Code

Face Spoofing Detection by Fusing Binocular Depth and Spatial Pyramid Coding Micro-Texture Features

no code implementations • 13 Mar 2018 • Xiao Song, Xu Zhao, Tianwei Lin

The second one is a high-level micro-texture based feature called Spatial Pyramid Coding Micro-Texture (SPMT) feature.

Paper
Add Code

Single Shot Temporal Action Detection

2 code implementations • 17 Oct 2017 • Tianwei Lin, Xu Zhao, Zheng Shou

The main drawback of this framework is that the boundaries of action instance proposals have been fixed during the classification step.

Action Detection General Classification

Paper
Code

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

3 code implementations • ICCV 2017 • Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, Yi Wu, Hanqing Lu

To fully explore the local and global properties, in this paper, we propose a novel fully convolutional network, named as CoupleNet, to couple the global structure with local parts for object detection.

Ranked #5 on Object Detection on PASCAL VOC 2007

Object object-detection +3

Paper
Code

Joint Background Reconstruction and Foreground Segmentation via A Two-stage Convolutional Neural Network

no code implementations • 24 Jul 2017 • Xu Zhao, Yingying Chen, Ming Tang, Jinqiao Wang

In the first stage, a convolutional encoder-decoder sub-network is employed to reconstruct the background images and encode rich prior knowledge of background scenes.

Decoder Foreground Segmentation +1

Paper
Add Code

Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017

no code implementations • 21 Jul 2017 • Tianwei Lin, Xu Zhao, Zheng Shou

Our approach achieves the state-of-the-art performances on both temporal action proposal task and temporal action localization task.

Ranked #11 on Temporal Action Proposal Generation on ActivityNet-1.3

Action Classification General Classification +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.