no code implementations • 15 Feb 2024 • Wenxiao Wang, Wei Chen, Yicong Luo, Yongliu Long, Zhengkai Lin, Liye Zhang, Binbin Lin, Deng Cai, Xiaofei He
However, Large language models have two prominent characteristics compared to smaller models: (1) Most of compression algorithms require finetuning or even retraining the model after compression.
1 code implementation • 30 Oct 2023 • Hengjia Li, Yang Liu, Linxuan Xia, Yuqi Lin, Tu Zheng, Zheng Yang, Wenxiao Wang, Xiaohui Zhong, Xiaobo Ren, Xiaofei He
Concretely, the distance loss blends the attributes of all target domains by reducing the distances from generated images to all target subspaces.
1 code implementation • 12 Oct 2023 • Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang
In the context of autonomous driving, the significance of effective feature learning is widely acknowledged.
no code implementations • 29 Sep 2023 • Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang, Xiaofei He
Recent works considering professional legal-linguistic style (PLLS) texts have shown promising results on the charge prediction task.
no code implementations • 23 Sep 2023 • Qibo Qiu, Honghui Yang, Wenxiao Wang, Shun Zhang, Haiming Gao, Haochao Ying, Wei Hua, Xiaofei He
Specifically, with masked point cloud as input, M$^3$CS introduces two decoders to predict masked representations and the original points simultaneously.
no code implementations • 1 Aug 2023 • Minghao Chen, Zepeng Gao, Shuai Zhao, Qibo Qiu, Wenxiao Wang, Binbin Lin, Xiaofei He
Unsupervised domain adaptation (UDA) methods facilitate the transfer of models to target domains without labels.
no code implementations • 1 Jun 2023 • Qibo Qiu, Haiming Gao, Wenxiao Wang, Zhiyi Su, Tian Xie, Wei Hua, Xiaofei He
To enhance message passing along particular axes, Stacked Asymmetric Convolution Block (SACB) is designed, which is one of the main contributions in this paper.
1 code implementation • CVPR 2023 • Honghui Yang, Wenxiao Wang, Minghao Chen, Binbin Lin, Tong He, Hua Chen, Xiaofei He, Wanli Ouyang
The key to associating the two different representations is our introduced input-dependent Query Initialization module, which could efficiently generate reference points and content queries.
no code implementations • 3 May 2023 • Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu
In contrast, modalities that do not cause users' behaviors are potential noises and might mislead the learning of a recommendation model.
1 code implementation • 13 Mar 2023 • Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu
On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.
no code implementations • 20 Feb 2023 • Liang Xie, Yibo Yang, Wenxiao Wang, Binbin Lin, Deng Cai, Xiaofei He, Ronghua Liang
Compared to 2D images, 3D point clouds are much more sensitive to rotations.
1 code implementation • CVPR 2023 • Yuqi Lin, Minghao Chen, Wenxiao Wang, Boxi Wu, Ke Li, Binbin Lin, Haifeng Liu, Xiaofei He
To efficiently generate high-quality segmentation masks from CLIP, we propose a novel WSSS framework called CLIP-ES.
Ranked #12 on Weakly-Supervised Semantic Segmentation on COCO 2014 val
1 code implementation • CVPR 2023 • Honghui Yang, Tong He, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang
In contrast to previous 3D MAE frameworks, which either design a complex decoder to infer masked information from maintained regions or adopt sophisticated masking strategies, we instead propose a much simpler paradigm.
1 code implementation • 15 Sep 2022 • Qibo Qiu, Haiming Gao, Wei Hua, Gang Huang, Xiaofei He
Lane detection is one of the fundamental modules in self-driving.
no code implementations • 29 Aug 2022 • Boxi Wu, Jie Jiang, Haidong Ren, Zifan Du, Wenxiao Wang, Zhifeng Li, Deng Cai, Xiaofei He, Binbin Lin, Wei Liu
Various training criteria for these auxiliary outliers are proposed based on heuristic intuitions.
no code implementations • 17 Aug 2022 • Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-Seng Chua, Fei Wu
Micro-video recommender systems suffer from the ubiquitous noises in users' behaviors, which might render the learned user representation indiscriminating, and lead to trivial recommendations (e. g., popular items) or even weird ones that are far beyond users' interests.
1 code implementation • 7 Aug 2022 • Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai
The dynamic farthest voxel sampling is then applied to evenly sample the points.
1 code implementation • 1 Aug 2022 • Xing Zhao, Haoran Liang, Peipei Li, Guodao Sun, Dongdong Zhao, Ronghua Liang, Xiaofei He
Moreover, inspired by the boundary supervision commonly used in image salient object detection (ISOD), we design a motion-aware loss for predicting object boundary motion and simultaneously perform multitask learning for VSOD and object motion prediction, which can further facilitate the model to extract spatiotemporal features accurately and maintain the object integrity.
no code implementations • 21 Jul 2022 • Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu
Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.
no code implementations • 19 Apr 2022 • Liang Xie, Yibo Yang, Deng Cai, Xiaofei He
Class imbalance distribution widely exists in real-world engineering.
3 code implementations • CVPR 2022 • Tu Zheng, Yifei HUANG, Yang Liu, Wenjian Tang, Zheng Yang, Deng Cai, Xiaofei He
In this way, we can exploit more contextual information to detect lanes while leveraging local detailed lane features to improve localization accuracy.
Ranked #1 on Lane Detection on LLAMAS
1 code implementation • ICLR 2022 • Liang Peng, Senbo Yan, Boxi Wu, Zheng Yang, Xiaofei He, Deng Cai
This network is learned by minimizing our newly-proposed 3D alignment loss between the 3D box estimates and the corresponding RoI LiDAR points.
1 code implementation • 21 Feb 2022 • Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He
In this paper, the VLAD aggregation method is adopted to quantize local features with visual vocabulary locally partitioning the feature space, and hence preserve the local discriminability.
no code implementations • CVPR 2022 • Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He
In addition, we also explore zero-shot translation in sign language and find that our model can achieve comparable performance to the supervised BSLT model on some language pairs.
no code implementations • 8 Dec 2021 • Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He
Sign language translation as a kind of technology with profound social significance has attracted growing researchers' interest in recent years.
no code implementations • 29 Sep 2021 • Liang Peng, Senbo Yan, Chenxi Huang, Xiaofei He, Deng Cai
This characteristic indicates that monocular 3D detection is inherently different from other typical detection tasks that have the same dimensional input and output.
1 code implementation • 26 Sep 2021 • Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu
In this work, inspired by the fact that users make their click decisions mostly based on the visual impression they perceive when browsing news, we propose to capture such visual impression information with visual-semantic modeling for news recommendation.
no code implementations • 31 Aug 2021 • Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He
Lip reading, aiming to recognize spoken sentences according to the given video of lip movements without relying on the audio stream, has attracted great interest due to its application in many scenarios.
3 code implementations • ICLR 2022 • Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu
On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.
Ranked #42 on Semantic Segmentation on ADE20K val
1 code implementation • CVPR 2022 • Yang Liu, Weifeng Zhang, Chao Xiang, Tu Zheng, Deng Cai, Xiaofei He
Few-shot learning (FSL) aims to learn a classifier that can be easily adapted to accommodate new tasks not seen during training, given only a few examples.
1 code implementation • ICCV 2021 • Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He
In this paper, we study the Salient Object Ranking (SOR) task, which manages to assign a ranking order of each detected object according to its visual saliency.
no code implementations • 9 Jun 2021 • Boxi Wu, Heng Pan, Li Shen, Jindong Gu, Shuai Zhao, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu
In this work, we find that the adversarial attacks can also be vulnerable to small perturbations.
no code implementations • 29 Apr 2021 • Xin Guo, Zhongming Jin, Chong Chen, Helei Nie, Jianqiang Huang, Deng Cai, Xiaofei He, Xiansheng Hua
In this paper, we propose a DiscRiminative-gEnerative duAl Memory (DREAM) anomaly detection model to take advantage of a few anomalies and solve data imbalance.
no code implementations • 13 Apr 2021 • Liang Peng, Fei Liu, Senbo Yan, Xiaofei He, Deng Cai
Image-only and pseudo-LiDAR representations are commonly used for monocular 3D object detection.
no code implementations • 24 Mar 2021 • Liang Xie, Guodong Xu, Deng Cai, Xiaofei He
3D object detection algorithms for autonomous driving reason about 3D obstacles either from 3D birds-eye view or perspective view or both.
no code implementations • 15 Mar 2021 • Yang Liu, Tu Zheng, Jie Song, Deng Cai, Xiaofei He
In this paper, we argue that a Mutual Nearest Neighbor (MNN) relation should be established to explicitly select the query descriptors that are most relevant to each task and discard less relevant ones from aggregative clutters in FSL.
no code implementations • 10 Mar 2021 • Dong Shen, Shuai Zhao, Jinming Hu, Hao Feng, Deng Cai, Xiaofei He
In this paper, we propose a novel network, Erasing-Salient Net (ES-Net), to learn comprehensive features by erasing the salient areas in an image.
1 code implementation • 15 Oct 2020 • Jia Guo, Minghao Chen, Yao Hu, Chen Zhu, Xiaofei He, Deng Cai
We investigate this problem by study the gap of confidence between teacher and student.
no code implementations • 10 Oct 2020 • Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu
Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions.
1 code implementation • NeurIPS 2021 • Boxi Wu, Jinghui Chen, Deng Cai, Xiaofei He, Quanquan Gu
Previous empirical results suggest that adversarial training requires wider networks for better performances.
no code implementations • 14 Aug 2020 • Zhengxu Yu, Yilun Zhao, Bin Hong, Zhongming Jin, Jianqiang Huang, Deng Cai, Xiaofei He, Xian-Sheng Hua
Therefore, it is critical to learn an apparel-invariant person representation under cases like cloth changing or several persons wearing similar clothes.
no code implementations • 30 Jul 2020 • Xin Guo, Zhengxu Yu, Chao Xiang, Zhongming Jin, Jianqiang Huang, Deng Cai, Xiaofei He, Xian-Sheng Hua
Most deep-learning-based image classification methods assume that all samples are generated under an independent and identically distributed (IID) setting.
no code implementations • 14 Nov 2019 • Liang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai, Xiaofei He
Moreover, based on the PACF module, we propose a 3D multi-sensor multi-task network called Pointcloud-Image RCNN(PI-RCNN as brief), which handles the image segmentation and 3D object detection tasks.
1 code implementation • 24 Jul 2019 • Jun Zhao, Zhou Zhou, Ziyu Guan, Wei Zhao, Wei Ning, Guang Qiu, Xiaofei He
In this work, we collect abundant relationships from common user behaviors and item information, and propose a novel framework named IntentGC to leverage both explicit preferences and heterogeneous relationships by graph convolutional networks.
no code implementations • NeurIPS 2018 • Boyuan Pan, Yazheng Yang, Hao Li, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He
In this paper, we transfer knowledge learned from machine comprehension to the sequence-to-sequence tasks to deepen the understanding of the text.
1 code implementation • ACL 2018 • Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He
We observe that people usually use some discourse markers such as "so" or "but" to represent the logical relationship between two sentences.
Ranked #13 on Natural Language Inference on SNLI
no code implementations • 28 Jun 2019 • Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He
Concretely, we first develop a hierarchical convolutional self-attention encoder to efficiently model long-form video contents, which builds the hierarchical structure for video sequences and captures question-aware long-range dependencies from video context.
1 code implementation • 25 Jun 2019 • Wenxiao Wang, Cong Fu, Jishun Guo, Deng Cai, Xiaofei He
2) Cross-layer filter comparison is unachievable since the importance is defined locally within each layer.
no code implementations • 1 Mar 2018 • Jun Zhao, Guang Qiu, Ziyu Guan, Wei Zhao, Xiaofei He
In this paper, we consider the RTB problem in sponsored search auction, named SS-RTB.
no code implementations • SIGIR 2018 • Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, Xiaofei He
Dialogue Act Recognition (DAR) is a challenging problem in dialogue interpretation, which aims to attach semantic labels to utterances and characterize the speaker's intention.
no code implementations • 1 Nov 2017 • Boyuan Pan, Hao Li, Zhou Zhao, Deng Cai, Xiaofei He
In this paper, we propose a novel neural network system that consists a Demand Optimization Model based on a passage-attention neural machine translation and a Reader Model that can find the answer given the optimized question.
no code implementations • 8 Oct 2017 • Zheqian Chen, Rongqin Yang, Bin Cao, Zhou Zhao, Deng Cai, Xiaofei He
Machine Comprehension (MC) is a challenging task in Natural Language Processing field, which aims to guide the machine to comprehend a passage and answer the given question.
Ranked #33 on Question Answering on SQuAD1.1 dev
2 code implementations • 12 Sep 2017 • Junying Li, Deng Cai, Xiaofei He
Molecules can be represented as an undirected graph, and we can utilize graph convolution networks to predication molecular properties.
Ranked #1 on Drug Discovery on HIV dataset
no code implementations • 28 Jul 2017 • Boyuan Pan, Hao Li, Zhou Zhao, Bin Cao, Deng Cai, Xiaofei He
Machine comprehension(MC) style question answering is a representative problem in natural language processing.
Ranked #37 on Question Answering on TriviaQA
1 code implementation • ICML 2017 • Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang
By noting that sparse SVMs induce sparsities in both feature and sample spaces, we propose a novel approach, which is based on accurate estimations of the primal and dual optima of sparse SVMs, to simultaneously identify the inactive features and samples that are guaranteed to be irrelevant to the outputs.
no code implementations • 1 May 2014 • Binbin Lin, Ji Yang, Xiaofei He, Jieping Ye
Based on our theoretical analysis, we propose to first learn the gradient field of the distance function and then learn the distance function itself.
no code implementations • 2 Apr 2013 • Lijun Zhang, Tianbao Yang, Rong Jin, Xiaofei He
Traditional algorithms for stochastic optimization require projecting the solution at each iteration into a given domain to ensure its feasibility.
no code implementations • NeurIPS 2012 • Binbin Lin, Sen yang, Chiyuan Zhang, Jieping Ye, Xiaofei He
MTVFL has the following key properties: (1) the vector fields we learned are close to the gradient fields of the prediction functions; (2) within each task, the vector field is required to be as parallel as possible which is expected to span a low dimensional subspace; (3) the vector fields from all tasks share a low dimensional subspace.
no code implementations • NeurIPS 2011 • Binbin Lin, Chiyuan Zhang, Xiaofei He
To achieve this goal, we show that the second order smoothness measures the linearity of the function, and the gradient field of a linear function has to be a parallel vector field.
no code implementations • IEEE Transactions on Pattern Analysis and Machine Intelligence 2011 • Deng Cai, Xiaofei He, Jiawei Han, Thomas S. Huang
In GNMF, an affinity graph is constructed to encode the geometrical information and we seek a matrix factorization, which respects the graph structure.