1 code implementation • 9 May 2024 • Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li
Sora unveils the potential of scaling Diffusion Transformer for generating photorealistic images and videos at arbitrary resolutions, aspect ratios, and durations, yet it still lacks sufficient implementation details.
1 code implementation • 26 Apr 2024 • Tao Liu, Yuhang Zhang, Zhu Feng, Zhiqin Yang, Chen Xu, Dapeng Man, Wu Yang
The trained backdoored global model is more resilient to benign updates, leading to a higher attack success rate on the test set.
3 code implementations • 14 Mar 2024 • Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng
This design enhances the unification of model structure while improving application efficiency in terms of storage overhead.
no code implementations • 23 Jan 2024 • Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng
Facial expression recognition (FER) models are typically trained on datasets with a fixed number of seven basic classes.
Facial Expression Recognition (FER)
no code implementations • 12 Jan 2024 • Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang
Vision-language foundation models, represented by Contrastive Language-Image Pre-training (CLIP), have gained increasing attention for jointly understanding both vision and textual tasks.
no code implementations • 28 Dec 2023 • Yuhang Zhang, Yuang Deng, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Qi Tian
In DeLR, the query is made at the region level, and we only annotate the object region that is queried; 2) instead of directly providing both localization and recognition annotations, we query the two components separately, and thus reduce the recognition budget with the pseudo class labels provided by the model.
no code implementations • 22 Dec 2023 • Xuannan Liu, Yaoyao Zhong, Xing Cui, Yuhang Zhang, Peipei Li, Weihong Deng
This strategy initially focuses on adapting the masks to the unique individual faces via image-specific training and then enhances their feature-level generalization ability to diverse facial variations of individuals via person-specific training.
no code implementations • 8 Dec 2023 • Jiaping Xiao, Rangya Zhang, Yuhang Zhang, Mir Feroskhan
Drones as advanced cyber-physical systems are undergoing a transformative shift with the advent of vision-based learning, a field that is rapidly gaining prominence due to its profound impact on drone autonomy and functionality.
no code implementations • 18 Oct 2023 • Yuhang Zhang, Marcos Quinones-Grueiro, Zhiyao Zhang, Yanbing Wang, William Barbour, Gautam Biswas, Daniel Work
Variable Speed Limit (VSL) control acts as a promising highway traffic management strategy with worldwide deployment, which can enhance traffic safety by dynamically adjusting speed limits according to real-time traffic conditions.
no code implementations • 12 Oct 2023 • Zhao Ning Zou, Yuhang Zhang, Robert Wijaya
We studied this issue by measuring the performance of DETR with different experiments and benchmarking the network with convolutional neural network (CNN) based detectors like YOLO and Faster-RCNN.
no code implementations • 28 Sep 2023 • Yuhang Zhang, Yue Liu, Zhihua Zhang
Motivated by the synthetic control method, we construct a synthetic treatment group for the target population by a weighted mixture of treatment groups of source populations.
1 code implementation • 27 Sep 2023 • Wenjie Li, Mei Wang, Kai Zhang, Juncheng Li, Xiaoming Li, Yuhang Zhang, Guangwei Gao, Weihong Deng, Chia-Wen Lin
We also discuss notable benchmarks commonly utilized in the field.
1 code implementation • 25 Sep 2023 • Muxin Liao, Shishun Tian, Yuhang Zhang, Guoguang Hua, Wenbin Zou, Xia Li
Based on these observations, a calibration-based dual prototypical contrastive learning (CDPCL) approach is proposed to reduce the domain discrepancy between the learned class-wise features and the prototypes of different domains for domain generalization semantic segmentation.
1 code implementation • journal 2023 • Saining Zhang, Yuhang Zhang, Ye Zhang, YuFei Wang, Zhigang Song
In recent years, facial expression recognition (FER) has garnered significant attention within the realm of computer vision research.
Ranked #1 on Facial Expression Recognition (FER) on AffectNet
Facial Expression Recognition (FER)
1 code implementation • ICCV 2023 • Xuannan Liu, Yaoyao Zhong, Yuhang Zhang, Lixiong Qin, Weihong Deng
Deep neural networks are vulnerable to universal adversarial perturbation (UAP), an instance-agnostic perturbation capable of fooling the target model for most samples.
no code implementations • 17 Jun 2023 • Suyash C. Vishnoi, Junyi Ji, MirSaleh Bahavarnia, Yuhang Zhang, Ahmad F. Taha, Christian G. Claudel, Daniel B. Work
The effectiveness of the proposed traffic control algorithms is tested using a traffic control example and compared with existing proportional-integral (PI)- and model predictive control (MPC)-based controllers from the literature.
1 code implementation • 23 Apr 2023 • Yue Hu, Yuhang Zhang, Yanbing Wang, Daniel Work
In this work, we consider the problem of detecting a variety of socially abnormal driving behaviors, i.e., behaviors that do not conform to the behavior of other nearby drivers.
no code implementations • 16 Feb 2023 • Yuhang Zhang, Weihong Deng, Liang Zheng
We further provide interesting analyses of the effects of backbones and IND/OOD datasets on OOD detection performance.
Out-of-Distribution (OOD) Detection
no code implementations • 29 Dec 2022 • Yuhang Zhang, Shishun Tian, Muxin Liao, Zhengyu Zhang, Wenbin Zou, Chen Xu
In this paper, we propose a class-wise non-salient region generalized (CNSG) framework for the VGSS task.
1 code implementation • 2 Dec 2022 • Yuhang Zhang, Weihong Deng, Xingchen Cui, Yunfeng Yin, Hongzhi Shi, Dongchao Wen
We introduce mean point ensemble to utilize a more robust loss function and more information from unselected samples to reduce error accumulation from the model perspective.
1 code implementation • 21 Jul 2022 • Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng
We find that FER models remember noisy samples by focusing on a part of the features that can be considered related to the noisy labels instead of learning from the whole features that lead to the latent truth.
Facial Expression Recognition (FER)
no code implementations • 29 Jun 2022 • Yuhang Zhang, Yulian Jiang, Shenquan Wang
In this article, the observer-based coordinated tracking control problem for a class of nonlinear multi-agent systems (MASs) with intermittent communication and information constraints is studied under dynamic switching topology.
no code implementations • 22 Jan 2022 • Siyan Li, Yue Xiao, Yuhang Zhang, Lei Chu, Robert C. Qiu
It is a challenging problem to detect and recognize targets on complex large-scene Synthetic Aperture Radar (SAR) images.
no code implementations • CVPR 2022 • Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian
The Yes query is treated as positive pairs of the queried category for contrastive pulling, while the No query is treated as hard negative pairs for contrastive repelling.
1 code implementation • NeurIPS 2021 • Yuhang Zhang, Chengrui Wang, Weihong Deng
To quantify these uncertainties and achieve good performance under noisy data, we regard uncertainty as a relative concept and propose an innovative uncertainty learning method called Relative Uncertainty Learning (RUL).
Ranked #12 on Facial Expression Recognition (FER) on RAF-DB
Facial Expression Recognition (FER)
no code implementations • 16 May 2021 • Yuhang Zhang, Xiaopeng Zhang, Robert C. Qiu, Jie Li, Haohang Xu, Qi Tian
Semi-supervised learning acts as an effective way to leverage massive unlabeled data.
no code implementations • 16 Feb 2021 • Yuhang Zhang, Yao Mu, Yujie Yang, Yang Guan, Shengbo Eben Li, Qi Sun, Jianyu Chen
Reinforcement learning has shown great potential in developing high-level autonomous driving.
1 code implementation • IEEE Biomedical Circuits and Systems (BIOCAS) 2019 • Yi Ma, Xinzi Xu, Qing Yu, Yuhang Zhang, Yongfu Li, Jian Zhao, Guoxing Wang
Improving access to health care services for the medically under-served population is vital to ensure that critical illness can be addressed immediately.
Ranked #15 on Audio Classification on ICBHI Respiratory Sound Database
no code implementations • 25 Sep 2019 • Yuhang Zhang, Zhenwei Miao, Tiebin Mi, Robert Caiming Qiu
Three-dimensional data, such as point clouds, are often composed of three coordinates with few features.
no code implementations • 16 Jan 2018 • Zenan Ling, Robert C. Qiu, Zhijian Jin, Yuhang Zhang, Xing He, Haichun Liu, Lei Chu
Locating broken insulators in aerial images is a challenging task.