1 code implementation • 5 Apr 2024 • JunHao Chen, Xiang Li, Xiaojun Ye, Chao Li, Zhaoxin Fan, Hao Zhao
The definition of an IDEA is the composition of multimodal inputs including text, image, and 3D models.
no code implementations • 18 Mar 2024 • Mingjin Chen, JunHao Chen, Xiaojun Ye, Huan-ang Gao, Xiaoxue Chen, Zhaoxin Fan, Hao Zhao
In this paper, we propose a new method called Ultraman for fast reconstruction of textured 3D human models from a single image.
no code implementations • 11 Mar 2024 • Zhenbo Song, Wenhao Gao, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Jianfeng Lu
Extensive experiments demonstrate the efficacy of the degradation objective on state-of-the-art face restoration models.
1 code implementation • 5 Mar 2024 • Zhaoxin Fan, Runmin Jiang, Junhao Wu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu
3D medical image segmentation is a challenging task with crucial implications for disease diagnosis and treatment planning.
no code implementations • 21 Feb 2024 • Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Jianfeng Lu
This study delves into the enhancement of Under-Display Camera (UDC) image restoration models, focusing on their robustness against adversarial attacks.
1 code implementation • 29 Nov 2023 • Ziqiao Peng, Wentao Hu, Yue Shi, Xiangyu Zhu, Xiaomei Zhang, Hao Zhao, Jun He, Hongyan Liu, Zhaoxin Fan
A lifelike talking head requires synchronized coordination of subject identity, lip movements, facial expressions, and head poses.
no code implementations • 16 Oct 2023 • Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan
Dance and music are closely related forms of expression, with mutual retrieval between dance videos and music being a fundamental task in various fields like education, art, and sports.
no code implementations • 15 Sep 2023 • Xukun Zhou, Zhenbo Song, Jun He, Hongyan Liu, Zhaoxin Fan
Scene Graph Generation is a critical enabler of environmental comprehension for autonomous robotic systems.
no code implementations • 12 Sep 2023 • Yixing Lu, Zhaoxin Fan, Min Xu
In this paper, we introduce a novel semi-supervised learning framework tailored for medical image segmentation.
1 code implementation • ICCV 2023 • Xueting Yang, Yihao Luo, Yuliang Xiu, Wei Wang, Hao Xu, Zhaoxin Fan
In this paper, we propose replacing the implicit value with an adaptive uncertainty distribution, to differentiate between points based on their distance to the surface.
1 code implementation • 1 Aug 2023 • Zhenyuan Zhang, Zhenbo Song, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Jianfeng Lu
To the best of our knowledge, these two datasets are the first large-scale UHD datasets for SIRR.
no code implementations • 13 Jul 2023 • Zhaoxin Fan, Puquan Pan, Zeren Zhang, Ce Chen, Tianyang Wang, Siyang Zheng, Min Xu
Few-shot medical image semantic segmentation is of paramount importance in the domain of medical image analysis.
1 code implementation • 19 Jun 2023 • Ziqiao Peng, Yihao Luo, Yue Shi, Hao Xu, Xiangyu Zhu, Jun He, Hongyan Liu, Zhaoxin Fan
To enhance the visual accuracy of generated lip movements while reducing the dependence on labeled data, we propose a novel framework, SelfTalk, which applies self-supervision in a cross-modal network system to learn 3D talking faces.
2 code implementations • ICCV 2023 • Ziqiao Peng, HaoYu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Jun He, Hongyan Liu, Zhaoxin Fan
Specifically, we introduce the emotion disentangling encoder (EDE) to disentangle the emotion and content in speech by cross-reconstructing speech signals with different emotion labels.
1 code implementation • CVPR 2023 • Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Wenqi Ren, Jianfeng Lu
This paper addresses the problem of robust deep single-image reflection removal (SIRR) against adversarial attacks.
Ranked #2 on Reflection Removal on Real20
1 code implementation • 22 Dec 2022 • Zhaoxin Fan, Kaixing Yang, Min Zhang, Zhenbo Song, Hongyan Liu, Jun He
In stage 1, a novel device detection and tracking scheme is introduced, which accurately locates the height limit devices in the left or right image.
1 code implementation • 30 Nov 2022 • Zhaoxin Fan, Yuqing Pan, Hao Xu, Zhenbo Song, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
These novel elements of FuRPE not only serve to further refine the model but also to reduce potential biases that may arise from inaccuracies in pseudo labels, thereby optimizing the network's training process and enhancing the robustness of the model.
no code implementations • 23 Sep 2022 • Zhaoxin Fan, Zhenbo Song, Hongyan Liu, Jun He
Large-scale place recognition is a fundamental but challenging task, which plays an increasingly important role in autonomous driving and robotics.
no code implementations • 17 Sep 2022 • Zhaoxin Fan, Fengxin Li, Hongyan Liu, Jun He, Xiaoyong Du
In this paper, we study the new topic of object effects recommendation on micro-video platforms, which is a challenging but important task for many practical applications such as advertisement insertion.
1 code implementation • 19 Aug 2022 • Han Sun, Zhaoxin Fan, Zhenbo Song, Zhicheng Wang, Kejian Wu, Jianfeng Lu
The insight behind MonoSIM is to simulate the feature learning behaviors of a point cloud-based detector in a monocular detector during training.
no code implementations • 20 Apr 2022 • Zhaoxin Fan, Yulin He, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
Real-world sensors often produce incomplete, irregular, and noisy point clouds, making point cloud completion increasingly important.
no code implementations • 4 Apr 2022 • Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
Recently, RGBD-based category-level 6D object pose estimation has achieved promising improvements in performance; however, the requirement of depth information prohibits broader applications.
no code implementations • 20 Nov 2021 • Zhaoxin Fan, Zhengbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He
ACR-Pose consists of a Reconstructor and a Discriminator.
no code implementations • 29 Aug 2021 • Zhaoxin Fan, Zhenbo Song, Wenping Zhang, Hongyan Liu, Jun He, Xiaoyong Du
Third, we apply these kernels to previous point cloud features to generate new features, which is the well-known SO(3) mapping process.
no code implementations • 29 May 2021 • Zhaoxin Fan, Yazhi Zhu, Yulin He, Qi Sun, Hongyan Liu, Jun He
Therefore, this study presents a comprehensive review of recent progress in deep learning-based object pose detection and tracking.
no code implementations • 1 May 2021 • Zhaoxin Fan, Zhenbo Song, Hongyan Liu, Zhiwu Lu, Jun He, Xiaoyong Du
Point cloud-based large-scale place recognition is fundamental for many applications like Simultaneous Localization and Mapping (SLAM).
Ranked #2 on 3D Place Recognition on Oxford RobotCar Dataset