no code implementations • 31 Jan 2024 • Wenxuan Yang, Weimin Tan, Yuqi Sun, Bo Yan
This paper introduces data-effective learning, aiming to use data in the most impactful way to pre-train foundation models.
1 code implementation • 18 Dec 2023 • Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan
With the rise of real-time rendering and the evolution of display devices, there is a growing demand for post-processing methods that offer high-resolution content in a high frame rate.
no code implementations • 12 Dec 2023 • Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan
In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample.
1 code implementation • 15 Sep 2023 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan
Second, we propose a novel representation diffusion model (RDM) to disentangle 3D latent into facial identity and expression.
no code implementations • 3 Aug 2023 • Shili Zhou, Xuhao Jiang, Weimin Tan, Ruian He, Bo Yan
Motion vectors, one of the compression information, record the motion of the video frames.
no code implementations • 31 Jul 2023 • Shili Zhou, Ruian He, Weimin Tan, Bo Yan
Optical Flow Estimation aims to find the 2D dense motion field between two frames.
no code implementations • 31 Jul 2023 • Ri Cheng, Xuhao Jiang, Ruian He, Shili Zhou, Weimin Tan, Bo Yan
We can use dynamic spatial pruning method to skip redundant computation, but this method cannot properly identify easy regions in VFI tasks without supervision.
1 code implementation • 3 Jul 2023 • Weimin Tan, Siyuan Chen, Bo Yan
Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a bottleneck.
Ranked #5 on Few-Shot Semantic Segmentation on PASCAL-5i (1-Shot)
no code implementations • 19 Jun 2023 • Yuqi Sun, Ruian He, Weimin Tan, Bo Yan
Given a short speech video, we first build an efficient talking radiance field, and then apply the latest conditional diffusion model for image editing based on the given instructions and guiding implicit representation optimization towards the editing target.
1 code implementation • 24 May 2023 • Yu Ling, Weimin Tan, Bo Yan
Survival analysis aims at modeling the relationship between covariates and event occurrence with some untracked (censored) samples.
no code implementations • 4 May 2023 • Xuhao Jiang, Weimin Tan, Qing Lin, Chenxi Ma, Bo Yan, Liquan Shen
In recent years, many convolutional neural network-based models are designed for JPEG artifacts reduction, and have achieved notable progress.
no code implementations • 26 Apr 2023 • Xuhao Jiang, Weimin Tan, Tian Tan, Bo Yan, Liquan Shen
Image-based single-modality compression learning approaches have demonstrated exceptionally powerful encoding and decoding capabilities in the past few years , but suffer from blur and severe semantics loss at extremely low bitrates.
no code implementations • 18 Jul 2022 • Ri Cheng, Yuqi Sun, Bo Yan, Weimin Tan, Chenxi Ma
To address these problems, we propose the MVSRnet, which uses geometry information to extract sharp details from all LR multi-view to support the SR of the LR input view.
no code implementations • 15 Jul 2022 • Xuhao Jiang, Weimin Tan, Ri Cheng, Shili Zhou, Bo Yan
Under stereo settings, the performance of image JPEG artifacts removal can be further improved by exploiting the additional information provided by a second view.
no code implementations • 14 Jul 2022 • Chenxi Ma, Bo Yan, Qing Lin, Weimin Tan, Siming Chen
To enhance the semantic accuracy and the visual quality of the reconstructed image, we explore the multi-modal fusion learning in SISR by proposing a Text-Guided Super-Resolution (TGSR) framework, which can effectively utilize the information from the text and image modalities.
no code implementations • 14 Jul 2022 • Chenxi Ma, Bo Yan, Weimin Tan, Xuhao Jiang
Recent studies of deep learning based stereo image super-resolution (StereoSR) have promoted the development of StereoSR.
no code implementations • CVPR 2022 • Yuqi Sun, Shili Zhou, Ri Cheng, Weimin Tan, Bo Yan, Lang Fu
Specifically, GR stage takes sparse depth map and RGB as input to predict dense depth map by exploiting the correlation between two modals.
1 code implementation • 8 Jul 2021 • Ruian He, Zhen Xing, Weimin Tan, Bo Yan
Affective Analysis is not a single task, and the valence-arousal value, expression class, and action unit can be predicted at the same time.
1 code implementation • ICCV 2021 • Jichun Li, Weimin Tan, Bo Yan
Deep learning algorithms have made significant progress in dynamic scene deblurring.
no code implementations • 28 Sep 2019 • Bo Yan, Chuming Lin, Weimin Tan
For video super-resolution, current state-of-the-art approaches either process multiple low-resolution (LR) frames to produce each output high-resolution (HR) frame separately in a sliding window fashion or recurrently exploit the previously estimated HR frames to super-resolve the following frame.
1 code implementation • 2019 IEEE International Conference on Multimedia and Expo (ICME) 2019 • Junyi Wang, Weimin Tan, Xuejing Niu and Bo Yan
We also present a new RDGAN (Retinex Decomposition based Generative Adversarial Network) loss, which is computed on the decomposed reflectance components of the enhanced an the reference images.
1 code implementation • 9 May 2019 • Weimin Tan, Bo Yan, Chumin Lin, Xuejing Niu
Our idea is built on the reverse mapping from the retargeted images to the given input images.
no code implementations • CVPR 2018 • Weimin Tan, Bo Yan, Bahetiyaer Bare
In this paper, different from image super-resolution (ISR), we propose a novel super-resolution technique called feature super-resolution (FSR), which aims at enhancing the discriminatory power of small size image in order to provide high recognition precision for machine.
no code implementations • 26 Apr 2017 • Dawei Dai, Weimin Tan, Hong Zhan
Experiments for two types of ANN model including multi-layer MLP and CNN verify that the network flow based on class-pathway is a reasonable explanation for ANN models.