Search Results for author: Yawen Huang

Found 35 papers, 22 papers with code

AnomalyXFusion: Multi-modal Anomaly Synthesis with Diffusion

1 code implementation • 30 Apr 2024 • Jie Hu, Yawen Huang, Yilin Lu, Guoyang Xie, Guannan Jiang, Yefeng Zheng, Zhichao Lu

The AnomalyXFusion framework comprises two distinct yet synergistic modules: the Multi-modal In-Fusion (MIF) module and the Dynamic Dif-Fusion (DDF) module.

Paper
Code

Learning Long-form Video Prior via Generative Pre-Training

1 code implementation • 24 Apr 2024 • Jinheng Xie, Jiajun Feng, Zhaoxu Tian, Kevin Qinghong Lin, Yawen Huang, Xi Xia, Nanxu Gong, Xu Zuo, Jiaqi Yang, Yefeng Zheng, Mike Zheng Shou

Instead of operating on pixel space, it is efficient to employ visual locations like bounding boxes and keypoints to represent key information in videos, which can be simply discretized and then tokenized for consumption by GPT.

Paper
Code

NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization

no code implementations • 7 Apr 2024 • Peng Tu, Xun Zhou, Mingming Wang, Xiaojun Yang, Bo Peng, Ping Chen, Xiu Su, Yawen Huang, Yefeng Zheng, Chang Xu

Neural Radiance Fields (NeRF) have emerged as a paradigm-shifting methodology for the photorealistic rendering of objects and environments, enabling the synthesis of novel viewpoints with remarkable fidelity.

Autonomous Vehicles Point Cloud Generation

Paper
Add Code

ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields

1 code implementation • 2 Feb 2024 • Xingyu Miao, Yang Bai, Haoran Duan, Fan Wan, Yawen Huang, Yang Long, Yefeng Zheng

Most of the existing works on arbitrary 3D NeRF style transfer required retraining on each single style condition.

Style Transfer

Paper
Code

CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video

no code implementations • 10 Jan 2024 • Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Yang Long, Yefeng Zheng

The goal of our work is to generate high-quality novel views from monocular videos of complex and dynamic scenes.

Paper
Add Code

Federated Learning via Input-Output Collaborative Distillation

1 code implementation • 22 Dec 2023 • Xuan Gong, Shanglin Li, Yuxiang Bao, Barry Yao, Yawen Huang, Ziyan Wu, Baochang Zhang, Yefeng Zheng, David Doermann

Federated learning (FL) is a machine learning paradigm in which distributed local nodes collaboratively train a central model without sharing individually held private data.

Federated Learning Image Classification

Paper
Code

UniHead: Unifying Multi-Perception for Detection Heads

1 code implementation • 23 Sep 2023 • Hantao Zhou, Rui Yang, Yachao Zhang, Haoran Duan, Yawen Huang, Runze Hu, Xiu Li, Yefeng Zheng

More precisely, our approach (1) introduces deformation perception, enabling the model to adaptively sample object features; (2) proposes a Dual-axial Aggregation Transformer (DAT) to adeptly model long-range dependencies, thereby achieving global perception; and (3) devises a Cross-task Interaction Transformer (CIT) that facilitates interaction between the classification and localization branches, thus aligning the two tasks.

Paper
Code

Automatic view plane prescription for cardiac magnetic resonance imaging via supervision by spatial relationship between views

1 code implementation • 22 Sep 2023 • Dong Wei, Yawen Huang, Donghuan Lu, Yuexiang Li, Yefeng Zheng

Then, a multi-view planning strategy is proposed to aggregate information from the predicted heatmaps for all the source views of a target plane, for a globally optimal prescription, mimicking the similar strategy practiced by skilled human prescribers.

Anatomy

Paper
Code

DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume

1 code implementation • 14 Aug 2023 • Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Xinxing Xu, Yang Long, Yefeng Zheng

Nevertheless, the dynamic cost volume inevitably generates extra occlusions and noise, thus we alleviate this by designing a fusion module that makes static and dynamic cost volumes compensate for each other.

Ranked #3 on Unsupervised Monocular Depth Estimation on Cityscapes

Monocular Depth Estimation Optical Flow Estimation +1

Paper
Code

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

2 code implementations • ICCV 2023 • Jinheng Xie, Yuexiang Li, Yawen Huang, Haozhe Liu, Wentian Zhang, Yefeng Zheng, Mike Zheng Shou

As such paired data is time-consuming and labor-intensive to acquire and restricted to a closed set, this potentially becomes the bottleneck for applications in an open world.

Ranked #5 on Conditional Text-to-Image Synthesis on COCO-MIG

Conditional Text-to-Image Synthesis Denoising

207

Paper
Code

K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment

no code implementations • 10 Jul 2023 • Guoyang Xie, Jinbao Wang, Yawen Huang, Jiayi Lyu, Feng Zheng, Yefeng Zheng, Yaochu Jin

To further reflect the frequency-specific information from the magnetic resonance imaging principles, both k-space features and vision features are obtained and employed in our comprehensive encoders with a frequency reconstruction penalty.

Image Generation SSIM

Paper
Add Code

Dynamically Masked Discriminator for Generative Adversarial Networks

1 code implementation • 13 Jun 2023 • Wentian Zhang, Haozhe Liu, Bing Li, Jinheng Xie, Yawen Huang, Yuexiang Li, Yefeng Zheng, Bernard Ghanem

By treating the generated data in training as a stream, we propose to detect whether the discriminator slows down the learning of new knowledge in generated data.

Continual Learning

Paper
Code

Cross-Modal Vertical Federated Learning for MRI Reconstruction

no code implementations • 5 Jun 2023 • Yunlu Yan, Hong Wang, Yawen Huang, Nanjun He, Lei Zhu, Yuexiang Li, Yong Xu, Yefeng Zheng

To this end, we formulate this practical-yet-challenging cross-modal vertical federated learning task, in which shape data from multiple hospitals have different modalities with a small amount of multi-modality data collected from the same individuals.

Disentanglement MRI Reconstruction +1

Paper
Add Code

Improving GAN Training via Feature Space Shrinkage

1 code implementation • 2 Mar 2023 • Haozhe Liu, Wentian Zhang, Bing Li, Haoqian Wu, Nanjun He, Yawen Huang, Yuexiang Li, Bernard Ghanem, Yefeng Zheng

The evaluation results demonstrate that our AdaptiveMix can facilitate the training of GANs and effectively improve the image quality of generated samples.

Out of Distribution (OOD) Detection

Paper
Code

Interactive Segmentation as Gaussian Process Classification

1 code implementation • 28 Feb 2023 • Minghao Zhou, Hong Wang, Qian Zhao, Yuexiang Li, Yawen Huang, Deyu Meng, Yefeng Zheng

Against this issue, in this paper, we propose to formulate the IS task as a Gaussian process (GP)-based pixel-wise binary classification model on each image.

Binary Classification Classification +4

Paper
Code

FemtoDet: An Object Detection Baseline for Energy Versus Performance Tradeoffs

1 code implementation • ICCV 2023 • Peng Tu, Xu Xie, Guo Ai, Yuexiang Li, Yawen Huang, Yefeng Zheng

Efficient detectors for edge devices are often optimized for parameters or speed count metrics, which remain in weak correlation with the energy of detectors.

Ranked #30 on Object Detection on PASCAL VOC 2007

object-detection Object Detection

Paper
Code

A New Perspective to Boost Vision Transformer for Medical Image Classification

no code implementations • 3 Jan 2023 • Yuexiang Li, Yawen Huang, Nanjun He, Kai Ma, Yefeng Zheng

The experimental results validate the superiority of our BOLT for medical image classification, compared to ImageNet pretrained weights and state-of-the-art self-supervised learning approaches.

Diabetic Retinopathy Grading Image Classification +5

Paper
Add Code

AdaptiveMix: Improving GAN Training via Feature Space Shrinkage

1 code implementation • CVPR 2023 • Haozhe Liu, Wentian Zhang, Bing Li, Haoqian Wu, Nanjun He, Yawen Huang, Yuexiang Li, Bernard Ghanem, Yefeng Zheng

The evaluation results demonstrate that our AdaptiveMix can facilitate the training of GANs and effectively improve the image quality of generated samples.

Out of Distribution (OOD) Detection

Paper
Code

Interactive Segmentation As Gaussion Process Classification

1 code implementation • CVPR 2023 • Minghao Zhou, Hong Wang, Qian Zhao, Yuexiang Li, Yawen Huang, Deyu Meng, Yefeng Zheng

Against this issue, in this paper, we propose to formulate the IS task as a Gaussian process (GP)-based pixel-wise binary classification model on each image.

Binary Classification Classification +4

Paper
Code

SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation

no code implementations • CVPR 2023 • Huimin Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Hong Wang, Yawen Huang, Yefeng Zheng

Semi-supervised learning improves data efficiency of deep models by leveraging unlabeled samples to alleviate the reliance on a large set of labeled samples.

Semantic Segmentation

Paper
Add Code

Orientation-Shared Convolution Representation for CT Metal Artifact Learning

1 code implementation • 26 Dec 2022 • Hong Wang, Qi Xie, Yuexiang Li, Yawen Huang, Deyu Meng, Yefeng Zheng

During X-ray computed tomography (CT) scanning, metallic implants carrying with patients often lead to adverse artifacts in the captured CT images and then impair the clinical treatment.

Computed Tomography (CT) Metal Artifact Reduction

Paper
Code

Knowing the Past to Predict the Future: Reinforcement Virtual Learning

no code implementations • 2 Nov 2022 • Peng Zhang, Yawen Huang, Bingzhang Hu, Shizheng Wang, Haoran Duan, Noura Al Moubayed, Yefeng Zheng, Yang Long

Reinforcement Learning (RL)-based control system has received considerable attention in recent decades.

Reinforcement Learning (RL)

Paper
Add Code

Decoupled Mixup for Generalized Visual Recognition

1 code implementation • 26 Oct 2022 • Haozhe Liu, Wentian Zhang, Jinheng Xie, Haoqian Wu, Bing Li, Ziqi Zhang, Yuexiang Li, Yawen Huang, Bernard Ghanem, Yefeng Zheng

Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair.

Paper
Code

A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays

1 code implementation • 5 Sep 2022 • Haoqin Ji, Haozhe Liu, Yuexiang Li, Jinheng Xie, Nanjun He, Yawen Huang, Dong Wei, Xinrong Chen, Linlin Shen, Yefeng Zheng

Such a point annotation setting can provide weakly instance-level information for abnormality localization with a marginal annotation cost.

Paper
Code

Combating Mode Collapse in GANs via Manifold Entropy Estimation

1 code implementation • 25 Aug 2022 • Haozhe Liu, Bing Li, Haoqian Wu, Hanbang Liang, Yawen Huang, Yuexiang Li, Bernard Ghanem, Yefeng Zheng

In this paper, we propose a novel training pipeline to address the mode collapse issue of GANs.

Paper
Code

mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation

1 code implementation • 6 Jun 2022 • Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng Zheng

Concretely, we propose a novel multimodal Medical Transformer (mmFormer) for incomplete multimodal learning with three main components: the hybrid modality-specific encoders that bridge a convolutional encoder and an intra-modal Transformer for both local and global context modeling within each modality; an inter-modal Transformer to build and align the long-range correlations across modalities for modality-invariant features with global semantics corresponding to tumor region; a decoder that performs a progressive up-sampling and fusion with the modality-invariant features to generate robust segmentation.

Ranked #65 on Semantic Segmentation on NYU Depth v2

Brain Tumor Segmentation Decoder +2

Paper
Code

Cross-Modality Neuroimage Synthesis: A Survey

no code implementations • 14 Feb 2022 • Guoyang Xie, Yawen Huang, Jinbao Wang, Jiayi Lyu, Feng Zheng, Yefeng Zheng, Yaochu Jin

This is followed by a stepwise in-depth analysis to evaluate how cross-modality neuroimage synthesis improves the performance of its downstream tasks.

Image Generation Weakly-supervised Learning

Paper
Add Code

FedMed-ATL: Misaligned Unpaired Brain Image Synthesis via Affine Transform Loss

1 code implementation • 29 Jan 2022 • Jinbao Wang, Guoyang Xie, Yawen Huang, Yefeng Zheng, Yaochu Jin, Feng Zheng

The proposed method demonstrates the advanced performance in both the quality of our synthesized results under a severely misaligned and unpaired data setting, and better stability than other GAN-based algorithms.

Data Augmentation Image Generation +1

Paper
Code

FedMed-GAN: Federated Domain Translation on Unsupervised Cross-Modality Brain Image Synthesis

1 code implementation • 22 Jan 2022 • Jinbao Wang, Guoyang Xie, Yawen Huang, Jiayi Lyu, Yefeng Zheng, Feng Zheng, Yaochu Jin

There is a clear need to launch a federated learning and facilitate the integration of the dispersed data from different institutions.

Federated Learning Image Generation +1

Paper
Code

GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference

no code implementations • 28 Dec 2021 • Peng Tu, Yawen Huang, Feng Zheng, Zhenyu He, Liujun Cao, Ling Shao

In this paper, we propose a novel method for semi-supervised semantic segmentation named GuidedMix-Net, by leveraging labeled information to guide the learning of unlabeled instances.

Segmentation Semi-Supervised Semantic Segmentation

Paper
Add Code

GuidedMix-Net: Learning to Improve Pseudo Masks Using Labeled Images as Reference

1 code implementation • 29 Jun 2021 • Peng Tu, Yawen Huang, Rongrong Ji, Feng Zheng, Ling Shao

To take advantage of the labeled examples and guide unlabeled data learning, we further propose a mask generation module to generate high-quality pseudo masks for the unlabeled data.

Ranked #1 on Semi-Supervised Semantic Segmentation on PASCAL VOC 2012 500 labels

Semi-Supervised Semantic Segmentation

Paper
Code

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net

no code implementations • CVPR 2021 • Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao

Recent advances in neuroscience have highlighted the effectiveness of multi-modal medical data for investigating certain pathologies and understanding human cognition.

Image Generation

Paper
Add Code

Brain Image Synthesis with Unsupervised Multivariate Canonical CSC$\ell_4$Net

no code implementations • 22 Mar 2021 • Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao

Recent advances in neuroscience have highlighted the effectiveness of multi-modal medical data for investigating certain pathologies and understanding human cognition.

Image Generation

Paper
Add Code

DOTE: Dual cOnvolutional filTer lEarning for Super-Resolution and Cross-Modality Synthesis in MRI

no code implementations • 15 Jun 2017 • Yawen Huang, Ling Shao, Alejandro F. Frangi

Cross-modal image synthesis is a topical problem in medical image computing.

Image Generation Image Super-Resolution

Paper
Add Code

Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images using Weakly-Supervised Joint Convolutional Sparse Coding

no code implementations • CVPR 2017 • Yawen Huang, Ling Shao, Alejandro F. Frangi

We propose the weakly-supervised joint convolutional sparse coding to simultaneously solve the problems of super-resolution (SR) and cross-modality image synthesis.

Dictionary Learning Image Generation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.