Search Results for author: Nanxuan Zhao

Found 22 papers, 7 papers with code

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting

no code implementations • 30 Apr 2024 • Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu

We propose GS-LRM, a scalable large reconstruction model that can predict high-quality 3D Gaussian primitives from 2-4 posed sparse images in 0. 23 seconds on single A100 GPU.

3D Generation

Paper
Add Code

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

no code implementations • 8 Apr 2024 • Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang

Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.

Image Generation Object

Paper
Add Code

Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

no code implementations • 1 Mar 2024 • Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau

Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis.

Image Generation

Paper
Add Code

AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising

no code implementations • 5 Feb 2024 • Maham Tanveer, Yizhi Wang, Ruiqi Wang, Nanxuan Zhao, Ali Mahdavi-Amiri, Hao Zhang

We present AnaMoDiff, a novel diffusion-based method for 2D motion analogies that is applied to raw, unannotated videos of articulated characters.

Denoising Optical Flow Estimation

Paper
Add Code

Localizing and Editing Knowledge in Text-to-Image Generative Models

no code implementations • 20 Oct 2023 • Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha

We adapt Causal Mediation Analysis for text-to-image models and trace knowledge about distinct visual attributes to various (causal) components in the (i) UNet and (ii) text-encoder of the diffusion model.

Attribute Image Generation +1

Paper
Add Code

Text-Guided Vector Graphics Customization

no code implementations • 21 Sep 2023 • Peiying Zhang, Nanxuan Zhao, Jing Liao

In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while preserving the properties and layer-wise information of a given exemplar SVG.

Vector Graphics

Paper
Add Code

NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos

1 code implementation • 23 Aug 2023 • Ziyu Yang, Sucheng Ren, Zongwei Wu, Nanxuan Zhao, Junle Wang, Jing Qin, Shengfeng He

Non-photorealistic videos are in demand with the wave of the metaverse, but lack of sufficient research studies.

Saliency Detection

Paper
Code

Language-based Photo Color Adjustment for Graphic Designs

no code implementations • 6 Aug 2023 • Zhenwei Wang, Nanxuan Zhao, Gerhard Hancke, Rynson W. H. Lau

We also introduce an approach for generating a synthetic graphic design dataset with instructions to enable model training.

Paper
Add Code

FashionTex: Controllable Virtual Try-on with Text and Texture

1 code implementation • 8 May 2023 • Anran Lin, Nanxuan Zhao, Shuliang Ning, Yuda Qiu, Baoyuan Wang, Xiaoguang Han

Virtual try-on attracts increasing research attention as a promising way for enhancing the user experience for online cloth shopping.

Virtual Try-on

Paper
Code

AssetField: Assets Mining and Reconfiguration in Ground Feature Plane Representation

no code implementations • ICCV 2023 • Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Bo Dai, Dahua Lin

Traditional modeling pipelines keep an asset library storing unique object templates, which is both versatile and memory efficient in practice.

Novel View Synthesis Object

Paper
Add Code

Grid-guided Neural Radiance Fields for Large Urban Scenes

no code implementations • CVPR 2023 • Linning Xu, Yuanbo Xiangli, Sida Peng, Xingang Pan, Nanxuan Zhao, Christian Theobalt, Bo Dai, Dahua Lin

An alternative solution is to use a feature grid representation, which is computationally efficient and can naturally scale to a large scene with increased grid resolutions.

Paper
Add Code

Neural Preset for Color Style Transfer

1 code implementation • CVPR 2023 • Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W. H. Lau

In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed.

4k Color Normalization +4

244

Paper
Code

Bring Clipart to Life

1 code implementation • ICCV 2023 • Nanxuan Zhao, Shengqi Dang, Hexun Lin, Yang Shi, Nan Cao

The development of face editing has been boosted since the birth of StyleGAN.

Paper
Code

UniColor: A Unified Framework for Multi-Modal Colorization with Transformer

no code implementations • 22 Sep 2022 • Zhitong Huang, Nanxuan Zhao, Jing Liao

In the first stage, multi-modal conditions are converted into a common representation of hint points.

Colorization

Paper
Add Code

TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition

1 code implementation • CVPR 2022 • Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin

To mitigate this problem, we developed TransRank, a unified framework for recognizing Transformations in a Ranking formulation.

Action Recognition Representation Learning +3

Paper
Code

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

no code implementations • 10 Dec 2021 • Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin

The wide span of viewing positions within these scenes yields multi-scale renderings with very different levels of detail, which poses great challenges to neural radiance field and biases it towards compromised results.

Paper
Add Code

Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation

no code implementations • ICCV 2021 • Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu

While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory.

Ranked #23 on Skeleton Based Action Recognition on NTU RGB+D 120

3D Human Pose Estimation 3D Pose Estimation +3

Paper
Add Code

Unifying Global-Local Representations in Salient Object Detection with Transformer

1 code implementation • 5 Aug 2021 • Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, Shengfeng He

In this paper, we introduce a new attention-based encoder, vision transformer, into salient object detection to ensure the globalization of the representations from shallow to deep layers.

Decoder object-detection +2

Paper
Code

Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation

1 code implementation • CVPR 2021 • Haoxin Chen, Hanjie Wu, Nanxuan Zhao, Sucheng Ren, Shengfeng He

The key is to model the relationship between the query videos and the support images for propagating the object information.

Meta-Learning Semantic Segmentation +2

Paper
Code

BlockPlanner: City Block Generation With Vectorized Graph Representation

no code implementations • ICCV 2021 • Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin

City modeling is the foundation for computational urban planning, navigation, and entertainment.

valid

Paper
Add Code

What makes instance discrimination good for transfer learning?

no code implementations • ICLR 2021 • Nanxuan Zhao, Zhirong Wu, Rynson W. H. Lau, Stephen Lin

Contrastive visual pretraining based on the instance discrimination pretext task has made significant progress.

object-detection Object Detection +1

Paper
Add Code

Distilling Localization for Self-Supervised Representation Learning

no code implementations • 14 Apr 2020 • Nanxuan Zhao, Zhirong Wu, Rynson W. H. Lau, Stephen Lin

To address this problem, we propose a data-driven approach for learning invariance to backgrounds.

Colorization Contrastive Learning +8

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.