Search Results for author: Hongwei Yi

Found 23 papers, 14 papers with code

Generating Human Interaction Motions in Scenes with Text Control

no code implementations • 16 Apr 2024 • Hongwei Yi, Justus Thies, Michael J. Black, Xue Bin Peng, Davis Rempe

Our approach begins with pre-training a scene-agnostic text-to-motion diffusion model, emphasizing goal-reaching constraints on large-scale motion-capture datasets.

Denoising Human-Object Interaction Detection +1

Paper
Add Code

DECO: Dense Estimation of 3D Human-Scene Contact In The Wild

1 code implementation • ICCV 2023 • Shashank Tripathi, Agniv Chatterjee, Jean-Claude Passy, Hongwei Yi, Dimitrios Tzionas, Michael J. Black

In contrast, we focus on inferring dense, 3D contact between the full body surface and objects in arbitrary images.

Ranked #2 on Contact Detection on BEHAVE

Contact Detection Human-Object Interaction Detection

Paper
Code

POCO: 3D Pose and Shape Estimation with Confidence

1 code implementation • 24 Aug 2023 • Sai Kumar Dwivedi, Cordelia Schmid, Hongwei Yi, Michael J. Black, Dimitrios Tzionas

To address this, we develop POCO, a novel framework for training HPS regressors to estimate not only a 3D human body, but also their confidence, in a single feed-forward pass.

Action Recognition Pose Estimation +1

Paper
Code

TADA! Text to Animatable Digital Avatars

no code implementations • 21 Aug 2023 • Tingting Liao, Hongwei Yi, Yuliang Xiu, Jiaxaing Tang, Yangyi Huang, Justus Thies, Michael J. Black

We introduce TADA, a simple-yet-effective approach that takes textual descriptions and produces expressive 3D avatars with high-quality geometry and lifelike textures, that can be animated and rendered with traditional graphics pipelines.

Paper
Add Code

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

1 code implementation • 16 Aug 2023 • Yangyi Huang, Hongwei Yi, Yuliang Xiu, Tingting Liao, Jiaxiang Tang, Deng Cai, Justus Thies

But how to effectively capture all visual attributes of an individual from a single image, which are sufficient to reconstruct unseen areas (e. g., the back view)?

Descriptive Question Answering +1

347

Paper
Code

ProxyCap: Real-time Monocular Full-body Capture in World Space via Human-Centric Proxy-to-Motion Learning

no code implementations • 3 Jul 2023 • Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Jiajun Zhang, Hongwei Yi, Shengping Zhang, Yebin Liu

For more accurate and physically plausible predictions in world space, our network is designed to learn human motions from a human-centric perspective, which enables the understanding of the same motion captured with different camera trajectories.

Ranked #220 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

Paper
Add Code

GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction

1 code implementation • 29 Jun 2023 • Sihan Ma, Qiong Cao, Hongwei Yi, Jing Zhang, DaCheng Tao

Demystifying complex human-ground interactions is essential for accurate and realistic 3D human motion reconstruction from RGB videos, as it ensures consistency between the humans and the ground plane.

Paper
Code

High-Fidelity Clothed Avatar Reconstruction from a Single Image

1 code implementation • CVPR 2023 • Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei

This paper presents a framework for efficient 3D clothed avatar reconstruction.

Vocal Bursts Intensity Prediction

103

Paper
Code

SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments

1 code implementation • CVPR 2023 • Yudi Dai, Yitai Lin, Xiping Lin, Chenglu Wen, Lan Xu, Hongwei Yi, Siqi Shen, Yuexin Ma, Cheng Wang

We present SLOPER4D, a novel scene-aware dataset collected in large urban environments to facilitate the research of global human pose estimation (GHPE) with human-scene interaction in the wild.

3D Human Pose Estimation Camera Calibration +1

Paper
Code

Test-time Personalizable Forecasting of 3D Human Poses

no code implementations • ICCV 2023 • Qiongjie Cui, Huaijiang Sun, Jianfeng Lu, Weiqing Li, Bin Li, Hongwei Yi, Haofan Wang

Current motion forecasting approaches typically train a deep end-to-end model from the source domain data, and then apply it directly to target subjects.

Motion Forecasting

Paper
Add Code

Generating Holistic 3D Human Motion from Speech

2 code implementations • CVPR 2023 • Hongwei Yi, Hualin Liang, Yifei Liu, Qiong Cao, Yandong Wen, Timo Bolkart, DaCheng Tao, Michael J. Black

This work addresses the problem of generating 3D holistic body motions from human speech.

Ranked #2 on Gesture Generation on BEAT2

3D Face Animation Gesture Generation

259

Paper
Code

MIME: Human-Aware 3D Scene Generation

no code implementations • CVPR 2023 • Hongwei Yi, Chun-Hao P. Huang, Shashank Tripathi, Lea Hering, Justus Thies, Michael J. Black

We propose MIME (Mining Interaction and Movement to infer 3D Environments), which is a generative model of indoor scenes that produces furniture layouts that are consistent with the human movement.

Ranked #2 on Indoor Scene Synthesis on PRO-teXt

2D Semantic Segmentation task 1 (8 classes) 3D Semantic Scene Completion +2

Paper
Add Code

One-shot Implicit Animatable Avatars with Model-based Priors

no code implementations • ICCV 2023 • Yangyi Huang, Hongwei Yi, Weiyang Liu, Haofan Wang, Boxi Wu, Wenxiao Wang, Binbin Lin, Debing Zhang, Deng Cai

Most of these methods fail to achieve realistic reconstruction when only a single image is available.

Neural Rendering

Paper
Add Code

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

no code implementations • 24 Sep 2022 • Jiankai Sun, Yan Xu, Mingyu Ding, Hongwei Yi, Chen Wang, Jingdong Wang, Liangjun Zhang, Mac Schwager

Using current NeRF training tools, a robot can train a NeRF environment model in real-time and, using our algorithm, identify 3D bounding boxes of objects of interest within the NeRF for downstream navigation or manipulation tasks.

Object Localization Robot Navigation

Paper
Add Code

Capturing and Inferring Dense Full-Body Human-Scene Contact

2 code implementations • CVPR 2022 • Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. Black

We capture a new dataset called RICH for "Real scenes, Interaction, Contact and Humans."

Ranked #3 on Contact Detection on BEHAVE

4k Contact Detection +3

Paper
Code

Human-Aware Object Placement for Visual Environment Reconstruction

1 code implementation • CVPR 2022 • Hongwei Yi, Chun-Hao P. Huang, Dimitrios Tzionas, Muhammed Kocabas, Mohamed Hassan, Siyu Tang, Justus Thies, Michael J. Black

In fact, we demonstrate that these human-scene interactions (HSIs) can be leveraged to improve the 3D reconstruction of a scene from a monocular RGB video.

3D Reconstruction Object

Paper
Code

Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking

2 code implementations • ECCV 2020 • Jian-Feng Yan, Zizhuang Wei, Hongwei Yi, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai

In this paper, we propose an efficient and effective dense hybrid recurrent multi-view stereo net with dynamic consistency checking, namely $D^{2}$HC-RMVSNet, for accurate dense point cloud reconstruction.

Point cloud reconstruction

109

Paper
Code

M^3VSNet: Unsupervised Multi-metric Multi-view Stereo Network

1 code implementation • 30 Apr 2020 • Baichuan Huang, Hongwei Yi, Can Huang, Yijia He, Jingbin Liu, Xiao Liu

To improve the robustness and completeness of point cloud reconstruction, we propose a novel multi-metric loss function that combines pixel-wise and feature-wise loss function to learn the inherent constraints from different perspectives of matching correspondences.

Point cloud reconstruction

151

Paper
Code

M^3VSNet: Unsupervised Multi-metric Multi-view Stereo Network

1 code implementation • 21 Apr 2020 • Baichuan Huang, Hongwei Yi, Can Huang, Yijia He, Jingbin Liu, Xiao Liu

Point cloud reconstruction

151

Paper
Code

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

no code implementations • 13 Feb 2020 • Hongwei Yi, Shaoshuai Shi, Mingyu Ding, Jiankai Sun, Kui Xu, Hui Zhou, Zhe Wang, Sheng Li, Guoping Wang

First, the semantic context information in LiDAR is seldom explored in previous works, which may help identify ambiguous vehicles.

Autonomous Driving Semantic Segmentation

Paper
Add Code

Learning Depth-Guided Convolutions for Monocular 3D Object Detection

2 code implementations • CVPR 2020 • Mingyu Ding, Yuqi Huo, Hongwei Yi, Zhe Wang, Jianping Shi, Zhiwu Lu, Ping Luo

3D object detection from a single image without LiDAR is a challenging task due to the lack of accurate depth information.

Ranked #17 on Vehicle Pose Estimation on KITTI Cars Hard

Monocular 3D Object Detection Object +2

313

Paper
Code

Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation

1 code implementation • ECCV 2020 • Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai

n this paper, we propose an effective and efficient pyramid multi-view stereo (MVS) net with self-adaptive view aggregation for accurate and complete dense point cloud reconstruction.

3D Point Cloud Reconstruction Depth Estimation +1

Paper
Code

MMFace: A Multi-Metric Regression Network for Unconstrained Face Reconstruction

no code implementations • CVPR 2019 • Hongwei Yi, Chen Li, Qiong Cao, Xiaoyong Shen, Sheng Li, Guoping Wang, Yu-Wing Tai

We propose to address the face reconstruction in the wild by using a multi-metric regression network, MMFace, to align a 3D face morphable model (3DMM) to an input image.

Face Reconstruction regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.