Search Results for author: Mingzhen Huang

Found 8 papers, 6 papers with code

Exposing Text-Image Inconsistency Using Diffusion Models

1 code implementation • 28 Apr 2024 • Mingzhen Huang, Shan Jia, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu

In the battle against widespread online misinformation, a growing problem is text-image inconsistency, where images are misleadingly paired with texts with different intent or meaning.

Misinformation

Language-guided Human Motion Synthesis with Atomic Actions

1 code implementation • 18 Aug 2023 • Yuanhao Zhai, Mingzhen Huang, Tianyu Luan, Lu Dong, Ifeoma Nwogu, Siwei Lyu, David Doermann, Junsong Yuan

In this paper, we propose ATOM (ATomic mOtion Modeling) to mitigate this problem by decomposing actions into atomic actions and employing a curriculum learning strategy to learn atomic action composition.

Motion Synthesis

AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics

1 code implementation • 14 Apr 2023 • Shan Jia, Mingzhen Huang, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu

To achieve this, we propose a new approach that leverages the DALL-E2 language-image model to automatically generate and splice masked regions guided by a text prompt.

Image and Video Forgery Detection • Image Generation

Tracking Multiple Deformable Objects in Egocentric Videos

no code implementations • CVPR 2023 • Mingzhen Huang, Xiaoxing Li, Jun Hu, Honghong Peng, Siwei Lyu

DETracker outperforms existing state-of-the-art methods on the DogThruGlasses and YouTube-Hand datasets.

Disentanglement • Motion Disentanglement • +2

Whose Hands Are These? Hand Detection and Hand-Body Association in the Wild

1 code implementation • CVPR 2022 • Supreeth Narasimhaswamy, Thanh Nguyen, Mingzhen Huang, Minh Hoai

We also introduce a new, challenging dataset called BodyHands, which contains unconstrained images annotated with hand locations and their corresponding body locations.

Hand Detection

Forward Propagation, Backward Regression, and Pose Association for Hand Tracking in the Wild

1 code implementation • CVPR 2022 • Mingzhen Huang, Supreeth Narasimhaswamy, Saif Vazir, Haibin Ling, Minh Hoai

The first stage is Forward Propagation, where the features from frame t-1 are propagated to frame t based on previously detected hands and their estimated motion.

Ranked #1 on Multiple Object Tracking on YouTube-Hands (using extra training data)

Multiple Object Tracking • regression

Variational Feature Disentangling for Fine-Grained Few-Shot Classification

no code implementations • ICCV 2021 • Jingyi Xu, Hieu Le, Mingzhen Huang, ShahRukh Athar, Dimitris Samaras

We assume that the distribution of intra-class variance generalizes across the base class and the novel class.

Ranked #15 on Few-Shot Image Classification on CUB 200 5-way 5-shot

Classification • Few-Shot Image Classification • +2

LaSOT: A High-quality Large-scale Single Object Tracking Benchmark

1 code implementation • 8 Sep 2020 • Heng Fan, Hexin Bai, Liting Lin, Fan Yang, Peng Chu, Ge Deng, Sijia Yu, Harshit, Mingzhen Huang, Juehuan Liu, Yong Xu, Chunyuan Liao, Lin Yuan, Haibin Ling

The average video length of LaSOT is around 2,500 frames, and each video contains various challenge factors that exist in real-world video footage, such as targets disappearing and re-appearing.

Object Tracking • Visual Tracking • +1
