Search Results for author: Wenze Hu

Found 13 papers, 9 papers with code

Guiding Instruction-based Image Editing via Multimodal Large Language Models

2 code implementations • 29 Sep 2023 • Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan

Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and our MGIE can lead to a notable improvement in automatic metrics and human evaluation while maintaining competitive inference efficiency.

Image Manipulation, Response Generation

Universal Object Detection with Large Vision Model

1 code implementation • 19 Dec 2022 • Feng Lin, Wenze Hu, YaoWei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang

In this study, our focus is on a specific challenge: the large-scale, multi-domain universal object detection problem, which contributes to the broader goal of achieving a universal vision system.

Object, Object Detection +1

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

1 code implementation • CVPR 2023 • Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

In this paper, we propose a neural architecture representation model that can be used to estimate these attributes holistically.

Representation Learning

Fcaformer: Forward Cross Attention in Hybrid Vision Transformer

2 code implementations • ICCV 2023 • Haokui Zhang, Wenze Hu, Xiaoyu Wang

Currently, one main research line in designing a more efficient vision transformer is reducing the computational cost of self attention modules by adopting sparse attention or using local attention windows.

Image Classification, Knowledge Distillation

ParCNetV2: Oversized Kernel with Enhanced Attention

1 code implementation • ICCV 2023 • Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Specifically, we propose a new convolutional neural network, ParCNetV2, that extends position-aware circular convolution (ParCNet) with oversized convolutions and bifurcate gate units to enhance attention.

Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs

no code implementations • 8 Oct 2022 • Tao Yang, Haokui Zhang, Wenze Hu, Changwen Chen, Xiaoyu Wang

Transformer models have made tremendous progress in various fields in recent years.

Image Classification, Inductive Bias +1

ALBench: A Framework for Evaluating Active Learning in Object Detection

1 code implementation • 27 Jul 2022 • Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang

To facilitate research in this field, this paper contributes an active learning benchmark framework named ALBench for evaluating active learning in object detection.

Active Learning, Image Classification +4

Implementation of an Automated Learning System for Non-experts

1 code implementation • 26 Mar 2022 • Phoenix X. Huang, Zhiwei Zhao, Chao Liu, Jingyi Liu, Wenze Hu, Xiaoyu Wang

This paper details the engineering implementation of an automated machine learning system called YMIR, which relies entirely on a graphical interface to interact with users.

BIG-bench Machine Learning, Management

ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer

3 code implementations • 8 Mar 2022 • Haokui Zhang, Wenze Hu, Xiaoyu Wang

Experimental results show that the proposed ParC-Net achieves better performance than popular lightweight ConvNets and vision transformer based models on common vision tasks and datasets, while having fewer parameters and faster inference speed.

Ranked #751 on Image Classification on ImageNet

Image Classification, Object Detection +3

YMIR: A Rapid Data-centric Development Platform for Vision Applications

1 code implementation • 19 Nov 2021 • Phoenix X. Huang, Wenze Hu, William Brendel, Manmohan Chandraker, Li-Jia Li, Xiaoyu Wang

This paper introduces an open-source platform to support the rapid development of computer vision applications at scale.

Active Learning

Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search

no code implementations • 30 Jul 2021 • Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang

Specifically, we propose a new transformer-based network structure to compress the feature into a low-dimensional space, and an inhomogeneous neighborhood relationship preserving (INRP) loss that aims to maintain high search accuracy.

Feature Compression, Information Retrieval +2

Learning Inhomogeneous FRAME Models for Object Patterns

no code implementations • CVPR 2014 • Jianwen Xie, Wenze Hu, Song-Chun Zhu, Ying Nian Wu

We investigate an inhomogeneous version of the FRAME (Filters, Random field, And Maximum Entropy) model and apply it to modeling object patterns.

Object

Unsupervised Learning of Dictionaries of Hierarchical Compositional Models

no code implementations • CVPR 2014 • Jifeng Dai, Yi Hong, Wenze Hu, Song-Chun Zhu, Ying Nian Wu

Given a set of unannotated training images, a dictionary of such hierarchical templates is learned so that each training image can be represented by a small number of templates that are spatially translated, rotated, and scaled versions of the templates in the learned dictionary.

Domain Adaptation, Template Matching
