Search Results for author: Chuang Zhang

Found 26 papers, 14 papers with code

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

2 code implementations • 25 Apr 2024 • Haotian Yan, Ming Wu, Chuang Zhang

VWA leverages the local window attention (LWA) and disentangles LWA into the query window and context window, allowing the context's scale to vary for the query to learn representations at multiple scales.

Decoder Semantic Segmentation

110

Paper
Code

MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation

1 code implementation • 21 Mar 2024 • Longzheng Wang, Xiaohan Xu, Lei Zhang, Jiarui Lu, Yongxiu Xu, Hongbo Xu, Minghao Tang, Chuang Zhang

Automatic detection of multimodal misinformation has gained a widespread attention recently.

Data Augmentation Decision Making +5

Paper
Code

HIP Network: Historical Information Passing Network for Extrapolation Reasoning on Temporal Knowledge Graph

1 code implementation • 19 Feb 2024 • Yongquan He, Peng Zhang, Luchen Liu, Qi Liang, Wenyuan Zhang, Chuang Zhang

In recent years, temporal knowledge graph (TKG) reasoning has received significant attention.

Paper
Code

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation • 29 Jan 2024 • Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

Paper
Code

VN-Net: Vision-Numerical Fusion Graph Convolutional Network for Sparse Spatio-Temporal Meteorological Forecasting

no code implementations • 26 Jan 2024 • Yutong Xiong, Xun Zhu, Ming Wu, Weiqing Li, Fanbin Mo, Chuang Zhang, Bin Zhang

Sparse meteorological forecasting is indispensable for fine-grained weather forecasting and deserves extensive attention.

Decoder Weather Forecasting

Paper
Add Code

UAV Swarm-enabled Collaborative Secure Relay Communications with Time-domain Colluding Eavesdropper

no code implementations • 3 Oct 2023 • Chuang Zhang, Geng Sun, Qingqing Wu, Jiahui Li, Shuang Liang, Dusit Niyato, Victor C. M. Leung

Unmanned aerial vehicles (UAVs) as aerial relays are practically appealing for assisting Internet of Things (IoT) network.

Paper
Add Code

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations • 30 Sep 2023 • Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

Paper
Add Code

PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection

no code implementations • 26 Feb 2023 • Shenwei Xie, Wanfeng Zheng, Zhenglin Xian, Junli Yang, Chuang Zhang, Ming Wu

In this paper, we propose a new scheme for multi-task satellite imagery road extraction, Patch-wise Road Keypoints Detection (PaRK-Detect).

Keypoint Detection Road Segmentation +1

Paper
Add Code

Cross-modal Contrastive Learning for Multimodal Fake News Detection

1 code implementation • 25 Feb 2023 • Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Xiaohan Xu, Siqi Wang

An attention mechanism with an attention guidance module is implemented to help effectively and interpretably aggregate the aligned unimodal representations and the cross-modality correlations.

Contrastive Learning Fake News Detection +1

Paper
Code

SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines

no code implementations • 19 Jan 2023 • Shizun Wang, Weihong Zeng, Xu Wang, Hao Yang, Li Chen, Yi Yuan, Yunzhao Zeng, Min Zheng, Chuang Zhang, Ming Wu

To this end, we propose SwiftAvatar, a novel avatar auto-creation framework that is evidently superior to previous works.

Paper
Add Code

Privileged Prior Information Distillation for Image Matting

no code implementations • 25 Nov 2022 • Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang

Performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in the scenes where foregrounds are semantically ambiguous, chromaless, or high transmittance.

Image Matting

Paper
Add Code

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

1 code implementation • 10 Jul 2022 • Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4. 65% improvement on AP@0. 7 for car detection, reaching 18. 54% AP@0. 7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard.

Autonomous Driving Model Optimization +2

Paper
Code

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

no code implementations • 27 Jun 2022 • Chuang Zhang, Li Shen, Jian Yang, Chen Gong

To exploit this effect, the model prediction-based methods have been widely adopted, which aim to exploit the outputs of DNNs in the early stage of learning to correct noisy labels.

Learning with noisy labels Memorization

Paper
Add Code

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

no code implementations • 9 Jun 2022 • Xiaojun Liu, Shunan Zang, Chuang Zhang, Xiaojun Chen, Yangyang Ding

In order to solve this problem, we paraphrase the reference summaries in CLTS, the Chinese Long Text Summarization dataset, correct errors of factual inconsistencies, and propose the first Chinese Long Text Summarization dataset with a high level of abstractiveness, CLTS+, which contains more than 180K article-summary pairs and is available online.

Text Summarization

Paper
Add Code

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

no code implementations • 21 Mar 2022 • Yongliang Ding, Tao Zhou, Chuang Zhang, Yijing Luo, Juan Tang, Chen Gong

Further, by defining a new form of data centroid, we transform the recovery problem of a label-dependent part to a centroid estimation problem.

Binary Classification

Paper
Add Code

Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention

2 code implementations • 5 Jan 2022 • Haotian Yan, Chuang Zhang, Ming Wu

In this paper, we succeed in introducing multi-scale representations into semantic segmentation ViT via window attention mechanism and further improves the performance and efficiency.

Ranked #14 on Semantic Segmentation on DADA-seg

Decoder Image Classification +2

766

Paper
Code

Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

1 code implementation • 12 Dec 2021 • Lin Wan, Qianyan Jing, Zongyuan Sun, Chuang Zhang, Zhihang Li, Yehansen Chen

Much of that is due to the notorious modality bias training issue brought by the single-modality ImageNet pre-training, which might yield RGB-biased representations that severely hinder the cross-modality image retrieval.

Contrastive Learning Cross-Modality Person Re-identification +3

Paper
Code

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution

1 code implementation • 30 Nov 2021 • Shizun Wang, Ming Lu, Kaixin Chen, Jiaming Liu, Xiaoqi Li, Chuang Zhang, Ming Wu

However, existing methods mostly train the DNNs on uniformly sampled LR-HR patch pairs, which makes them fail to fully exploit informative patches within the image.

Data Augmentation Image Super-Resolution

Paper
Code

Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation

1 code implementation • ICCV 2021 • Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu

Internet video delivery has undergone a tremendous explosion of growth over the past few years.

105

Paper
Code

ConTNet: Why not use convolution and transformer at the same time?

2 code implementations • 27 Apr 2021 • Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang

It is also worth pointing that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet.

Image Classification object-detection +1

Paper
Code

Contextual Graph Reasoning Networks

no code implementations • 1 Jan 2021 • Zhaoqing Wang, Jiaming Liu, Yangyuxuan Kang, Mingming Gong, Chuang Zhang, Ming Lu, Ming Wu

Graph Reasoning has shown great potential recently in modeling long-range dependencies, which are crucial for various computer vision tasks.

2D Human Pose Estimation Instance Segmentation +4

Paper
Add Code

GINet: Graph Interaction Network for Scene Parsing

1 code implementation • ECCV 2020 • Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo

GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph.

Scene Parsing

8,288

Paper
Code

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images

no code implementations • 15 Mar 2020 • Kaiyan Chen, Ming Wu, Jiaming Liu, Chuang Zhang

To further promote the research of ship detection, we introduced a new fine-grained ship detection datasets, which is named as FGSD.

Paper
Add Code

C-DLinkNet: considering multi-level semantic features for human parsing

1 code implementation • 31 Jan 2020 • Yu Lu, Muyan Feng, Ming Wu, Chuang Zhang

Human parsing is an essential branch of semantic segmentation, which is a fine-grained semantic segmentation task to identify the constituent parts of human.

Decoder Human Parsing +2