no code implementations • 23 Apr 2024 • Yu-Jie Liang, ZiHan Cao, Liang-Jian Deng, Xiao Wu
Besides, a new decoder employing a complex Gabor wavelet activation function, called Spatial-Frequency Interactive Decoder (SFID), is invented to enhance the interaction of INR features.
no code implementations • 17 Apr 2024 • ZiHan Cao, Xiao Wu, Liang-Jian Deng
In this paper, we identify shortcomings in directly applying DPMs to the task of pansharpening as an inverse problem: 1) initiating sampling directly from Gaussian noise neglects the low-resolution multispectral image (LRMS) as a prior; 2) low sampling efficiency often necessitates a higher number of sampling steps.
no code implementations • 17 Apr 2024 • Yu Zhong, Xiao Wu, Liang-Jian Deng, ZiHan Cao
Pansharpening is a significant image fusion technique that merges the spatial content and spectral characteristics of remote sensing images to generate high-resolution multispectral images.
no code implementations • 14 Apr 2024 • ZiHan Cao, Xiao Wu, Liang-Jian Deng, Yu Zhong
However, due to the nature of images different from casual language sequences, the limited state capacity of Mamba weakens its ability to model image information.
2 code implementations • 11 Apr 2024 • Yule Duan, Xiao Wu, Haoyu Deng, Liang-Jian Deng
In this paper, we introduce a so-called content-adaptive non-local convolution (CANConv), a novel method tailored for remote sensing image pansharpening.
no code implementations • 1 Apr 2024 • Wen-Jie Shu, Hong-Xia Dou, Rui Wen, Xiao Wu, Liang-Jian Deng
In response, we present the Cross Modulation Transformer (CMT), a pioneering method that modifies the attention mechanism.
no code implementations • 8 Mar 2024 • Xiang Huang, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Wangmeng Xiang, Baigui Sun, Xiao Wu
The advancement of autonomous driving systems hinges on the ability to achieve low-latency and high-accuracy perception.
no code implementations • 22 Feb 2024 • Miaoxin Wang, Xiao Wu, Jun Lin, Zhongfeng Wang
Particularly, it demonstrates efficient support for large-kernel CNNs, achieving throughputs of 169. 68 GOPS and 244. 55 GOPS for RepLKNet-31 and PyConvResNet-50, respectively, both of which are implemented on hardware for the first time.
no code implementations • TMC 2024 • Penglin Dai, Yaorong Huang, Kaiwen Hu, Xiao Wu, Huanlai Xing, and Zhaofei Yu, Member, IEEE
The objective is to design a unified solution to minimize task execution time under different MTO scenarios.
no code implementations • 28 Oct 2023 • Hao Wang, Zhi-Qi Cheng, Jingdong Sun, Xin Yang, Xiao Wu, Hongyang Chen, Yan Yang
Multi-view or even multi-modal data is appealing yet challenging for real-world applications.
no code implementations • 16 Aug 2023 • Ji Zhang, Xiao Wu, Zhi-Qi Cheng, Qi He, Wei Li
Anomaly segmentation plays a pivotal role in identifying atypical objects in images, crucial for hazard detection in autonomous driving systems.
no code implementations • 10 Apr 2023 • ZiHan Cao, ShiQi Cao, Xiao Wu, JunMing Hou, Ran Ran, Liang-Jian Deng
Denosing diffusion model, as a generative model, has received a lot of attention in the field of image generation recently, thanks to its powerful generation capability.
no code implementations • 3 Mar 2023 • Xiaoqing Zhang, Zunjie Xiao, Xiao Wu, Yanlin Chen, Jilu Zhao, Yan Hu, Jiang Liu
Spatial attention mechanism has been widely incorporated into deep neural networks (DNNs), significantly lifting the performance in computer vision tasks via long-range dependency modeling.
1 code implementation • 13 Dec 2022 • Siran Peng, Chenhao Guo, Xiao Wu, Liang-Jian Deng
The U2Net utilizes a spatial U-Net and a spectral U-Net to extract spatial details and spectral characteristics, which allows for the discriminative and hierarchical learning of features from diverse images.
Hyperspectral Image Super-Resolution Image Super-Resolution +1
1 code implementation • CVPR 2022 • Zhi-Qi Cheng, Qi Dai, Hong Li, Jingkuan Song, Xiao Wu, Alexander G. Hauptmann
We evaluate our methods on 4 mainstream object counting networks (i. e., MCNN, CSRNet, SANet, and ResNet-50).
Ranked #1 on Object Counting on TRANCOS
no code implementations • 30 Mar 2022 • Cheng Jin, Rui-Jie Zhu, Xiao Wu, Liang-Jian Deng
Spiking Neural Networks (SNNs) have piqued researchers' interest because of their capacity to process temporal information and low power consumption.
1 code implementation • 6 May 2021 • Kun Lei, Peng Guo, Yi Wang, Xiao Wu, Wenchao Zhao
In this paper, an end-to-end deep reinforcement learning framework is proposed to solve this type of combinatorial optimization problems.
no code implementations • ICCV 2021 • Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng, Tian-Jing Zhang
In order to enhance the relationships of inter-branches, dynamic cross feature transfers are embedded into multiple branches to obtain high-resolution representations.
no code implementations • 17 Sep 2019 • Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Jun-Yan He, Alexander Hauptmann
By minimizing the mutual information, each column is guided to learn features with different image scales.
no code implementations • ICCV 2019 • Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander Hauptmann
Although the Maximum Excess over SubArrays (MESA) loss has been previously proposed to address the above issues by finding the rectangular subregion whose predicted density map has the maximum difference from the ground truth, it cannot be solved by gradient descent, thus can hardly be integrated into the deep learning framework.
Ranked #5 on Crowd Counting on WorldExpo’10
no code implementations • 24 Jun 2019 • Zhaoquan Yuan, Siyuan Sun, Lixin Duan, Xiao Wu, Changsheng Xu
In AMN, as inspired by generative adversarial networks, we propose to learn multimodal feature representations by finding a more coherent subspace for video clips and the corresponding texts (e. g., subtitles and questions).
1 code implementation • 17 May 2019 • Xiao Wu, Yi Xu, Bradley P. Carlin
In developing products for rare diseases, statistical challenges arise due to the limited number of patients available for participation in drug trials and other clinical research.
Applications Computation Methodology
1 code implementation • 17 Dec 2018 • Xiao Wu, Fabrizia Mealli, Marianthi-Anna Kioumourtzoglou, Francesca Dominici, Danielle Braun
We apply our proposed method to estimate the average causal exposure-response function between long-term PM$_{2. 5}$ exposure and all-cause mortality among 68. 5 million Medicare enrollees, 2000-2016.
Methodology Applications
no code implementations • 29 Nov 2018 • Siyu Huang, Zhi-Qi Cheng, Xi Li, Xiao Wu, Zhongfei Zhang, Alexander Hauptmann
To tackle this challenge, we present a novel pipeline comprised of an Observer Engine and a Physicist Engine by respectively imitating the actions of an observer and a physicist in the real world.
2 code implementations • CVPR 2017 • Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua
For the video side, deep visual features are extracted from detected object regions in each frame, and further fed into a Long Short-Term Memory (LSTM) framework for sequence modeling, which captures the temporal dynamics in videos.
no code implementations • 14 Apr 2018 • Zhi-Qi Cheng, Hao Zhang, Xiao Wu, Chong-Wah Ngo
A principle way of hyperlinking can be carried out by picking centers of clusters as anchors and from there reach out to targets within or outside of clusters with consideration of neighborhood complexity.
1 code implementation • 2 Dec 2017 • Xiao Wu, Danielle Braun, Marianthi-Anna Kioumourtzoglou, Christine Choirat, Qian Di, Francesca Dominici
We propose a new approach for estimating causal effects when the exposure is measured with error and confounding adjustment is performed via a generalized propensity score (GPS).
Methodology Applications
no code implementations • CVPR 2017 • Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan
We introduce a new fashion search protocol where attribute manipulation is allowed within the interaction between users and search engines, e. g. manipulating the color attribute of the clothing from red to blue.
no code implementations • 17 Apr 2017 • Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao liu, Zequn Jie, Jiashi Feng
This paper addresses a challenging problem -- how to generate multi-view cloth images from only a single view input.
no code implementations • 28 Jun 2016 • Bo Zhao, Xiao Wu, Jiashi Feng, Qiang Peng, Shuicheng Yan
Fine-grained object classification is a challenging task due to the subtle inter-class difference and large intra-class variation.