Search Results for author: Shuo Chen

Found 52 papers, 26 papers with code

Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models

no code implementations • 2 Jun 2024 • Wenqiang Sun, Zhengyi Wang, Shuo Chen, Yikai Wang, Zilong Chen, Jun Zhu, Jun Zhang

We first analyze the role of triplanes in feed-forward methods and find that the inconsistent multi-view images introduce high-frequency artifacts on triplanes, leading to low-quality 3D meshes.

Paper
Add Code

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

no code implementations • 2 Jun 2024 • Xingrui Wang, Wufei Ma, Angtian Wang, Shuo Chen, Adam Kortylewski, Alan Yuille

To demonstrate the importance of an explicit 4D dynamics representation of the scenes in understanding world dynamics, we further propose NS-4Dynamics, a Neural-Symbolic model for reasoning on 4D Dynamics properties under explicit scene representation from videos.

Paper
Add Code

A Systematic Bias of Machine Learning Regression Models and Its Correction: an Application to Imaging-based Brain Age Prediction

no code implementations • 24 May 2024 • Hwiyoung Lee, Shuo Chen

In comparison to competing machine learning models, our method effectively addresses the longstanding issue of "systematic bias of machine learning regression" in neuroimaging-based brain age calculation, yielding unbiased predictions of brain age.

regression

Paper
Add Code

Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion

1 code implementation • 18 May 2024 • Zeyu Zhang, Yiran Wang, Biao Wu, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang, Meng Fang, Ling Chen, Yang Zhao

Firstly, we proposed a novel agent-based approach named Motion Avatar, which allows for the automatic generation of high-quality customizable human and animal avatars with motions through text queries.

Paper
Code

Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories

no code implementations • 22 Apr 2024 • Ning Yang, Shuo Chen, Haijun Zhang, Randall Berry

Mobile Edge Computing (MEC) broadens the scope of computation and storage beyond the central network, incorporating edge nodes close to end devices.

Edge-computing Reinforcement Learning (RL) +1

Paper
Add Code

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

no code implementations • 4 Apr 2024 • Shuo Chen, Zhen Han, Bailan He, Zifeng Ding, Wenqian Yu, Philip Torr, Volker Tresp, Jindong Gu

Various jailbreak attacks have been proposed to red-team Large Language Models (LLMs) and revealed the vulnerable safeguards of LLMs.

Paper
Add Code

3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching

no code implementations • 1 Apr 2024 • Yibin Ye, Xichao Teng, Shuo Chen, Yijie Bian, Tao Tan, Zhang Li

Optical-SAR image matching is a fundamental task for image fusion and visual navigation.

Domain Adaptation Visual Navigation

Paper
Add Code

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

no code implementations • 8 Mar 2024 • Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu

In this work, we present the Convolutional Reconstruction Model (CRM), a high-fidelity feed-forward single image-to-3D generative model.

Image to 3D

Paper
Add Code

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models

1 code implementation • 5 Mar 2024 • Zheng Li, Xiang Li, Xinyi Fu, Xin Zhang, Weiqiang Wang, Shuo Chen, Jian Yang

To our best knowledge, we are the first to (1) perform unsupervised domain-specific prompt-driven knowledge distillation for CLIP, and (2) establish a practical pre-storing mechanism of text features as shared class vectors between teacher and student.

Ranked #1 on Prompt Engineering on Oxford-IIIT Pet Dataset

Knowledge Distillation Prompt Engineering +1

128

Paper
Code

Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images

no code implementations • 22 Feb 2024 • Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, Jindong Gu

Our research evaluates the adversarial robustness of MLLMs when employing CoT reasoning, finding that CoT marginally improves adversarial robustness against existing attack methods.

Adversarial Robustness

Paper
Add Code

Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy

1 code implementation • 21 Jan 2024 • Shuo Chen, Mao Peng, Yijin Li, Bing-Feng Ju, Hujun Bao, Yuan-Liu Chen, Guofeng Zhang

However, conventional AFM scanning struggles to reconstruct complex 3D micro-/nanostructures precisely due to limitations such as incomplete sample topography capturing and tip-sample convolution artifacts.

3D Reconstruction Surface Reconstruction

Paper
Code

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

1 code implementation • 19 Jan 2024 • Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

Within CivRealm, we provide interfaces for two typical agent types: tensor-based agents that focus on learning, and language-based agents that emphasize reasoning.

Decision Making

Paper
Code

Direct Distillation between Different Domains

no code implementations • 12 Jan 2024 • Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

Then, we build a fusion-activation mechanism to transfer the valuable domain-invariant knowledge to the student network, while simultaneously encouraging the adapter within the teacher network to learn the domain-specific knowledge of the target data.

Domain Adaptation Knowledge Distillation

Paper
Add Code

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

no code implementations • 9 Jan 2024 • Weimin WANG, Jiawei Liu, Zhijie Lin, Jiangqiao Yan, Shuo Chen, Chetwin Low, Tuyen Hoang, Jie Wu, Jun Hao Liew, Hanshu Yan, Daquan Zhou, Jiashi Feng

The growing demand for high-fidelity video generation from textual descriptions has catalyzed significant research in this field.

MORPH Video Generation

Paper
Add Code

Understanding and Improving In-Context Learning on Vision-language Models

no code implementations • 29 Nov 2023 • Shuo Chen, Zhen Han, Bailan He, Mark Buckley, Philip Torr, Volker Tresp, Jindong Gu

Our findings indicate that ICL in VLMs is predominantly driven by the textual information in the demonstrations whereas the visual information in the demonstrations barely affects the ICL performance.

In-Context Learning

Paper
Add Code

Criticality-Guided Efficient Pruning in Spiking Neural Networks Inspired by Critical Brain Hypothesis

no code implementations • 5 Nov 2023 • Shuo Chen, Boxiao Liu, Haihang You

Firstly, we propose a low-cost metric for the criticality in SNNs.

Network Pruning

Paper
Add Code

Atom-Motif Contrastive Transformer for Molecular Property Prediction

no code implementations • 11 Oct 2023 • Wentao Yu, Shuo Chen, Chen Gong, Gang Niu, Masashi Sugiyama

As motifs in a molecule are significant patterns that are of great importance for determining molecular properties (e. g., toxicity and solubility), overlooking motif interactions inevitably hinders the effectiveness of MPP.

Molecular Property Prediction Property Prediction

Paper
Add Code

Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation

no code implementations • 2 Oct 2023 • Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang

This study utilizes the intricate Avalon game as a testbed to explore LLMs' potential in deceptive environments.

Misinformation

Paper
Add Code

Creative Birds: Self-Supervised Single-View 3D Style Transfer

2 code implementations • ICCV 2023 • Renke Wang, Guimin Que, Shuo Chen, Xiang Li, Jun Li, Jian Yang

Our focus lies primarily on birds, a popular subject in 3D reconstruction, for which no existing single-view 3D transfer methods have been developed. The method we propose seeks to generate a 3D mesh shape and texture of a bird from two single-view images.

3D Reconstruction Style Transfer

Paper
Code

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

1 code implementation • 24 Jul 2023 • Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

This paper aims to provide a comprehensive survey of cutting-edge research in prompt engineering on three types of vision-language models: multimodal-to-text generation models (e. g. Flamingo), image-text matching models (e. g.

Image-text matching Language Modelling +4

283

Paper
Code

Distribution Shift Matters for Knowledge Distillation with Webly Collected Images

no code implementations • ICCV 2023 • Jialiang Tang, Shuo Chen, Gang Niu, Masashi Sugiyama, Chen Gong

Knowledge distillation aims to learn a lightweight student network from a pre-trained teacher network.

Contrastive Learning Data-free Knowledge Distillation +2

Paper
Add Code

Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach

no code implementations • 3 Jul 2023 • Shuo Chen, Ning Yang, Meng Zhang, Jun Wang

In this paper, we consider multiple users offloading tasks to heterogeneous edge servers in a MEC system.

Edge-computing

Paper
Add Code

Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation

1 code implementation • 16 Jun 2023 • Shuo Chen, Yingjun Du, Pascal Mettes, Cees G. M. Snoek

This paper investigates the problem of scene graph generation in videos with the aim of capturing semantic relations between subjects and objects in the form of $\langle$subject, predicate, object$\rangle$ triplets.

Graph Generation Meta-Learning +1

Paper
Code

On Underdamped Nesterov's Acceleration

no code implementations • 28 Apr 2023 • Shuo Chen, Bin Shi, Ya-xiang Yuan

In this paper, based on the high-resolution differential equation framework, we construct the new Lyapunov functions for the underdamped case, which is motivated by the power of the time $t^{\gamma}$ or the iteration $k^{\gamma}$ in the mixed term.

Paper
Add Code

BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation

no code implementations • 14 Mar 2023 • Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

BlinkSim consists of a configurable rendering engine and a flexible engine for event data simulation.

Event-based Optical Flow Optical Flow Estimation

Paper
Add Code

Revisiting the acceleration phenomenon via high-resolution differential equations

no code implementations • 12 Dec 2022 • Shuo Chen, Bin Shi, Ya-xiang Yuan

Furthermore, we also investigate NAG from the implicit-velocity scheme.

Vocal Bursts Intensity Prediction

Paper
Add Code

Decision-making with Speculative Opponent Models

no code implementations • 22 Nov 2022 • Jing Sun, Shuo Chen, Cong Zhang, Yining Ma, Jie Zhang

To address this issue, we introduce Distributional Opponent-aided Multi-agent Actor-Critic (DOMAC), the first speculative opponent modelling algorithm that relies solely on local information (i. e., the controlled agent's observations, actions, and rewards).

Decision Making SMAC+ +1

Paper
Add Code

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

2 code implementations • 20 Nov 2022 • Shuo Chen, Tan Yu, Ping Li

Recently, vision architectures based exclusively on multi-layer perceptrons (MLPs) have gained much attention in the computer vision community.

Ranked #1 on 3D Object Recognition on ModelNet40

3D Object Recognition Image Classification +1

Paper
Code

Introduction and Exemplars of Uncertainty Decomposition

no code implementations • 17 Nov 2022 • Shuo Chen

Uncertainty plays a crucial role in the machine learning field.

Autonomous Driving Ensemble Learning +2

Paper
Add Code

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

1 code implementation • ICCV 2023 • Weicai Ye, Shuo Chen, Chong Bao, Hujun Bao, Marc Pollefeys, Zhaopeng Cui, Guofeng Zhang

Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes.

Clustering Inverse Rendering +2

178

Paper
Code

Gradient Norm Minimization of Nesterov Acceleration: $o(1/k^3)$

no code implementations • 19 Sep 2022 • Shuo Chen, Bin Shi, Ya-xiang Yuan

In the history of first-order algorithms, Nesterov's accelerated gradient descent (NAG) is one of the milestones.

Open-Ended Question Answering

Paper
Add Code

Segmentation-guided Domain Adaptation and Data Harmonization of Multi-device Retinal Optical Coherence Tomography using Cycle-Consistent Generative Adversarial Networks

no code implementations • 31 Aug 2022 • Shuo Chen, Da Ma, Sieun Lee, Timothy T. L. Yu, Gavin Xu, Donghuan Lu, Karteek Popuri, Myeong Jin Ju, Marinko V. Sarunic, Mirza Faisal Beg

Optical Coherence Tomography(OCT) is a non-invasive technique capturing cross-sectional area of the retina in micro-meter resolutions.

Domain Adaptation Hallucination +1

Paper
Add Code

Higher-order accurate two-sample network inference and network hashing

1 code implementation • 16 Aug 2022 • Meijia Shao, Dong Xia, Yuan Zhang, Qiong Wu, Shuo Chen

Two-sample hypothesis testing for network comparison presents many significant challenges, including: leveraging repeated network observations and known node registration, but without requiring them to operate; relaxing strong structural assumptions; achieving finite-sample higher-order accuracy; handling different network sizes and sparsity levels; fast computation and memory parsimony; controlling false discovery rate (FDR) in multiple testing; and theoretical understandings, particularly regarding finite-sample accuracy and minimax optimality.

Vocal Bursts Valence Prediction

Paper
Code

ForecastTKGQuestions: A Benchmark for Temporal Question Answering and Forecasting over Temporal Knowledge Graphs

1 code implementation • 12 Aug 2022 • Zifeng Ding, Zongyue Li, Ruoxia Qi, Jingpei Wu, Bailan He, Yunpu Ma, Zhao Meng, Shuo Chen, Ruotong Liao, Zhen Han, Volker Tresp

To this end, we propose ForecastTKGQA, a TKGQA model that employs a TKG forecasting module for future inference, to answer all three types of questions.

Knowledge Graphs Question Answering

Paper
Code

PVO: Panoptic Visual Odometry

1 code implementation • CVPR 2023 • Weicai Ye, Xinyue Lan, Shuo Chen, Yuhang Ming, Xingyuan Yu, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

We present PVO, a novel panoptic visual odometry framework to achieve more comprehensive modeling of the scene motion, geometry, and panoptic segmentation information.

Optical Flow Estimation Pose Estimation +3

198

Paper
Code

Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation

1 code implementation • CVPR 2022 • Jinchao Yang, Fei Guo, Shuo Chen, Jun Li, Jian Yang

Given a source product, a target product, and an art style image, our method produces a neural warping field that warps the source shape to imitate the geometric style of the target and a neural texture transformation network that transfers the artistic style to the warped source product.

Style Transfer

Paper
Code

MVT: Multi-view Vision Transformer for 3D Object Recognition

2 code implementations • 25 Oct 2021 • Shuo Chen, Tan Yu, Ping Li

Nevertheless, multi-view CNN models cannot model the communications between patches from different views, limiting its effectiveness in 3D object recognition.

Ranked #2 on 3D Object Recognition on ModelNet40

3D Object Recognition Inductive Bias +1

Paper
Code

Diagnosing Errors in Video Relation Detectors

1 code implementation • 25 Oct 2021 • Shuo Chen, Pascal Mettes, Cees G. M. Snoek

Video relation detection forms a new and challenging problem in computer vision, where subjects and objects need to be localized spatio-temporally and a predicate label needs to be assigned if and only if there is an interaction between the two.

Action Localization Object +3

Paper
Code

Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution

no code implementations • NeurIPS 2021 • Guangpin Tao, Xiaozhong Ji, Wenzhuo Wang, Shuo Chen, Chuming Lin, Yun Cao, Tong Lu, Donghao Luo, Ying Tai

In this paper, we propose a novel blind SR framework to super-resolve LR images degraded by arbitrary blur kernel with accurate kernel estimation in frequency domain.

Image Super-Resolution Translation

Paper
Add Code

Can We Leverage Predictive Uncertainty to Detect Dataset Shift and Adversarial Examples in Android Malware Detection?

1 code implementation • 20 Sep 2021 • Deqiang Li, Tian Qiu, Shuo Chen, Qianmu Li, Shouhuai Xu

Our main findings are: (i) predictive uncertainty indeed helps achieve reliable malware detection in the presence of dataset shift, but cannot cope with adversarial evasion attacks; (ii) approximate Bayesian methods are promising to calibrate and generalize malware detectors to deal with dataset shift, but cannot cope with adversarial evasion attacks; (iii) adversarial evasion attacks can render calibration methods useless, and it is an open problem to quantify the uncertainty associated with the predicted labels of adversarial examples (i. e., it is not effective to use predictive uncertainty to detect adversarial examples).

Android Malware Detection Malware Detection

Paper
Code

Social Fabric: Tubelet Compositions for Video Relation Detection

1 code implementation • ICCV 2021 • Shuo Chen, Zenglin Shi, Pascal Mettes, Cees G. M. Snoek

We also propose Social Fabric: an encoding that represents a pair of object tubelets as a composition of interaction primitives.

Ranked #1 on Video Visual Relation Detection on VidOR

Object Relation +2

Paper
Code

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model

1 code implementation • NeurIPS 2021 • Jiangning Zhang, Chao Xu, Jian Li, Wenzhou Chen, Yabiao Wang, Ying Tai, Shuo Chen, Chengjie Wang, Feiyue Huang, Yong liu

Inspired by biological evolution, we explain the rationality of Vision Transformer by analogy with the proven practical Evolutionary Algorithm (EA) and derive that both of them have consistent mathematical representation.

Image Retrieval Retrieval

Paper
Code

Contrastive Embedding for Generalized Zero-Shot Learning

3 code implementations • CVPR 2021 • Zongyan Han, ZhenYong Fu, Shuo Chen, Jian Yang

To tackle this issue, we propose to integrate the generation model with the embedding model, yielding a hybrid GZSL framework.

Generalized Zero-Shot Learning

Paper
Code

DR-TANet: Dynamic Receptive Temporal Attention Network for Street Scene Change Detection

1 code implementation • 1 Mar 2021 • Shuo Chen, Kailun Yang, Rainer Stiefelhagen

Street scene change detection continues to capture researchers' interests in the computer vision community.

Autonomous Vehicles Change Detection +3

Paper
Code

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection

7 code implementations • NeurIPS 2020 • Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang

Specifically, we merge the quality estimation into the class prediction vector to form a joint representation of localization quality and classification, and use a vector to represent arbitrary distribution of box locations.

Ranked #104 on Object Detection on COCO test-dev

Dense Object Detection General Classification

28,222

Paper
Code

Neural Architecture Search for Compressed Sensing Magnetic Resonance Image Reconstruction

1 code implementation • 22 Feb 2020 • Jiangpeng Yan, Shuo Chen, Yongbing Zhang, Xiu Li

Our proposed method can reach a better trade-off between computation cost and reconstruction performance for MR reconstruction problem with good generalizability and offer insights to design neural networks for other medical image applications.

Image Reconstruction Neural Architecture Search +1

Paper
Code

Curvilinear Distance Metric Learning

1 code implementation • NeurIPS 2019 • Shuo Chen, Lei Luo, Jian Yang, Chen Gong, Jun Li, Heng Huang

To address this issue, we first reveal that the traditional linear distance metric is equivalent to the cumulative arc length between the data pair's nearest points on the learned straight measurer lines.

Metric Learning

Paper
Code

GANSynth: Adversarial Neural Audio Synthesis

6 code implementations • ICLR 2019 • Jesse Engel, Kumar Krishna Agrawal, Shuo Chen, Ishaan Gulrajani, Chris Donahue, Adam Roberts

Efficient audio synthesis is an inherently difficult machine learning task, as human perception is sensitive to both global structure and fine-scale waveform coherence.

Audio Generation Audio Synthesis

18,971

Paper
Code

Formal Specification and Verification of Smart Contracts for Azure Blockchain

1 code implementation • 20 Dec 2018 • Shuvendu K. Lahiri, Shuo Chen, Yuepeng Wang, Isil Dillig

In this paper, we describe the formal verification of Smart Contracts offered as part of the Azure Blockchain Content and Samples on github.

Programming Languages F.3.1

244

Paper
Code

Adversarial Metric Learning

no code implementations • 9 Feb 2018 • Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li

In distinguishment stage, a metric is exhaustively learned to try its best to distinguish both the adversarial pairs and the original training pairs.

Metric Learning

Paper
Add Code

Understanding the Disharmony between Dropout and Batch Normalization by Variance Shift

4 code implementations • CVPR 2019 • Xiang Li, Shuo Chen, Xiaolin Hu, Jian Yang

Theoretically, we find that Dropout would shift the variance of a specific neural unit when we transfer the state of that network from train to test.

Paper
Code

Deep Multi-Species Embedding

no code implementations • 28 Sep 2016 • Di Chen, Yexiang Xue, Shuo Chen, Daniel Fink, Carla Gomes

Additionally, we demonstrate the benefit of using a deep neural network to extract features within the embedding and show how they improve the predictive performance of species distribution modelling.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.