Search Results for author: Jun Xiao

Found 92 papers, 36 papers with code

ECNU_ICA at SemEval-2022 Task 10: A Simple and Unified Model for Monolingual and Crosslingual Structured Sentiment Analysis

no code implementations • SemEval (NAACL) 2022 • Qi Zhang, Jie zhou, Qin Chen, Qingchun Bai, Jun Xiao, Liang He

The task aims to extract the structured sentiment information (e. g., holder, target, expression and sentiment polarity) in a text.

Event Extraction Sentiment Analysis

Paper
Add Code

De-Biased Court's View Generation with Causality

no code implementations • EMNLP 2020 • Yiquan Wu, Kun Kuang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Jun Xiao, Yueting Zhuang, Luo Si, Fei Wu

Court{'}s view generation is a novel but essential task for legal AI, aiming at improving the interpretability of judgment prediction results and enabling automatic legal document generation.

counterfactual Text Generation

Paper
Add Code

Neural Interaction Energy for Multi-Agent Trajectory Prediction

no code implementations • 25 Apr 2024 • Kaixin Shen, Ruijie Quan, Linchao Zhu, Jun Xiao, Yi Yang

In this study, we introduce a framework called Multi-Agent Trajectory prediction via neural interaction Energy (MATE).

Trajectory Prediction

Paper
Add Code

AudioScenic: Audio-Driven Video Scene Editing

no code implementations • 25 Apr 2024 • Kaixin Shen, Ruijie Quan, Linchao Zhu, Jun Xiao, Yi Yang

AudioScenic exploits the inherent properties of audio, namely, audio magnitude and frequency, to guide the editing process, aiming to control the temporal dynamics and enhance the temporal consistency.

Paper
Add Code

Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer

no code implementations • 24 Apr 2024 • JiaMing Lei, Lin Li, Chunping Wang, Jun Xiao, Long Chen

This complex task usually involves three steps: verb recognition, semantic role grounding, and noun recognition.

Grounded Situation Recognition Scene Understanding

Paper
Add Code

Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration

no code implementations • 21 Mar 2024 • Zhihao Wang, Yulin Zhou, Ningyu Zhang, Xiaosong Yang, Jun Xiao, Zhao Wang

We believe our work could provide a novel perspective to consider the uncertainty quality for the general motion prediction task and encourage the studies in this field.

Decoder Human motion prediction +1

Paper
Add Code

Distributionally Generative Augmentation for Fair Facial Attribute Classification

1 code implementation • 11 Mar 2024 • Fengda Zhang, Qianpei He, Kun Kuang, Jiashuo Liu, Long Chen, Chao Wu, Jun Xiao, Hanwang Zhang

This work proposes a novel, generation-based two-stage framework to train a fair FAC model on biased data without additional annotation.

Attribute Classification +2

Paper
Code

Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models

1 code implementation • 23 Feb 2024 • Shunyu Liu, Jie zhou, Qunxi Zhu, Qin Chen, Qingchun Bai, Jun Xiao, Liang He

Aspect-Based Sentiment Analysis (ABSA) stands as a crucial task in predicting the sentiment polarity associated with identified aspects within text.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Code

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

no code implementations • 23 Jan 2024 • Kexin Li, Tao Jiang, Zongxin Yang, Yi Yang, Yueting Zhuang, Jun Xiao

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction.

Interactive Video Object Segmentation Semantic Segmentation +1

Paper
Add Code

DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism

no code implementations • 25 Nov 2023 • Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen

The denoising process involves the explicit predictions of edit operations and corresponding content words, refining reference captions through iterative step-wise editing.

Caption Generation Denoising +1

Paper
Add Code

Compositional Zero-shot Learning via Progressive Language-based Observations

no code implementations • 23 Nov 2023 • Lin Li, Guikun Chen, Jun Xiao, Long Chen

Compositional zero-shot learning aims to recognize unseen state-object compositions by leveraging known primitives (state and object) during training.

Compositional Zero-Shot Learning

Paper
Add Code

Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation

no code implementations • ICCV 2023 • Yuxi Wang, Jian Liang, Jun Xiao, Shuqi Mei, Yuran Yang, Zhaoxiang Zhang

One-shot domain adaptation methods attempt to overcome these challenges by transferring the pre-trained source model to the target domain using only one target data.

Domain Adaptation Semantic Segmentation +1

Paper
Add Code

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation

1 code implementation • 18 Sep 2023 • Kexin Li, Zongxin Yang, Lei Chen, Yi Yang, Jun Xiao

However, existing methods exhibit two limitations: 1) they address video temporal features and audio-visual interactive features separately, disregarding the inherent spatial-temporal dependence of combined audio and video, and 2) they inadequately introduce audio constraints and object-level information during the decoding stage, resulting in segmentation outcomes that fail to comply with audio directives.

Video Segmentation Video Semantic Segmentation

Paper
Code

Compositional Feature Augmentation for Unbiased Scene Graph Generation

1 code implementation • ICCV 2023 • Lin Li, Guikun Chen, Jun Xiao, Yi Yang, Chunping Wang, Long Chen

Specifically, we first decompose each relation triplet feature into two components: intrinsic feature and extrinsic feature, which correspond to the intrinsic characteristics and extrinsic contexts of a relation triplet, respectively.

Graph Generation Relation +1

Paper
Code

Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation

no code implementations • 30 Jul 2023 • Wenqing Wang, Kaifeng Gao, Yawei Luo, Tao Jiang, Fei Gao, Jian Shao, Jianwen Sun, Jun Xiao

Video-based scene graph generation (VidSGG) is an approach that aims to represent video content in a dynamic graph by identifying visual entities and their relationships.

Graph Generation Missing Labels +2

Paper
Add Code

Improved Neural Radiance Fields Using Pseudo-depth and Fusion

no code implementations • 27 Jul 2023 • Jingliang Li, Qiang Zhou, Chaohui Yu, Zhengda Lu, Jun Xiao, Zhibin Wang, Fan Wang

To make the constructed volumes as close as possible to the surfaces of objects in the scene and the rendered depth more accurate, we propose to perform depth prediction and radiance field reconstruction simultaneously.

Depth Estimation Depth Prediction +1

Paper
Add Code

Improving Reference-based Distinctive Image Captioning with Contrastive Rewards

no code implementations • 25 Jun 2023 • Yangjun Mao, Jun Xiao, Dong Zhang, Meng Cao, Jian Shao, Yueting Zhuang, Long Chen

A recent DIC method proposes to generate distinctive captions by comparing the target image with a set of semantic-similar reference images, i. e., reference-based DIC (Ref-DIC).

Benchmarking Contrastive Learning +1

Paper
Add Code

Counterfactual Co-occurring Learning for Bias Mitigation in Weakly-supervised Object Localization

no code implementations • 24 May 2023 • Feifei Shao, Yawei Luo, Lei Chen, Ping Liu, Wei Yang, Yi Yang, Jun Xiao

In this paper, we conduct a thorough causal analysis to investigate the origins of biased activation.

Attribute counterfactual +1

Paper
Add Code

Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models

1 code implementation • NeurIPS 2023 • Lin Li, Jun Xiao, Guikun Chen, Jian Shao, Yueting Zhuang, Long Chen

To dynamically fuse different cues, we further introduce a chain-of-thought method that prompts LLMs to generate reasonable weights for different visual cues.

Relation

Paper
Code

TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding

no code implementations • 19 May 2023 • Chenchi Zhang, Jun Xiao, Lei Chen, Jian Shao, Long Chen

In this paper, we argue that their poor interpretability is attributed to the holistic prompt generation and inference process.

Sentence Visual Grounding

Paper
Add Code

Generalized Universal Domain Adaptation with Generative Flow Networks

no code implementations • 8 May 2023 • Didi Zhu, Yinchuan Li, Yunfeng Shao, Jianye Hao, Fei Wu, Kun Kuang, Jun Xiao, Chao Wu

We introduce a new problem in unsupervised domain adaptation, termed as Generalized Universal Domain Adaptation (GUDA), which aims to achieve precise prediction of all target labels including unknown categories.

Universal Domain Adaptation Unsupervised Domain Adaptation

Paper
Add Code

Taking A Closer Look at Visual Relation: Unbiased Video Scene Graph Generation with Decoupled Label Learning

1 code implementation • 23 Mar 2023 • Wenqing Wang, Yawei Luo, Zhiqing Chen, Tao Jiang, Lei Chen, Yi Yang, Jun Xiao

Specifically, DLL decouples the predicate labels and adopts separate classifiers to learn actional and spatial patterns respectively.

Ranked #1 on Video scene graph generation on ImageNet-VidVRD

Graph Generation Scene Graph Generation +1

Paper
Code

Decomposed Prototype Learning for Few-Shot Scene Graph Generation

no code implementations • 20 Mar 2023 • Xingchen Li, Long Chen, Guikun Chen, Yinfu Feng, Yi Yang, Jun Xiao

To this end, we propose a novel Decomposed Prototype Learning (DPL).

Few-Shot Learning Graph Generation +1

Paper
Add Code

Learning Combinatorial Prompts for Universal Controllable Image Captioning

no code implementations • 11 Mar 2023 • Zhen Wang, Jun Xiao, Yueting Zhuang, Fei Gao, Jian Shao, Long Chen

To this end, we propose a novel prompt-based framework for CIC by learning Combinatorial Prompts, dubbed as ComPro.

controllable image captioning Language Modelling +1

Paper
Add Code

Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection

1 code implementation • 1 Feb 2023 • Kaifeng Gao, Long Chen, Hanwang Zhang, Jun Xiao, Qianru Sun

Without bells and whistles, our RePro achieves a new state-of-the-art performance on two VidVRD benchmarks of not only the base training object and predicate categories, but also the unseen ones.

Object Relation +1

Paper
Code

Knowledge-guided Causal Intervention for Weakly-supervised Object Localization

1 code implementation • 3 Jan 2023 • Feifei Shao, Yawei Luo, Fei Gao, Yi Yang, Jun Xiao

Previous weakly-supervised object localization (WSOL) methods aim to expand activation map discriminative areas to cover the whole objects, yet neglect two inherent challenges when relying solely on image-level labels.

Knowledge Distillation Object +1

Paper
Code

SSF: Accelerating Training of Spiking Neural Networks with Stabilized Spiking Flow

no code implementations • ICCV 2023 • Jingtao Wang, Zengjie Song, Yuxi Wang, Jun Xiao, Yuran Yang, Shuqi Mei, Zhaoxiang Zhang

Surrogate gradient (SG) is one of the most effective approaches for training spiking neural networks (SNNs).

Paper
Add Code

VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation

1 code implementation • CVPR 2023 • Bingchen Yang, Haiyong Jiang, Hao Pan, Jun Xiao

To overcome these issues, we propose to fully utilize the regular elements in vector floorplans for more integral segmentation.

Graph Attention Segmentation +2

Paper
Code

Bit-Shrinking: Limiting Instantaneous Sharpness for Improving Post-Training Quantization

no code implementations • CVPR 2023 • Chen Lin, Bo Peng, Zheyang Li, Wenming Tan, Ye Ren, Jun Xiao, ShiLiang Pu

To this end, we detach a sharpness term from the loss which reflects the impact of quantization noise.

Quantization

Paper
Add Code

SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization

1 code implementation • NIPS 2022 • Zheng Chuanyang, Zheyang Li, Kai Zhang, Zhi Yang, Wenming Tan, Jun Xiao, Ye Ren, ShiLiang Pu

In this paper, we introduce joint importance, which integrates essential structural-aware interactions between components for the first time, to perform collaborative pruning.

object-detection Object Detection

Paper
Code

DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis

no code implementations • 13 Aug 2022 • Jingliang Li, Zhengda Lu, Yiqun Wang, Ying Wang, Jun Xiao

To mine the information in probability volume, we creatively synthesize the source depths by splattering the probability volume and depth hypotheses to source views.

Paper
Add Code

Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation

no code implementations • 7 Aug 2022 • Lin Li, Long Chen, Hanrong Shi, Wenxiao Wang, Jian Shao, Yi Yang, Jun Xiao

To this end, we propose a novel model-agnostic Label Semantic Knowledge Distillation (LS-KD) for unbiased SGG.

Graph Generation Knowledge Distillation +3

Paper
Add Code

Deep Progressive Feature Aggregation Network for High Dynamic Range Imaging

no code implementations • 4 Aug 2022 • Jun Xiao, Qian Ye, Tianshan Liu, Cong Zhang, Kin-Man Lam

The primary challenges are ghosting artifacts caused by object motion between low dynamic range images and distorted content in under and overexposed regions.

Vocal Bursts Intensity Prediction

Paper
Add Code

Online Video Super-Resolution with Convolutional Kernel Bypass Graft

no code implementations • 4 Aug 2022 • Jun Xiao, Xinyang Jiang, Ningxin Zheng, Huan Yang, Yifan Yang, Yuqing Yang, Dongsheng Li, Kin-Man Lam

Then, our proposed CKBG method enhances this lightweight base model by bypassing the original network with ``kernel grafts'', which are extra convolutional kernels containing the prior knowledge of external pretrained image SR models.

Transfer Learning Video Super-Resolution

Paper
Add Code

Rethinking the Evaluation of Unbiased Scene Graph Generation

no code implementations • 3 Aug 2022 • Xingchen Li, Long Chen, Jian Shao, Shaoning Xiao, Songyang Zhang, Jun Xiao

Current Scene Graph Generation (SGG) methods tend to predict frequent predicate categories and fail to recognize rare ones due to the severe imbalanced distribution of predicates.

Graph Generation Unbiased Scene Graph Generation

Paper
Add Code

Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation

1 code implementation • 3 Aug 2022 • Xingchen Li, Long Chen, Wenbo Ma, Yi Yang, Jun Xiao

However, we argue that most existing WSSGG works only focus on object-consistency, which means the grounded regions should have the same object category label as text entities.

Graph Generation Object +1

Paper
Code

Unified Normalization for Accelerating and Stabilizing Transformers

1 code implementation • 2 Aug 2022 • Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, ShiLiang Pu

To tackle these issues, we propose Unified Normalization (UN), which can speed up the inference by being fused with other linear operations and achieve comparable performance on par with LN.

Paper
Code

NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation

no code implementations • 27 Jul 2022 • Lin Li, Long Chen, Hanrong Shi, Hanwang Zhang, Yi Yang, Wei Liu, Jun Xiao

To this end, we propose a novel NoIsy label CorrEction and Sample Training strategy for SGG: NICEST.

Graph Generation Knowledge Distillation +1

Paper
Add Code

Rethinking the Reference-based Distinctive Image Captioning

1 code implementation • 22 Jul 2022 • Yangjun Mao, Long Chen, Zhihong Jiang, Dong Zhang, Zhimeng Zhang, Jian Shao, Jun Xiao

Unfortunately, reference images used by existing Ref-DIC works are easy to distinguish: these reference images only resemble the target image at scene-level and have few common objects, such that a Ref-DIC model can trivially generate distinctive captions even without considering the reference images.

Attribute Benchmarking +1

Paper
Code

Explicit Image Caption Editing

1 code implementation • 20 Jul 2022 • Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao

Given an image and a reference caption, the image caption editing task aims to correct the misalignment errors and generate a refined caption.

Sentence

Paper
Code

Rethinking Data Augmentation for Robust Visual Question Answering

1 code implementation • 18 Jul 2022 • Long Chen, Yuhang Zheng, Jun Xiao

Unfortunately, to guarantee augmented samples have reasonable ground-truth answers, they manually design a set of heuristic rules for several question types, which extremely limits its generalization abilities.

Data Augmentation Knowledge Distillation +2

Paper
Code

Learning Regularized Multi-Scale Feature Flow for High Dynamic Range Imaging

no code implementations • 6 Jul 2022 • Qian Ye, Masanori Suganuma, Jun Xiao, Takayuki Okatani

Reconstructing ghosting-free high dynamic range (HDR) images of dynamic scenes from a set of multi-exposure images is a challenging task, especially with large object motion and occlusions, leading to visible artifacts using existing methods.

Vocal Bursts Intensity Prediction

Paper
Add Code

The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation

1 code implementation • CVPR 2022 • Lin Li, Long Chen, Yifeng Huang, Zhimeng Zhang, Songyang Zhang, Jun Xiao

Then, in Pos-NSD, we use a clustering-based algorithm to divide all positive samples into multiple sets, and treat the samples in the noisiest set as noisy positive samples.

Graph Generation Out-of-Distribution Detection +2

Paper
Code

A Knowledge-Enhanced Adversarial Model for Cross-lingual Structured Sentiment Analysis

no code implementations • 31 May 2022 • Qi Zhang, Jie zhou, Qin Chen, Qingchun Bai, Jun Xiao, Liang He

Notably, we propose a Knowledge-Enhanced Adversarial Model (\texttt{KEAM}) with both implicit distributed and explicit structural knowledge to enhance the cross-lingual transfer.

Cross-Lingual Transfer Sentiment Analysis

Paper
Add Code

Rethinking Multi-Modal Alignment in Video Question Answering from Feature and Sample Perspectives

no code implementations • 25 Apr 2022 • Shaoning Xiao, Long Chen, Kaifeng Gao, Zhao Wang, Yi Yang, Zhimeng Zhang, Jun Xiao

From the view of feature, we break down the video into trajectories and first leverage trajectory feature in VideoQA to enhance the alignment between two modalities.

Question Answering Video Question Answering

Paper
Add Code

Bidirectional Self-Training with Multiple Anisotropic Prototypes for Domain Adaptive Semantic Segmentation

1 code implementation • 16 Apr 2022 • Yulei Lu, Yawei Luo, Li Zhang, Zheyang Li, Yi Yang, Jun Xiao

A thriving trend for domain adaptive segmentation endeavors to generate the high-quality pseudo labels for target domain and retrain the segmentor on them.

Ranked #12 on Unsupervised Domain Adaptation on GTAV-to-Cityscapes Labels

Pseudo Label Semantic Segmentation +2

Paper
Code

DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts

no code implementations • 22 Mar 2022 • Yidi Li, Yiqun Wang, Zhengda Lu, Jun Xiao

Limited by the computational efficiency and accuracy, generating complex 3D scenes remains a challenging problem for existing generation networks.

Computational Efficiency

Paper
Add Code

Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning

no code implementations • 25 Feb 2022 • Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao

To deploy SSDR-AL in a more practical scenario, we design a noise-aware iterative labeling strategy to confront the "noisy annotation" problem introduced by the previous "dominant labeling" strategy in superpoints.

Active Learning Semantic Segmentation

Paper
Add Code

ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation

1 code implementation • 29 Dec 2021 • Chuanqing Zhuang, Zhengda Lu, Yiqun Wang, Jun Xiao, Ying Wang

Depth estimation is a crucial step for 3D reconstruction with panorama images in recent years.

Ranked #5 on Depth Estimation on Stanford2D3D Panoramic

3D Reconstruction Depth Estimation

Paper
Code

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

1 code implementation • CVPR 2022 • Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao

To this end, we propose a new classification-then-grounding framework for VidSGG, which can avoid all the three overlooked drawbacks.

Predicate Classification

Paper
Code

Relational Graph Learning for Grounded Video Description Generation

no code implementations • 2 Dec 2021 • Wenqiao Zhang, Xin Eric Wang, Siliang Tang, Haizhou Shi, Haocheng Shi, Jun Xiao, Yueting Zhuang, William Yang Wang

Such a setting can help explain the decisions of captioning models and prevents the model from hallucinating object words in its description.

Graph Learning Hallucination +2

Paper
Add Code

Consensus Graph Representation Learning for Better Grounded Image Captioning

no code implementations • 2 Dec 2021 • Wenqiao Zhang, Haochen Shi, Siliang Tang, Jun Xiao, Qiang Yu, Yueting Zhuang

The contemporary visual captioning models frequently hallucinate objects that are not actually in a scene, due to the visual misclassification or over-reliance on priors that resulting in the semantic inconsistency between the visual information and the target lexical words.

Graph Representation Learning Hallucination +1

Paper
Add Code

Unified Group Fairness on Federated Learning

no code implementations • 9 Nov 2021 • Fengda Zhang, Kun Kuang, Yuxuan Liu, Long Chen, Chao Wu, Fei Wu, Jiaxun Lu, Yunfeng Shao, Jun Xiao

We validate the advantages of the FMDA-M algorithm with various kinds of distribution shift settings in experiments, and the results show that FMDA-M algorithm outperforms the existing fair FL algorithms on unified group fairness.

Attribute Fairness +1

Paper
Add Code

Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering

1 code implementation • 3 Oct 2021 • Long Chen, Yuhang Zheng, Yulei Niu, Hanwang Zhang, Jun Xiao

Specifically, CSST is composed of two parts: Counterfactual Samples Synthesizing (CSS) and Counterfactual Samples Training (CST).

counterfactual Question Answering +1

Paper
Code

Natural Language Video Localization with Learnable Moment Proposals

1 code implementation • EMNLP 2021 • Shaoning Xiao, Long Chen, Jian Shao, Yueting Zhuang, Jun Xiao

Given an untrimmed video and a natural language query, Natural Language Video Localization (NLVL) aims to identify the video moment described by the query.

Paper
Code

Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation

no code implementations • 3 Sep 2021 • Jiahui Li, Kun Kuang, Lin Li, Long Chen, Songyang Zhang, Jian Shao, Jun Xiao

Deep neural networks have demonstrated remarkable performance in many data-driven and prediction-oriented applications, and sometimes even perform better than humans.

Medical Diagnosis

Paper
Add Code

Single-Image Specular Highlight Removal via Real-World Dataset Construction

1 code implementation • TMM 2021 • Zhongqi Wu, Chuanqing Zhuang, Jian Shi, Jianwei Guo, Jun Xiao, Xiaopeng Zhang, Dong-Ming Yan

Specular reflections pose great challenges on various multimedia and computer vision tasks, e. g. , image segmentation, detection and matching.

Generative Adversarial Network Highlight Detection +3

Paper
Code

Progressive and Selective Fusion Network for High Dynamic Range Imaging

1 code implementation • 19 Aug 2021 • Qian Ye, Jun Xiao, Kin-Man Lam, Takayuki Okatani

We propose a novel method that can better fuse the features based on two ideas.

Image Generation Vocal Bursts Intensity Prediction

Paper
Code

Video Relation Detection via Tracklet based Visual Transformer

1 code implementation • 19 Aug 2021 • Kaifeng Gao, Long Chen, Yifeng Huang, Jun Xiao

Video Visual Relation Detection (VidVRD), has received significant attention of our community over recent years.

Decoder Relation +1

Paper
Code

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning

no code implementations • 1 Jun 2021 • Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao

Specifically, Shapley Value and its desired properties are leveraged in deep MARL to credit any combinations of agents, which grants us the capability to estimate the individual credit for each agent.

counterfactual Multi-agent Reinforcement Learning +4

Paper
Add Code

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

no code implementations • 26 May 2021 • Feifei Shao, Long Chen, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye, Yueting Zhuang, Jun Xiao

With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention.

Object object-detection +2

Paper
Add Code

VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching

no code implementations • 12 May 2021 • Chenchi Zhang, Wenbo Ma, Jun Xiao, Hanwang Zhang, Jian Shao, Yueting Zhuang, Long Chen

In this paper, we argue that these methods overlook an obvious \emph{mismatch} between the roles of proposals in the two stages: they generate proposals solely based on the detection confidence (i. e., query-agnostic), hoping that the proposals contain all instances mentioned in the text query (i. e., query-aware).

Image-text matching Referring Expression +2

Paper
Add Code

Improving Weakly-supervised Object Localization via Causal Intervention

1 code implementation • 21 Apr 2021 • Feifei Shao, Yawei Luo, Li Zhang, Lu Ye, Siliang Tang, Yi Yang, Jun Xiao

The recent emerged weakly supervised object localization (WSOL) methods can learn to localize an object in the image only using image-level labels.

Object Weakly-Supervised Object Localization

Paper
Code

Efficient Ring-topology Decentralized Federated Learning with Deep Generative Models for Industrial Artificial Intelligent

no code implementations • 15 Apr 2021 • Zhao Wang, Yifan Hu, Jun Xiao, Chao Wu

A novel ring FL topology as well as a map-reduce based synchronizing method are designed in the proposed RDFL to improve decentralized FL performance and bandwidth utilization.

Federated Learning

Paper
Add Code

Human-like Controllable Image Captioning with Verb-specific Semantic Roles

1 code implementation • CVPR 2021 • Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu

However, we argue that almost all existing objective control signals have overlooked two indispensable characteristics of an ideal control signal: 1) Event-compatible: all visual contents referred to in a single sentence should be compatible with the described activity.

Caption Generation controllable image captioning +3

Paper
Code

Boundary Proposal Network for Two-Stage Natural Language Video Localization

no code implementations • 15 Mar 2021 • Shaoning Xiao, Long Chen, Songyang Zhang, Wei Ji, Jian Shao, Lu Ye, Jun Xiao

State-of-the-art NLVL methods are almost in one-stage fashion, which can be typically grouped into two categories: 1) anchor-based approach: it first pre-defines a series of video segment candidates (e. g., by sliding window), and then does classification for each candidate; 2) anchor-free approach: it directly predicts the probabilities for each video frame as a boundary or intermediate frame inside the positive segment.

Vocal Bursts Valence Prediction

Paper
Add Code

Kinetic Energy Distribution of Fragments for Thermal Neutron-Induced $^{235}$U and $^{239}$Pu Fission Reactions

no code implementations • 24 Dec 2020 • Xiaojun Sun, Haiyuan Peng, Liying Xie, Kai Zhang, Yan Liang, Yinlu Han, Nengchuan Su, Jie Yan, Jun Xiao, Junjie Sun

(2) Every complementary pair of the primary fission fragments is approximatively described as two ellipsoids with large deformation at scission moment.

Nuclear Theory

Paper
Add Code

ROBY: Evaluating the Robustness of a Deep Model by its Decision Boundaries

no code implementations • 18 Dec 2020 • Jinyin Chen, Zhen Wang, Haibin Zheng, Jun Xiao, Zhaoyan Ming

This work proposes a generic evaluation metric ROBY, a novel attack-independent robustness measure based on the model's decision boundaries.

Paper
Add Code

GFL: A Decentralized Federated Learning Framework Based On Blockchain

no code implementations • 21 Oct 2020 • Yifan Hu, YuHang Zhou, Jun Xiao, Chao Wu

Federated learning(FL) is a rapidly growing field and many centralized and decentralized FL frameworks have been proposed.

Data Poisoning Federated Learning

Paper
Add Code

Federated Unsupervised Representation Learning

no code implementations • 18 Oct 2020 • Fengda Zhang, Kun Kuang, Zhaoyang You, Tao Shen, Jun Xiao, Yin Zhang, Chao Wu, Yueting Zhuang, Xiaolin Li

FURL poses two new challenges: (1) data distribution shift (Non-IID distribution) among clients would make local models focus on different categories, leading to the inconsistency of representation spaces.

Federated Learning Representation Learning

Paper
Add Code

Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding

1 code implementation • 3 Sep 2020 • Long Chen, Wenbo Ma, Jun Xiao, Hanwang Zhang, Shih-Fu Chang

The prevailing framework for solving referring expression grounding is based on a two-stage process: 1) detecting proposals with an object detector and 2) grounding the referent to one of the proposals.

Referring Expression Vocal Bursts Valence Prediction

Paper
Code

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling

no code implementations • 11 Aug 2020 • Jiacheng Li, Siliang Tang, Juncheng Li, Jun Xiao, Fei Wu, ShiLiang Pu, Yueting Zhuang

In this paper, we focus on enhancing the generalization ability of the VIST model by considering the few-shot setting.

Meta-Learning Visual Storytelling

Paper
Add Code

Accurate Lung Nodules Segmentation with Detailed Representation Transfer and Soft Mask Supervision

no code implementations • 29 Jul 2020 • Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

Then, a novel Network with detailed representation transfer and Soft Mask supervision (DSNet) is proposed to process the input low-resolution images of lung nodules into high-quality segmentation results.

Computed Tomography (CT) Lesion Segmentation +3

Paper
Add Code

Deep Multi-task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing

no code implementations • 9 Jul 2020 • Rui Zhao, Tianshan Liu, Jun Xiao, Daniel P. K. Lun, Kin-Man Lam

Multi-task learning is an effective learning strategy for deep-learning-based facial expression recognition tasks.

Facial Expression Recognition Facial Expression Recognition (FER) +2

Paper
Add Code

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation

1 code implementation • 26 May 2020 • Xingchen Li, Xiang Wang, Xiangnan He, Long Chen, Jun Xiao, Tat-Seng Chua

Fashion outfit recommendation has attracted increasing attentions from online shopping services and fashion communities. Distinct from other scenarios (e. g., social networking or content sharing) which recommend a single item (e. g., a friend or picture) to a user, outfit recommendation predicts user preference on a set of well-matched fashion items. Hence, performing high-quality personalized outfit recommendation should satisfy two requirements -- 1) the nice compatibility of fashion items and 2) the consistence with user preference.

Paper
Code

Counterfactual Samples Synthesizing for Robust Visual Question Answering

2 code implementations • CVPR 2020 • Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, ShiLiang Pu, Yueting Zhuang

To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA model, and achieve dominating performance on VQA-CP.

Ranked #1 on Visual Question Answering (VQA) on VQA-CP (using extra training data)

counterfactual Question Answering +1

Paper
Code

Evaluation Framework For Large-scale Federated Learning

1 code implementation • 3 Mar 2020 • Lifeng Liu, Fengda Zhang, Jun Xiao, Chao Wu

Federated learning is proposed as a machine learning setting to enable distributed edge devices, such as mobile phones, to collaboratively learn a shared prediction model while keeping all the training data on device, which can not only take full advantage of data distributed across millions of nodes to train a good model but also protect data privacy.

Federated Learning

Paper
Code

Reinforcement-Learning based Portfolio Management with Augmented Asset Movement Prediction States

1 code implementation • 9 Feb 2020 • Yunan Ye, Hengzhi Pei, Boxin Wang, Pin-Yu Chen, Yada Zhu, Jun Xiao, Bo Li

Our framework aims to address two unique challenges in financial PM: (1) data heterogeneity -- the collected information for each asset is usually diverse, noisy and imbalanced (e. g., news articles); and (2) environment uncertainty -- the financial market is versatile and non-stationary.

Management reinforcement-learning +1

Paper
Code

Video Dialog via Progressive Inference and Cross-Transformer

no code implementations • IJCNLP 2019 • Weike Jin, Zhou Zhao, Mao Gu, Jun Xiao, Furu Wei, Yueting Zhuang

Video dialog is a new and challenging task, which requires the agent to answer questions combining video information with dialog history.

Answer Generation Question Answering +4

Paper
Add Code

DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization

no code implementations • IJCNLP 2019 • Chujie Lu, Long Chen, Chilie Tan, Xiaolin Li, Jun Xiao

In this paper, we focus on natural language video localization: localizing (ie, grounding) a natural language description in a long and untrimmed video sequence.

Paper
Add Code

Weak Supervision Enhanced Generative Network for Question Generation

no code implementations • 1 Jul 2019 • Yutong Wang, Jiyuan Zheng, Qijiong Liu, Zhou Zhao, Jun Xiao, Yueting Zhuang

More specifically, we devise a discriminator, Relation Guider, to capture the relations between the whole passage and the associated answer and then the Multi-Interaction mechanism is deployed to transfer the knowledge dynamically for our question generation system.

Decoder Question Answering +2

Paper
Add Code

Galaxy Learning -- A Position Paper

no code implementations • 22 Apr 2019 • Chao Wu, Jun Xiao, Gang Huang, Fei Wu

Model training, as well as the communication, is achieved with blockchain and its smart contracts.

BIG-bench Machine Learning Position

Paper
Add Code

Counterfactual Critic Multi-Agent Training for Scene Graph Generation

no code implementations • ICCV 2019 • Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, ShiLiang Pu, Shih-Fu Chang

CMAT is a multi-agent policy gradient method that frames objects as cooperative agents, and then directly maximizes a graph-level metric as the reward.

counterfactual Graph Generation +2

Paper
Add Code

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation

1 code implementation • 27 Nov 2018 • Junsong Fan, Zhao-Xiang Zhang, Tieniu Tan, Chunfeng Song, Jun Xiao

Weakly supervised semantic segmentation with only image-level labels saves large human effort to annotate pixel-level labels.

Segmentation Weakly supervised segmentation +2

Paper
Code

Textually Guided Ranking Network for Attentional Image Retweet Modeling

no code implementations • 24 Oct 2018 • Zhou Zhao, Hanbing Zhan, Lingtao Meng, Jun Xiao, Jun Yu, Min Yang, Fei Wu, Deng Cai

In this paper, we study the problem of image retweet prediction in social media, which predicts the image sharing behavior that the user reposts the image tweets from their followees.

Paper
Add Code

Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks

1 code implementation • CVPR 2018 • Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang

We propose a novel framework called Semantics-Preserving Adversarial Embedding Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their classes are both unseen during training.

General Classification Zero-Shot Learning

Paper
Code

Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks

7 code implementations • 15 Aug 2017 • Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, Tat-Seng Chua

Factorization Machines (FMs) are a supervised learning approach that enhances the linear regression model by incorporating the second-order feature interactions.

regression

7,367

Paper
Code

Graph-Theoretic Spatiotemporal Context Modeling for Video Saliency Detection

no code implementations • 25 Jul 2017 • Lina Wei, Fangfang Wang, Xi Li, Fei Wu, Jun Xiao

As a result, a key issue in video saliency detection is how to effectively capture the intrinsical properties of atomic video structures as well as their associated contextual interactions along the spatial and temporal dimensions.

Video Saliency Detection

Paper
Add Code

Video Question Answering via Attribute-Augmented Attention Network Learning

no code implementations • 20 Jul 2017 • Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang

Video Question Answering is a challenging problem in visual information retrieval, which provides the answer to the referenced video content according to the question.

Attribute Information Retrieval +6

Paper
Add Code

On Geometric Features for Skeleton-Based Action Recognition using Multilayer LSTM Networks

1 code implementation • 2017 IEEE Winter Conference on Applications of Computer Vision (WACV) 2017 • Songyang Zhang, Xiaoming Liu, Jun Xiao

RNN-based approaches have achieved outstanding performance on action recognition with skeleton inputs.

Ranked #1 on Skeleton Based Action Recognition on SBU

Action Recognition Skeleton Based Action Recognition

Paper
Code

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

2 code implementations • CVPR 2017 • Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua

Existing visual attention models are generally spatial, i. e., the attention is modeled as spatial probabilities that re-weight the last conv-layer feature map of a CNN encoding an input image.

Image Captioning Sentence

207

Paper
Code

Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking

no code implementations • 4 Dec 2014 • Liming Zhao, Xi Li, Jun Xiao, Fei Wu, Yueting Zhuang

As an important and challenging problem in computer vision and graphics, keypoint-based object tracking is typically formulated in a spatio-temporal statistical learning framework.

Metric Learning Object Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.