Search Results for author: Ruoyu Wang

Found 32 papers, 12 papers with code

Multi-agent Traffic Prediction via Denoised Endpoint Distribution

no code implementations • 11 May 2024 • Yao Liu, Ruoyu Wang, Yuanjiang Cao, Quan Z. Sheng, Lina Yao

The exploration of high-speed movement by robots or road traffic agents is crucial for autonomous driving and navigation.

Autonomous Driving Traffic Prediction +1

Paper
Add Code

Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion

no code implementations • 3 Apr 2024 • Su Sun, Cheng Zhao, Yuliang Guo, Ruoyu Wang, Xinyu Huang, Yingjie Victor Chen, Liu Ren

The 3D Inpainter with abstract representation at coarse levels is trained offline using various scenes to complete occluded surfaces.

3D Reconstruction 3D Scene Reconstruction +1

Paper
Add Code

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

no code implementations • 3 Apr 2024 • Cheng Zhao, Su Sun, Ruoyu Wang, Yuliang Guo, Jun-Jun Wan, Zhou Huang, Xinyu Huang, Yingjie Victor Chen, Liu Ren

Most 3D Gaussian Splatting (3D-GS) based methods for urban scenes initialize 3D Gaussians directly with 3D LiDAR points, which not only underutilizes LiDAR data capabilities but also overlooks the potential advantages of fusing LiDAR with camera data.

3D Reconstruction Autonomous Driving

Paper
Add Code

UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation

no code implementations • 23 Mar 2024 • Yuliang Guo, Abhinav Kumar, Cheng Zhao, Ruoyu Wang, Xinyu Huang, Liu Ren

Monocular 3D reconstruction for categorical objects heavily relies on accurately perceiving each object's pose.

3D Object Reconstruction 3D Reconstruction +2

Paper
Add Code

Multitask frame-level learning for few-shot sound event detection

no code implementations • 17 Mar 2024 • Liang Zou, Genwei Yan, Ruoyu Wang, Jun Du, Meng Lei, Tian Gao, Xin Fang

This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples.

Data Augmentation Event Detection +1

Paper
Add Code

StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images

1 code implementation • 14 Mar 2024 • Robert Jewsbury, Ruoyu Wang, Abhir Bhalerao, Nasir Rajpoot, Quoc Dang Vu

Stain normalization algorithms aim to transform the color and intensity characteristics of a source multi-gigapixel histology image to match those of a target image, mitigating inconsistencies in the appearance of stains used to highlight cellular components in the images.

Computational Efficiency Instance Segmentation +3

Paper
Code

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

1 code implementation • 7 Mar 2024 • Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee

In this paper, we investigate this contrasting phenomenon from the perspective of modality bias and reveal that an excessive modality bias on the audio caused by dropout is the underlying reason.

Audio-Visual Speech Recognition Knowledge Distillation +2

Paper
Code

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

no code implementations • 5 Mar 2024 • Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, Samuel Marks, Oam Patel, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Lin, Adam A. Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Russell Kaplan, Ian Steneker, David Campbell, Brad Jokubaitis, Alex Levinson, Jean Wang, William Qian, Kallol Krishna Karmakar, Steven Basart, Stephen Fitz, Mindy Levine, Ponnurangam Kumaraguru, Uday Tupakula, Vijay Varadharajan, Ruoyu Wang, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks

To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs.

Multiple-choice

Paper
Add Code

Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring

no code implementations • 26 Jan 2024 • Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim

Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems.

Action Recognition

Paper
Add Code

DETER: Detecting Edited Regions for Deterring Generative Manipulations

no code implementations • 16 Dec 2023 • Sai Wang, Ye Zhu, Ruoyu Wang, Amaya Dharmasiri, Olga Russakovsky, Yu Wu

While face swapping and attribute editing are performed on similar face regions such as eyes and nose, the inpainting operation can be performed on random image regions, removing the spurious correlations of previous datasets.

Attribute Face Swapping +1

Paper
Add Code

An Automated Pipeline for Tumour-Infiltrating Lymphocyte Scoring in Breast Cancer

1 code implementation • 10 Nov 2023 • Adam J Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir M Rajpoot, Shan E Ahmed Raza

Tumour-infiltrating lymphocytes (TILs) are considered as a valuable prognostic markers in both triple-negative and human epidermal growth factor receptor 2 (HER2) positive breast cancer.

whole slide images

Paper
Code

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

1 code implementation • 17 Sep 2023 • Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee

We propose a novel neural speaker diarization system using memory-aware multi-speaker embedding with sequence-to-sequence architecture (NSD-MS2S), which integrates the strengths of memory-aware multi-speaker embedding (MA-MSE) and sequence-to-sequence (Seq2Seq) architecture, leading to improvement in both efficiency and performance.

speaker-diarization Speaker Diarization

Paper
Code

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

no code implementations • 15 Sep 2023 • Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao

This pioneering effort aims to set the first benchmark for the AVTSE task, offering fresh insights into enhancing the ac-curacy of back-end speech recognition systems through AVTSE in challenging and real acoustic environments.

Audio-Visual Speech Recognition speech-recognition +2

Paper
Add Code

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

no code implementations • 28 Aug 2023 • Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee

This technical report details our submission system to the CHiME-7 DASR Challenge, which focuses on speaker diarization and speech recognition under complex multi-speaker scenarios.

speaker-diarization Speaker Diarization +2

Paper
Add Code

Concavity-Induced Distance for Unoriented Point Cloud Decomposition

no code implementations • 19 Jun 2023 • Ruoyu Wang, Yanfei Xue, Bharath Surianarayanan, Dong Tian, Chen Feng

We propose Concavity-induced Distance (CID) as a novel way to measure the dissimilarity between a pair of points in an unoriented point cloud.

Instance Segmentation Semantic Segmentation

Paper
Add Code

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation

1 code implementation • 14 Jun 2023 • Ruoyu Wang, Yongqi Yang, Zhihao Qian, Ye Zhu, Yu Wu

In this work, we investigate the diffusion (physics) in diffusion (machine learning) properties and propose our Cyclic One-Way Diffusion (COW) method to control the direction of diffusion phenomenon given a pre-trained frozen diffusion model for versatile customization application scenarios, where the low-level pixel information from the conditioning needs to be preserved.

Denoising Image Generation

Paper
Code

P$^2$SDF for Neural Indoor Scene Reconstruction

no code implementations • 1 Mar 2023 • Jing Li, Jinpeng Yu, Ruoyu Wang, Zhengxin Li, Zhengyu Zhang, Lina Cao, Shenghua Gao

As the unsupervised plane segments are usually noisy and inaccurate, we propose to assign different weights to the sampled points on the plane in plane estimation as well as the regularization loss.

Indoor Scene Reconstruction Surface Reconstruction

Paper
Add Code

An Aggregation of Aggregation Methods in Computational Pathology

no code implementations • 2 Nov 2022 • Mohsin Bilal, Robert Jewsbury, Ruoyu Wang, Hammam M. AlGhamdi, Amina Asif, Mark Eastwood, Nasir Rajpoot

Image analysis and machine learning algorithms operating on multi-gigapixel whole-slide images (WSIs) often process a large number of tiles (sub-images) and require aggregating predictions from the tiles in order to predict WSI-level labels.

Multiple Instance Learning whole slide images

Paper
Add Code

PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes

1 code implementation • CVPR 2023 • Ruoyu Wang, Zehao Yu, Shenghua Gao

PlaneDepth estimates the depth distribution using a Laplacian Mixture Model based on orthogonal planes for an input image.

Ranked #3 on Monocular Depth Estimation on KITTI Eigen split unsupervised

Autonomous Driving Data Augmentation +1

Paper
Code

Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods

no code implementations • 19 Aug 2022 • Chao Chen, Xinhao Liu, Xuchu Xu, Yiming Li, Li Ding, Ruoyu Wang, Chen Feng

Inspired by noisy label learning, we propose a novel self-supervised framework named \textit{TF-VPR} that uses temporal neighborhoods and learnable feature neighborhoods to discover unknown spatial neighborhoods.

Data Augmentation Representation Learning +1

Paper
Add Code

Breaking Correlation Shift via Conditional Invariant Regularizer

no code implementations • 14 Jul 2022 • Mingyang Yi, Ruoyu Wang, Jiachen Sun, Zhenguo Li, Zhi-Ming Ma

The correlation shift is caused by the spurious attributes that correlate to the class label, as the correlation between them may vary in training and test data.

Paper
Add Code

TIAger: Tumor-Infiltrating Lymphocyte Scoring in Breast Cancer for the TiGER Challenge

1 code implementation • 23 Jun 2022 • Adam Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir Rajpoot, Shan E Ahmed Raza

The Tumor InfiltratinG lymphocytes in breast cancER (TiGER) challenge, aims to assess the prognostic significance of computer-generated TILs scores for predicting survival as part of a Cox proportional hazards model.

Paper
Code

Rank the triplets: A ranking-based multiple instance learning framework for detecting HPV infection in head and neck cancers using routine H&E images

no code implementations • 16 Jun 2022 • Ruoyu Wang, Syed Ali Khurram, Amina Asif, Lawrence Young, Nasir Rajpoot

Unique gene expression profiles were also identified with respect to HPV infection status, and is in line with existing findings.

Multiple Instance Learning

Paper
Add Code

Out-of-distribution Generalization with Causal Invariant Transformations

no code implementations • CVPR 2022 • Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu

In this work, we obviate these assumptions and tackle the OOD problem without explicitly recovering the causal feature.

Out-of-Distribution Generalization

Paper
Add Code

ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning

1 code implementation • 8 Feb 2022 • Zhenkun Shi, Qianqian Yuan, Ruoyu Wang, Hoaran Li, Xiaoping Liao, Hongwu Ma

Take UniPort protein "A0A0U5GJ41" as an example (1. 14.-.-), ECRECer annotated it with "1. 14. 11. 38", which supported by further protein structure analysis based on AlphaFold2.

Benchmarking Protein Language Model

Paper
Code

Active Learning-Based Optimization of Scientific Experimental Design

no code implementations • 29 Dec 2021 • Ruoyu Wang

Active learning (AL) is a machine learning algorithm that can achieve greater accuracy with fewer labeled training instances, for having the ability to ask oracles to label the most valuable unlabeled data chosen iteratively and heuristically by query strategies.

Active Learning Experimental Design

Paper
Add Code

Improving OOD Generalization with Causal Invariant Transformations

no code implementations • 29 Sep 2021 • Ruoyu Wang, Mingyang Yi, Shengyu Zhu, Zhitang Chen

In this work, we obviate these assumptions and tackle the OOD problem without explicitly recovering the causal feature.

Paper
Add Code

Constructing Flow Graphs from Procedural Cybersecurity Texts

1 code implementation • Findings (ACL) 2021 • Kuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang, Chitta Baral

We must read the whole text to identify the relevant information or identify the instruction flows to complete a task, which is prone to failures.

Sentence Sentence Embeddings

Paper
Code

Deep Weakly Supervised Positioning

no code implementations • 10 Apr 2021 • Ruoyu Wang, Xuchu Xu, Li Ding, Yang Huang, Chen Feng

PoseNet can map a photo to the position where it is taken, which is appealing in robotics.

Paper
Add Code

Characterization of Excess Risk for Locally Strongly Convex Population Risk

1 code implementation • 4 Dec 2020 • Mingyang Yi, Ruoyu Wang, Zhi-Ming Ma

Our bounds underscore that with locally strongly convex population risk, the models trained by any proper iterative algorithm can generalize well, even for non-convex problems, and $d$ is large.

5,788

Paper
Code

SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings

1 code implementation • CVPR 2020 • Wenyu Han, Siyuan Xiang, Chenhui Liu, Ruoyu Wang, Chen Feng

Our experiments show that although convolutional networks have achieved superhuman performance in many visual learning tasks, their spatial reasoning performance on SPARE3D tasks is either lower than average human performance or even close to random guesses.

Paper
Code

Real-time Soft Body 3D Proprioception via Deep Vision-based Sensing

1 code implementation • 8 Apr 2019 • Ruoyu Wang, Shiheng Wang, Songyu Du, Erdong Xiao, Wenzhen Yuan, Chen Feng

Soft bodies made from flexible and deformable materials are popular in many robotics applications, but their proprioceptive sensing has been a long-standing challenge.

Robotics

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.