1 code implementation • 6 May 2024 • Qijiong Liu, Xiaoyu Dong, Jiaren Xiao, Nuo Chen, Hengchang Hu, Jieming Zhu, Chenxu Zhu, Tetsuya Sakai, Xiao-Ming Wu
Finally, the survey analyzes the remaining challenges and anticipates future trends in VQ4Rec, including the challenges of training vector quantization models, the opportunities presented by large language models, and emerging trends in multimodal recommender systems.
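To make the training challenge concrete, here is a minimal sketch of the core vector quantization step (plain NumPy, illustrative only — not taken from the survey): each embedding is assigned to its nearest codeword, and the `argmin` assignment is non-differentiable, which is exactly why VQ training needs workarounds such as straight-through gradient estimation.

```python
import numpy as np

rng = np.random.default_rng(0)
K, D, N = 64, 16, 256            # codebook size, embedding dim, batch size
codebook = rng.standard_normal((K, D))
x = rng.standard_normal((N, D))  # e.g. item embeddings to be quantized

# Assign each vector to its nearest codeword (squared Euclidean distance).
d2 = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)  # (N, K)
codes = d2.argmin(axis=1)        # discrete codes: the non-differentiable step
quantized = codebook[codes]      # (N, D) reconstruction from the codebook

# Training commonly combines a straight-through estimator (copy gradients
# through the quantization) with a commitment loss pulling x toward its
# assigned codeword; here we only compute that loss for illustration.
commitment_loss = np.mean((x - quantized) ** 2)
```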
no code implementations • 17 Nov 2023 • Shabarish Chenakkod, Michał Dereziński, Xiaoyu Dong, Mark Rudelson
We use this to construct the first oblivious subspace embedding with $O(d)$ embedding dimension that can be applied faster than current matrix multiplication time, and to obtain an optimal single-pass algorithm for least squares regression.
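For intuition, a sketch-and-solve least-squares example using a classic CountSketch embedding (an assumption for illustration — this is not the paper's construction, which achieves $O(d)$ embedding dimension, whereas CountSketch needs a quadratically larger sketch): each row of the data is hashed into one bucket with a random sign, so the sketch is built in a single pass, and the small sketched problem is solved in place of the original.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, m = 2000, 20, 400  # tall least-squares problem; sketch size m >> d

A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d) + 0.01 * rng.standard_normal(n)

# CountSketch-style embedding: hash each row to one of m buckets with a
# random sign; the sketch is accumulated in one pass over the rows.
rows = rng.integers(0, m, size=n)
signs = rng.choice([-1.0, 1.0], size=n)

SA = np.zeros((m, d))
Sb = np.zeros(m)
np.add.at(SA, rows, signs[:, None] * A)  # unbuffered scatter-add
np.add.at(Sb, rows, signs * b)

# Solve the small (m x d) sketched problem instead of the (n x d) original.
x_sketch, *_ = np.linalg.lstsq(SA, Sb, rcond=None)
x_exact, *_ = np.linalg.lstsq(A, b, rcond=None)
```

With a valid subspace embedding, the sketched solution's residual is within a small factor of the optimal residual, at the cost of solving a much smaller problem.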
1 code implementation • 23 Aug 2023 • Xiaoyu Dong, Naoto Yokoya
Experiments show that our approach effectively enhances dark-scene understanding from multi-modal images with limited semantics by shaping semantic-discriminative feature spaces.
Ranked #1 on Semantic Segmentation on LLRGBD-synthetic
1 code implementation • CVPR 2023 • Runsheng Xu, Xin Xia, Jinlong Li, Hanzhao Li, Shuo Zhang, Zhengzhong Tu, Zonglin Meng, Hao Xiang, Xiaoyu Dong, Rui Song, Hongkai Yu, Bolei Zhou, Jiaqi Ma
To facilitate the development of cooperative perception, we present V2V4Real, the first large-scale real-world multi-modal dataset for V2V perception.
1 code implementation • 16 Oct 2022 • Runsheng Xu, Jinlong Li, Xiaoyu Dong, Hongkai Yu, Jiaqi Ma
Existing multi-agent perception algorithms usually choose to share deep neural features extracted from raw sensing data between agents, striking a trade-off between accuracy and communication bandwidth.
1 code implementation • 19 Jul 2022 • Xiaoyu Dong, Naoto Yokoya, Longguang Wang, Tatsumi Uezato
Self-supervised cross-modal super-resolution (SR) can overcome the difficulty of acquiring paired training data, but is challenging because only low-resolution (LR) source and high-resolution (HR) guide images from different modalities are available.
1 code implementation • 4 May 2022 • Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Alan Bovik, Hongkai Yu
Our proposed framework consists of three modules: a restoration sub-network that removes degradations, a similarity sub-network that performs color histogram matching and color transfer, and a colorization sub-network that learns to predict the chroma components of images conditioned on chromatic reference signals.
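The histogram-matching step of the similarity module can be sketched as follows (a minimal single-channel NumPy version, assumed for illustration — not the paper's implementation; for RGB it would be applied per channel): source intensities are remapped so their empirical CDF follows the reference's.

```python
import numpy as np

def match_histogram(source, reference):
    """Remap source intensities so their empirical CDF follows the reference's."""
    s_values, s_idx, s_counts = np.unique(
        source.ravel(), return_inverse=True, return_counts=True)
    r_values, r_counts = np.unique(reference.ravel(), return_counts=True)
    s_cdf = np.cumsum(s_counts) / source.size
    r_cdf = np.cumsum(r_counts) / reference.size
    matched = np.interp(s_cdf, r_cdf, r_values)  # CDF-to-CDF lookup
    return matched[s_idx].reshape(source.shape)

rng = np.random.default_rng(0)
old_photo = rng.random((32, 32)) ** 2         # dark, washed-out channel (toy data)
reference = 0.3 + 0.4 * rng.random((32, 32))  # target color statistics (toy data)
matched = match_histogram(old_photo, reference)
```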
no code implementations • 5 Feb 2022 • Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Hongkai Yu
Renovating the memories in old photos is an intriguing research topic in computer vision.
1 code implementation • CVPR 2022 • Longguang Wang, Xiaoyu Dong, Yingqian Wang, Li Liu, Wei An, Yulan Guo
Since a linear quantizer (i.e., round(*) function) cannot well fit the bell-shaped distributions of weights and activations, many existing methods use pre-defined functions (e.g., exponential function) with learnable parameters to build the quantizer for joint optimization.
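The mismatch can be seen in a small experiment (an illustrative sketch, not the paper's quantizer — a tanh warp with a shape parameter stands in for the learned function): warping bell-shaped weights before uniform rounding spends more quantization levels near zero, where the weights concentrate, and lowers the reconstruction error.

```python
import numpy as np

rng = np.random.default_rng(0)
# Bell-shaped "weights", clipped to a fixed range as is common before quantization.
w = np.clip(rng.standard_normal(10000), -3.0, 3.0)
levels = 16  # 4-bit quantization

def uniform_quantize(x, lo, hi, levels):
    # Linear quantizer: scale, round(*), rescale -- uniform step size everywhere.
    step = (hi - lo) / (levels - 1)
    return np.round((x - lo) / step) * step + lo

def compand_quantize(x, alpha, lo, hi, levels):
    # Non-linear quantizer from a parametric warp (tanh with shape parameter
    # alpha, standing in for a learned function): finer steps near zero.
    y = np.tanh(alpha * x)
    yq = uniform_quantize(y, np.tanh(alpha * lo), np.tanh(alpha * hi), levels)
    return np.arctanh(yq) / alpha

mse_linear = np.mean((w - uniform_quantize(w, -3.0, 3.0, levels)) ** 2)
mse_compand = np.mean((w - compand_quantize(w, 0.4, -3.0, 3.0, levels)) ** 2)
print(f"linear MSE: {mse_linear:.4f}, companding MSE: {mse_compand:.4f}")
```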
2 code implementations • CVPR 2021 • Longguang Wang, Yingqian Wang, Xiaoyu Dong, Qingyu Xu, Jungang Yang, Wei An, Yulan Guo
In this paper, we propose an unsupervised degradation representation learning scheme for blind SR without explicit degradation estimation.
1 code implementation • CVPR 2021 • Longguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo
Specifically, we develop a Sparse Mask SR (SMSR) network to learn sparse masks to prune redundant computation.
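The effect of such spatial masks can be illustrated with a toy 1x1-convolution example (NumPy, purely illustrative — in SMSR the masks are learned end-to-end, whereas the hand-crafted magnitude threshold below merely stands in for them): only masked locations are gathered, transformed, and scattered back, so the rest of the spatial grid pays no compute.

```python
import numpy as np

rng = np.random.default_rng(0)
H, W, C_in, C_out = 64, 64, 8, 8
feat = rng.standard_normal((H, W, C_in))
weight = rng.standard_normal((C_in, C_out))

# Toy "importance" mask: keep locations with large feature magnitude
# (a hand-crafted stand-in for SMSR's learned sparse masks).
score = np.abs(feat).mean(axis=-1)
mask = score > np.quantile(score, 0.7)  # keep roughly the top 30% of locations

# Dense 1x1 conv: every location pays C_in * C_out multiplies.
dense_out = feat @ weight

# Sparse variant: gather active locations, transform, scatter back.
sparse_out = np.zeros((H, W, C_out))
sparse_out[mask] = feat[mask] @ weight

active = int(mask.sum())
print(f"computed {active}/{H * W} locations "
      f"({100 * active / (H * W):.0f}% of dense FLOPs)")
```

At the active locations the sparse output matches the dense one exactly; the savings come from skipping the inactive ones.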