Search Results for author: RuiQi Li

Found 19 papers, 5 papers with code

Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion

no code implementations • 4 Jun 2024 • RuiQi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao

We adopt discrete-unit random resampling and pitch corruption strategies, enabling training with unpaired singing data and thus mitigating the issue of data scarcity.

In-Context Learning, Language Modelling +3

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

no code implementations • 1 Jun 2024 • Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, RuiQi Li, Zhou Zhao

By employing a non-autoregressive vector field estimator based on a feed-forward transformer and channel-level cross-modal feature fusion with strong temporal alignment, our model generates audio that is highly synchronized with the input video.

Audio Generation

FAITH: Frequency-domain Attention In Two Horizons for Time Series Forecasting

1 code implementation • 22 May 2024 • RuiQi Li, Maowei Jiang, Kai Wang, Kaiduo Feng, Quangao Liu, Yue Sun, Xiufang Zhou

Time Series Forecasting plays a crucial role in various fields such as industrial equipment maintenance, meteorology, energy consumption, traffic flow and financial investment.

Time Series, Time Series Forecasting

Robust Singing Voice Transcription Serves Synthesis

no code implementations • 16 May 2024 • RuiQi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao

Note-level Automatic Singing Voice Transcription (AST) converts singing recordings into note sequences, facilitating the automatic annotation of singing datasets for Singing Voice Synthesis (SVS) applications.

Decoder, Singing Voice Synthesis

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

no code implementations • 14 Apr 2024 • Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, RuiQi Li, Fuming You, Zhou Zhao, Zhimeng Zhang

A song is a combination of singing voice and accompaniment.

Music Generation, Singing Voice Synthesis

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

no code implementations • 18 Mar 2024 • Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, RuiQi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao

Recent singing-voice-synthesis (SVS) methods have achieved remarkable audio quality and naturalness, yet they lack the capability to control the style attributes of the synthesized singing explicitly.

Attribute, Decoder +1

Stochastic Data-Driven Predictive Control with Equivalence to Stochastic MPC

no code implementations • 23 Dec 2023 • RuiQi Li, John W. Simpson-Porco, Stephen L. Smith

We propose a data-driven receding-horizon control method dealing with the chance-constrained output-tracking problem of unknown stochastic linear time-invariant (LTI) systems with partial state observation.

Model Predictive Control

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

no code implementations • 17 Dec 2023 • Yu Zhang, Rongjie Huang, RuiQi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

Moreover, existing SVS methods encounter a decline in the quality of synthesized singing voices in OOD scenarios, as they rest upon the assumption that the target vocal attributes are discernible during the training phase.

Quantization, Singing Voice Synthesis +1

Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer

no code implementations • 14 Sep 2023 • Yongqi Wang, Jionghao Bai, Rongjie Huang, RuiQi Li, Zhiqing Hong, Zhou Zhao

Direct speech-to-speech translation (S2ST) with discrete self-supervised representations has achieved remarkable accuracy, but is unable to preserve the speaker timbre of the source speech during translation.

In-Context Learning, Language Modelling +3

When Do Discourse Markers Affect Computational Sentence Understanding?

no code implementations • 1 Sep 2023 • RuiQi Li, Liesbeth Allein, Damien Sileo, Marie-Francine Moens

The capabilities and use cases of automatic natural language processing (NLP) have grown significantly over the last few years.

Sentence

DisCover: Disentangled Music Representation Learning for Cover Song Identification

no code implementations • 19 Jul 2023 • Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, RuiQi Li, Lichao Zhang, Fei Wu

We analyze the CSI task in a disentanglement view with the causal graph technique, and identify the intra-version and inter-version effects biasing the invariant learning.

Blocking, Cover Song Identification +3

Automated Action Model Acquisition from Narrative Texts

no code implementations • 17 Jul 2023 • RuiQi Li, Leyang Cui, Songtuan Lin, Patrik Haslum

Action models, which take the form of precondition/effect axioms, facilitate causal and motivational connections between actions for AI agents.

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

no code implementations • 8 May 2023 • RuiQi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao

The speech-to-singing (STS) voice conversion task aims to generate singing samples corresponding to speech recordings while facing a major challenge: the alignment between the target (singing) pitch contour and the source (speech) content is difficult to learn in a text-free situation.

STS, Voice Conversion

Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models

1 code implementation • 5 Apr 2023 • Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, RuiQi Li, Steve Jiang, Jing Wang, You Zhang

However, for medical image translation, the existing diffusion models are deficient in accurately retaining structural information since the structure details of source domain images are lost during the forward diffusion process and cannot be fully recovered through learned reverse diffusion, while the integrity of anatomical structures is extremely important in medical images.

Anatomy, SSIM +2

EDeR: A Dataset for Exploring Dependency Relations Between Events

1 code implementation • 4 Apr 2023 • RuiQi Li, Patrik Haslum, Leyang Cui

We argue that an important type of relation not explored in NLP or IR research to date is that of an event being an argument - required or optional - of another event.

Event Extraction, Information Retrieval +3

M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus

1 code implementation • NeurIPS 2022 • Lichao Zhang, RuiQi Li, Shoutong Wang, Liqun Deng, Jinglin Liu, Yi Ren, Jinzheng He, Rongjie Huang, Jieming Zhu, Xiao Chen, Zhou Zhao

The lack of publicly available high-quality and accurately labeled datasets has long been a major bottleneck for singing voice synthesis (SVS).

Music Transcription, Singing Voice Synthesis +1

Data-Driven Model Predictive Control for Linear Time-Periodic Systems

no code implementations • 30 Mar 2022 • RuiQi Li, John W. Simpson-Porco, Stephen L. Smith

Robustness of the algorithm to noisy data is illustrated via simulation of a regularized version of the algorithm applied to a stochastic multi-input multi-output LTP system.

LEMMA, Model Predictive Control

Latent Space Arc Therapy Optimization

no code implementations • 24 May 2021 • Noah Bice, Mohamad Fakhreddine, RuiQi Li, Dan Nguyen, Christopher Kabat, Pamela Myers, Niko Papanikolaou, Neil Kirby

Volumetric modulated arc therapy planning is a challenging problem in high-dimensional, non-convex optimization.

The Geometry of Information Cocoon: Analyzing the Cultural Space with Word Embedding Models

1 code implementation • 20 Jul 2020 • Huimin Xu, Zhicong Chen, RuiQi Li, Cheng-Jun Wang

In contrast, the people of higher social class have more capability to stride over the constraints of information cocoon.

Computers and Society
