Search Results for author: Bing'er Jiang

Found 4 papers, 1 papers with code

Multilingual Turn-taking Prediction Using Voice Activity Projection

no code implementations • 11 Mar 2024 • Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara, Gabriel Skantze

The results show that a monolingual VAP model trained on one language does not make good predictions when applied to other languages.

Paper
Add Code

Real-time and Continuous Turn-taking Prediction Using Voice Activity Projection

1 code implementation • 10 Jan 2024 • Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara, Gabriel Skantze

A demonstration of a real-time and continuous turn-taking prediction system is presented.

Paper
Code

Response-conditioned Turn-taking Prediction

no code implementations • 3 May 2023 • Bing'er Jiang, Erik Ekstedt, Gabriel Skantze

Treating the turn-prediction and response-ranking as a one-stage process, our findings suggest that our model can be used as an incremental response ranker, which can be applied in various settings.

Response Generation

Paper
Add Code

What makes a good pause? Investigating the turn-holding effects of fillers

no code implementations • 3 May 2023 • Bing'er Jiang, Erik Ekstedt, Gabriel Skantze

Filled pauses (or fillers), such as "uh" and "um", are frequent in spontaneous speech and can serve as a turn-holding cue for the listener, indicating that the current speaker is not done yet.

Position

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.