no code implementations • 13 Jun 2022 • Arlo Faria, Adam Janin, Korbinian Riedhammer, Sidhi Adkoli
While commercial ASR systems are still below this threshold, a research system is shown to clearly surpass the accuracy of commercial human speech recognition.
Automatic Speech Recognition (ASR)
no code implementations • 13 Mar 2015 • Julia Bernd, Damian Borth, Benjamin Elizalde, Gerald Friedland, Heather Gallagher, Luke Gottlieb, Adam Janin, Sara Karabashlieva, Jocelyn Takahashi, Jennifer Won
The YLI Multimedia Event Detection corpus is a public-domain index of videos with annotations and computed features, specialized for research in multimedia event detection (MED), i.e., automatically identifying what's happening in a video by analyzing the audio and visual content.