no code implementations • 2 Dec 2022 • Ibrahim Shoer, Berkay Kopru, Engin Erzin
Video summarization attracts attention for efficient video representation, retrieval, and browsing to ease volume and traffic surge problems.
no code implementations • 13 Oct 2022 • Ali Safaya, Engin Erzin
In this report, we present our findings on Turkish ASR with speech representation learning using HUBERT.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Jul 2021 • Berkay Köprü, Engin Erzin
Then, we integrate the estimated emotional attributes and the high-level representations from the CER-NET with the visual information to define the proposed affective video summarization architectures (AVSUM).
no code implementations • 12 Dec 2020 • Zana Bucinca, Yucel Yemez, Engin Erzin, Metin Sezgin
For generating language in a targeted affect, our approach leverages a probabilistic language model and an affective space.
no code implementations • 6 Aug 2019 • Nusrah Hussain, Engin Erzin, T. Metin Sezgin, Yucel Yemez
We present here a method for training a robot for backchannel generation during a human-robot interaction within the reinforcement learning (RL) framework, with the goal of maintaining high engagement level.
no code implementations • 5 Aug 2019 • Nusrah Hussain, Engin Erzin, T. Metin Sezgin, Yucel Yemez
Our experiments demonstrate the potential of our method to train a robot for engaging behaviors in an offline manner.