Search Results for author: Saket Dingliwal

Found 14 papers, 5 papers with code

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

no code implementations14 May 2024 Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

Despite safety guardrails, experiments on jailbreaking demonstrate the vulnerability of SLMs to adversarial perturbations and transfer attacks, with average attack success rates of 90% and 10% respectively when evaluated on a dataset of carefully designed harmful questions spanning 12 different toxic categories.

Adversarial Robustness Instruction Following +1

SpeechVerse: A Large-scale Generalizable Audio Language Model

no code implementations14 May 2024 Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, David Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

The models are instruction finetuned using continuous latent representations extracted from the speech foundation model to achieve optimal zero-shot performance on a diverse range of speech processing tasks using natural language instructions.

Automatic Speech Recognition Benchmarking +4

Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages

no code implementations3 Jul 2023 Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Sravan Bodapati

A recent approach explores Contextual Adapters, wherein an attention-based biasing model for CTC is used to improve the recognition of custom entities.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale

1 code implementation18 Dec 2022 Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff, Dan Roth

Using a 66 billion parameter language model (OPT-66B) across a diverse set of 14 downstream tasks, we find this is indeed the case: $\sim$70% of attention heads and $\sim$20% of feed forward networks can be removed with minimal decline in task performance.

In-Context Learning Language Modelling +1

Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting

no code implementations18 Oct 2022 Saket Dingliwal, Monica Sunkara, Sravan Bodapati, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff

End-to-end speech recognition models trained using joint Connectionist Temporal Classification (CTC)-Attention loss have gained popularity recently.

Decoder speech-recognition +1

Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems

no code implementations16 Dec 2021 Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains creating a need to adapt to new domains with small memory and deployment overhead.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Prompt-tuning in ASR systems for efficient domain-adaptation

no code implementations13 Oct 2021 Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

In this work, we overcome the problem using prompt-tuning, a methodology that trains a small number of domain token embedding parameters to prime a transformer-based LM to a particular domain.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Few Shot Dialogue State Tracking using Meta-learning

1 code implementation EACL 2021 Saket Dingliwal, Bill Gao, Sanchit Agarwal, Chien-Wei Lin, Tagyoung Chung, Dilek Hakkani-Tur

Dialogue State Tracking (DST) forms a core component of automated chatbot based systems designed for specific goals like hotel, taxi reservation, tourist information, etc.

Chatbot Dialogue State Tracking +1

Covariate Distribution Aware Meta-learning

1 code implementation ICML Workshop LifelongML 2020 Amrith Setlur, Saket Dingliwal, Barnabas Poczos

Based on this model we propose a computationally feasible meta-learning algorithm by introducing meaningful relaxations in our final objective.

Few-Shot Learning regression

Finding Input Characterizations for Output Properties in ReLU Neural Networks

1 code implementation9 Mar 2020 Saket Dingliwal, Divyansh Pareek, Jatin Arora

Deep Neural Networks (DNNs) have emerged as a powerful mechanism and are being increasingly deployed in real-world safety-critical domains.

Cannot find the paper you are looking for? You can Submit a new open access paper.