Search Results for author: Keerthiram Murugesan

Found 32 papers, 5 papers with code

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

no code implementations • 30 May 2024 • Hyo Jin Do, Rachel Ostrand, Justin D. Weisz, Casey Dugan, Prasanna Sattigeri, Dennis Wei, Keerthiram Murugesan, Werner Geyer

To address this issue, we conducted a scenario-based study (N=104) to systematically compare the impact of various design strategies for communicating factuality and source attribution on participants' ratings of trust, preferences, and ease in validating response accuracy.

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

no code implementations • 24 May 2024 • Shuai Zhang, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang

This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition dynamics.

Q-Learning Reinforcement Learning (RL) +1

On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

no code implementations • 15 Apr 2024 • Mauricio Gruppi, Soham Dan, Keerthiram Murugesan, Subhajit Chaudhury

Moreover, we describe the occurrence of semantic degeneration as a consequence of inappropriate fine-tuning of language models in text-based reinforcement learning (TBRL).

reinforcement-learning

EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

1 code implementation • 15 Mar 2024 • Kinjal Basu, Keerthiram Murugesan, Subhajit Chaudhury, Murray Campbell, Kartik Talamadupula, Tim Klinger

To tackle these issues, in this paper, we present EXPLORER which is an exploration-guided reasoning agent for textual reinforcement learning.

Natural Language Understanding reinforcement-learning +2

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

no code implementations • 9 Mar 2024 • Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, Inkit Padhi, David Piorkowski, Ambrish Rawat, Orna Raz, Prasanna Sattigeri, Hendrik Strobelt, Sarathkrishna Swaminathan, Christoph Tillmann, Aashka Trivedi, Kush R. Varshney, Dennis Wei, Shalisha Witherspooon, Marcel Zalmanovici

Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations.

Language Guided Exploration for RL Agents in Text Environments

no code implementations • 5 Mar 2024 • Hitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan

Real-world sequential decision making is characterized by sparse rewards and large decision spaces, posing significant difficulty for experiential learning systems like tabula rasa reinforcement learning (RL) agents.

Decision Making Language Modelling +2

On the Prospects of Incorporating Large Language Models (LLMs) in Automated Planning and Scheduling (APS)

no code implementations • 4 Jan 2024 • Vishal Pallagani, Kaushik Roy, Bharath Muppasani, Francesco Fabiano, Andrea Loreggia, Keerthiram Murugesan, Biplav Srivastava, Francesca Rossi, Lior Horesh, Amit Sheth

Automated Planning and Scheduling is among the growing areas in Artificial Intelligence (AI) where mention of LLMs has gained popularity.

Scheduling

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration

no code implementations • 24 Oct 2023 • Shuai Zhang, Hongkang Li, Meng Wang, Miao Liu, Pin-Yu Chen, Songtao Lu, Sijia Liu, Keerthiram Murugesan, Subhajit Chaudhury

This paper provides the first theoretical convergence and sample complexity analysis of the practical setting of DQNs with $\epsilon$-greedy policy.

Q-Learning

Value-based Fast and Slow AI Nudging

no code implementations • 14 Jul 2023 • Marianna B. Ganapini, Francesco Fabiano, Lior Horesh, Andrea Loreggia, Nicholas Mattei, Keerthiram Murugesan, Vishal Pallagani, Francesca Rossi, Biplav Srivastava, Brent Venable

Values that are relevant to a specific decision scenario are used to decide when and how to use each of these nudging modalities.

Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

1 code implementation • 5 Jul 2023 • Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar, Alexander Gray

Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games.

reinforcement-learning Representation Learning

MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

no code implementations • 18 Jun 2023 • Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

In this work, we propose a new evaluation scheme to model human judgments in 7 NLP tasks, based on the fine-grained mismatches between a pair of texts.

Sentence

Understanding the Capabilities of Large Language Models for Automated Planning

no code implementations • 25 May 2023 • Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Biplav Srivastava, Lior Horesh, Francesco Fabiano, Andrea Loreggia

Firstly, we want to understand the extent to which LLMs can be used for plan generation.

Protein Folding

Fast and Slow Planning

no code implementations • 7 Mar 2023 • Francesco Fabiano, Vishal Pallagani, Marianna Bergamaschi Ganapini, Lior Horesh, Andrea Loreggia, Keerthiram Murugesan, Francesca Rossi, Biplav Srivastava

The concept of Artificial Intelligence has gained a lot of attention over the last decade.

Plansformer: Generating Symbolic Plans using Transformers

no code implementations • 16 Dec 2022 • Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Lior Horesh, Biplav Srivastava, Francesco Fabiano, Andrea Loreggia

Large Language Models (LLMs) have been the subject of active research, significantly advancing the field of Natural Language Processing (NLP).

Question Answering Text Generation +2

Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach

1 code implementation • 23 Oct 2022 • Heshan Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen

Machine learning problems with multiple objective functions appear either in learning with multiple criteria where learning has to make a trade-off between multiple performance metrics such as fairness, safety and accuracy; or, in multi-task learning where multiple tasks are optimized jointly, sharing inductive bias between them.

Fairness Inductive Bias +1

Targeted Advertising on Social Networks Using Online Variational Tensor Regression

no code implementations • 22 Aug 2022 • Tsuyoshi Idé, Keerthiram Murugesan, Djallel Bouneffouf, Naoki Abe

The proposed framework is designed to accommodate any number of feature vectors in the form of a multi-mode tensor, thereby making it possible to capture the heterogeneity that may exist over user preferences, products, and campaign strategies in a unified manner.

Marketing regression

Auto-Transfer: Learning to Route Transferrable Representations

1 code implementation • 2 Feb 2022 • Keerthiram Murugesan, Vijay Sadashivaiah, Ronny Luss, Karthikeyan Shanmugam, Pin-Yu Chen, Amit Dhurandhar

Knowledge transfer between heterogeneous source and target networks and tasks has received a lot of attention in recent times as large amounts of quality labeled data can be difficult to obtain in many applications.

Transfer Learning

A Hybrid Neuro-Symbolic approach for Text-Based Games using Inductive Logic Programming

no code implementations • AAAI Workshop CLeaR 2022 • Kinjal Basu, Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Kartik Talamadupula, Tim Klinger, Murray Campbell, Mrinmaya Sachan, Gopal Gupta

These rules are learned in an online manner and applied with an ASP solver to predict an action for the agent.

Inductive logic programming Natural Language Understanding +2

Case-based Reasoning for Better Generalization in Textual Reinforcement Learning

no code implementations • ICLR 2022 • Mattia Atzeni, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan

Text-based games (TBG) have emerged as promising environments for driving research in grounded language understanding and studying problems like generalization and sample efficiency.

Out-of-Distribution Generalization reinforcement-learning +2

Auto-Transfer: Learning to Route Transferable Representations

no code implementations • ICLR 2022 • Keerthiram Murugesan, Vijay Sadashivaiah, Ronny Luss, Karthikeyan Shanmugam, Pin-Yu Chen, Amit Dhurandhar

Knowledge transfer between heterogeneous source and target networks and tasks has received a lot of attention in recent times as large amounts of quality labelled data can be difficult to obtain in many applications.

Transfer Learning

Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations

no code implementations • ACL 2021 • Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

Text-based games (TBGs) have emerged as useful benchmarks for evaluating progress at the intersection of grounded language understanding and reinforcement learning (RL).

Graph Attention Reinforcement Learning (RL) +1

Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning Agents

no code implementations • 9 Jun 2021 • Keerthiram Murugesan, Subhajit Chaudhury, Kartik Talamadupula

This improves the agent's overall understanding of the game 'scene' and objects' relationships to the world around them, and the variety of visual representations on offer allows the agent to generate a better generalization of a relationship.

reinforcement-learning Reinforcement Learning (RL) +2

Thinking Fast and Slow in AI

no code implementations • 12 Oct 2020 • Grady Booch, Francesco Fabiano, Lior Horesh, Kiran Kate, Jon Lenchner, Nick Linck, Andrea Loreggia, Keerthiram Murugesan, Nicholas Mattei, Francesca Rossi, Biplav Srivastava

This paper proposes a research direction to advance AI which draws inspiration from cognitive theories of human decision making.

Common Sense Reasoning Decision Making

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines

2 code implementations • 8 Oct 2020 • Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

Text-based games have emerged as an important test-bed for Reinforcement Learning (RL) research, requiring RL agents to combine grounded language understanding with sequential decision making.

Ranked #1 on Commonsense Reasoning for RL on commonsense-rl

Common Sense Reasoning Commonsense Reasoning for RL +3

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches

no code implementations • 12 Jul 2020 • Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

We introduce a number of RL agents that combine the sequential context with a dynamic graph representation of their beliefs of the world and commonsense knowledge from ConceptNet in different ways.

Decision Making Reinforcement Learning (RL) +1

Enhancing Text-based Reinforcement Learning Agents with Commonsense Knowledge

no code implementations • 2 May 2020 • Keerthiram Murugesan, Mattia Atzeni, Pushkar Shukla, Mrinmaya Sachan, Pavan Kapanipathi, Kartik Talamadupula

In this paper, we consider the recent trend of evaluating progress on reinforcement learning technology by using text-based environments and games as evaluation environments.

reinforcement-learning Reinforcement Learning (RL)

Lifelong Learning with Output Kernels

no code implementations • ICLR 2018 • Keerthiram Murugesan, Jaime Carbonell

Lifelong learning poses considerable challenges in terms of effectiveness (minimizing prediction errors for all tasks) and overall computational tractability for real-time performance.

Active Learning from Peers

no code implementations • NeurIPS 2017 • Keerthiram Murugesan, Jaime Carbonell

This paper addresses the challenge of learning from peers in an online multitask setting.

Active Learning

Co-Clustering for Multitask Learning

no code implementations • 3 Mar 2017 • Keerthiram Murugesan, Jaime Carbonell, Yiming Yang

This paper presents a new multitask learning framework that learns a shared representation among the tasks, incorporating both task and feature clusters.

Clustering

Self-Paced Multitask Learning with Shared Knowledge

no code implementations • 2 Mar 2017 • Keerthiram Murugesan, Jaime Carbonell

This paper introduces self-paced task selection to multitask learning, where instances from more closely related tasks are selected in a progression of easier-to-harder tasks, to emulate an effective human education strategy, but applied to multitask machine learning.

Adaptive Smoothed Online Multi-Task Learning

no code implementations • NeurIPS 2016 • Keerthiram Murugesan, Hanxiao Liu, Jaime Carbonell, Yiming Yang

This paper addresses the challenge of jointly learning both the per-task model parameters and the inter-task relationships in a multi-task online learning setting.

Multi-Task Learning

Multi-Task Multiple Kernel Relationship Learning

no code implementations • 10 Nov 2016 • Keerthiram Murugesan, Jaime Carbonell

The problem is formulated as a regularization-based approach called Multi-Task Multiple Kernel Relationship Learning (MK-MTRL), which models the task relationship matrix from the weights learned from latent feature spaces of task-specific base kernels.
