Search Results for author: Jihan Yao

Found 2 papers, 0 papers with code

LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop

no code implementations • 14 Feb 2024 • Maryam Amirizaniani, Jihan Yao, Adrian Lavergne, Elizabeth Snell Okada, Aman Chadha, Tanya Roosta, Chirag Shah

A case study using questions from the TruthfulQA dataset demonstrates that we can generate a reliable set of probes from one LLM that can be used to audit inconsistencies in a different LLM.

Hallucination

Paper
Add Code

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

no code implementations • 9 Feb 2024 • Yuta Saito, Jihan Yao, Thorsten Joachims

We also show that POTEC provides a strict generalization of policy- and regression-based approaches and their associated assumptions.

regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.