Search Results for author: Orr Paradise

Found 5 papers, 2 papers with code

Models That Prove Their Own Correctness

no code implementations • 24 May 2024 • Noga Amit, Shafi Goldwasser, Orr Paradise, Guy Rothblum

The *soundness* property of $V$ guarantees that, for *every* input, no model can convince $V$ of the correctness of an incorrect output.

Paper
Add Code

Pseudointelligence: A Unifying Framework for Language Model Evaluation

no code implementations • 18 Oct 2023 • Shikhar Murty, Orr Paradise, Pratyusha Sharma

With large language models surpassing human performance on an increasing number of benchmarks, we must take a principled approach for targeted evaluation of model capabilities.

Language Modelling

Paper
Add Code

UniMASK: Unified Inference in Sequential Decision Problems

1 code implementation • 20 Nov 2022 • Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks.

Decision Making

Paper
Code

A Theory of Unsupervised Translation Motivated by Understanding Animal Communication

1 code implementation • NeurIPS 2023 • Shafi Goldwasser, David F. Gruber, Adam Tauman Kalai, Orr Paradise

Neural networks are capable of translating between languages -- in some cases even between two languages where there is little or no access to parallel translations, in what is known as Unsupervised Machine Translation (UMT).

Machine Translation Translation

Paper
Code

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

no code implementations • 28 Apr 2022 • Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks.

Decision Making Offline RL

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.