Search Results for author: Ksenia Konyushkova

Found 14 papers, 5 papers with code

Reinforced Self-Training (ReST) for Language Modeling

no code implementations • 17 Aug 2023 • Caglar Gulcehre, Tom Le Paine, Srivatsan Srinivasan, Ksenia Konyushkova, Lotte Weerts, Abhishek Sharma, Aditya Siddhant, Alex Ahern, Miaosen Wang, Chenjie Gu, Wolfgang Macherey, Arnaud Doucet, Orhan Firat, Nando de Freitas

Reinforcement learning from human feedback (RLHF) can improve the quality of large language model's (LLM) outputs by aligning them with human preferences.

Language Modelling Machine Translation +4

Paper
Add Code

$\pi2\text{vec}$: Policy Representations with Successor Features

no code implementations • 16 Jun 2023 • Gianluca Scarpellini, Ksenia Konyushkova, Claudio Fantacci, Tom Le Paine, Yutian Chen, Misha Denil

This paper describes $\pi2\text{vec}$, a method for representing behaviors of black box policies as feature vectors.

Offline RL

Paper
Add Code

Vision-Language Models as Success Detectors

no code implementations • 13 Mar 2023 • Yuqing Du, Ksenia Konyushkova, Misha Denil, Akhil Raju, Jessica Landon, Felix Hill, Nando de Freitas, Serkan Cabi

Detecting successful behaviour is crucial for training intelligent agents.

Question Answering Visual Question Answering

Paper
Add Code

Retrieval-Augmented Reinforcement Learning

no code implementations • 17 Feb 2022 • Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Laurent SIfre, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

In this paper we explore an alternative paradigm in which we train a network to map a dataset of past experiences to optimal behavior.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Active Offline Policy Selection

1 code implementation • NeurIPS 2021 • Ksenia Konyushkova, Yutian Chen, Tom Le Paine, Caglar Gulcehre, Cosmin Paduraru, Daniel J Mankowitz, Misha Denil, Nando de Freitas

We use multiple benchmarks, including real-world robotics, with a large number of candidate policies to show that the proposed approach improves upon state-of-the-art OPE estimates and pure online policy evaluation.

Bayesian Optimization Off-policy evaluation

Paper
Code

Semi-supervised reward learning for offline reinforcement learning

no code implementations • 12 Dec 2020 • Ksenia Konyushkova, Konrad Zolna, Yusuf Aytar, Alexander Novikov, Scott Reed, Serkan Cabi, Nando de Freitas

In offline reinforcement learning (RL) agents are trained using a logged dataset.

Behavioural cloning reinforcement-learning +1

Paper
Add Code

Offline Learning from Demonstrations and Unlabeled Experience

no code implementations • 27 Nov 2020 • Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

Behavior cloning (BC) is often practical for robot learning because it allows a policy to be trained offline without rewards, by supervised learning on expert demonstrations.

Continuous Control Imitation Learning

Paper
Add Code

Scaling data-driven robotics with reward sketching and batch reinforcement learning

1 code implementation • 26 Sep 2019 • Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions.

reinforcement-learning Reinforcement Learning (RL)

12,866

Paper
Code

Discovering General-Purpose Active Learning Strategies

1 code implementation • ICLR 2019 • Ksenia Konyushkova, Raphael Sznitman, Pascal Fua

We propose a general-purpose approach to discovering active learning (AL) strategies from data.

Active Learning BIG-bench Machine Learning +2

Paper
Code

Learning Intelligent Dialogs for Bounding Box Annotation

1 code implementation • CVPR 2018 • Ksenia Konyushkova, Jasper Uijlings, Christoph Lampert, Vittorio Ferrari

We demonstrate that (1) our agents are able to learn efficient annotation strategies in several scenarios, automatically adapting to the image difficulty, the desired quality of the boxes, and the detector strength; (2) in all scenarios the resulting annotation dialogs speed up annotation compared to manual box drawing alone and box verification alone, while also outperforming any fixed combination of verification and drawing in most scenarios; (3) in a realistic scenario where the detector is iteratively re-trained, our agents evolve a series of strategies that reflect the shifting trade-off between verification and drawing as the detector grows stronger.

Paper
Code

Learning Active Learning from Data

1 code implementation • NeurIPS 2017 • Ksenia Konyushkova, Raphael Sznitman, Pascal Fua

In this paper, we suggest a novel data-driven approach to active learning (AL).

Active Learning regression

Paper
Code

Geometry in Active Learning for Binary and Multi-class Image Segmentation

no code implementations • 29 Jun 2016 • Ksenia Konyushkova, Raphael Sznitman, Pascal Fua

Our approach combines geometric smoothness priors in the image space with more traditional uncertainty measures to estimate which pixels or voxels are the most informative, and thus should to be annotated next.

Active Learning Image Segmentation +1

Paper
Add Code

God(s) Know(s): Developmental and Cross-Cultural Patterns in Children Drawings

no code implementations • 11 Nov 2015 • Ksenia Konyushkova, Nikolaos Arvanitopoulos, Zhargalma Dandarova Robert, Pierre-Yves Brandt, Sabine Süsstrunk

This paper introduces a novel approach to data analysis designed for the needs of specialists in psychology of religion.

Paper
Add Code

Introducing Geometry in Active Learning for Image Segmentation

no code implementations • ICCV 2015 • Ksenia Konyushkova, Raphael Sznitman, Pascal Fua

We propose an Active Learning approach to training a segmentation classifier that exploits geometric priors to streamline the annotation process in 3D image volumes.

Active Learning Image Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.