OpenAI Gym

168 papers with code • 14 benchmarks • 3 datasets

An open-source toolkit from OpenAI that implements several reinforcement learning benchmarks, including classic control, Atari, robotics, and MuJoCo tasks.

(Description by Evolutionary learning of interpretable decision trees)

(Image Credit: OpenAI Gym)
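
As a rough illustration of how an agent interacts with a Gym-style benchmark, the sketch below runs one episode against a small stand-in environment. `ToyCorridorEnv` and `run_episode` are hypothetical names invented for this example and are not part of Gym; the `reset`/`step` return shapes are assumed to follow the five-value convention used by recent Gym releases (older releases return a single `done` flag instead).

```python
class ToyCorridorEnv:
    """Hypothetical stand-in environment (not part of Gym).

    The agent starts at position 0 and must reach position `length`.
    Action 1 moves right; any other action moves left (floored at 0).
    """

    def __init__(self, length=5, max_steps=50):
        self.length = length
        self.max_steps = max_steps
        self.pos = 0
        self.steps = 0

    def reset(self):
        # Return the initial observation and an empty info dict.
        self.pos = 0
        self.steps = 0
        return self.pos, {}

    def step(self, action):
        self.steps += 1
        self.pos = max(0, self.pos + (1 if action == 1 else -1))
        terminated = self.pos >= self.length  # goal reached
        truncated = (not terminated) and self.steps >= self.max_steps  # time limit
        reward = 1.0 if terminated else -0.1
        return self.pos, reward, terminated, truncated, {}


def run_episode(env, policy):
    """Run one episode and return the undiscounted sum of rewards."""
    obs, info = env.reset()
    total = 0.0
    done = False
    while not done:
        action = policy(obs)
        obs, reward, terminated, truncated, info = env.step(action)
        total += reward
        done = terminated or truncated
    return total


if __name__ == "__main__":
    print(run_episode(ToyCorridorEnv(), lambda obs: 1))
```

With a real Gym installation, the same loop would typically be driven by an environment created from one of the benchmark names listed below rather than by the toy class.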

Benchmarks

These leaderboards track progress on OpenAI Gym benchmarks.

Dataset	Best Model
Ant-v4	MEow
Humanoid-v4	MEow
Walker2d-v4	SAC
Hopper-v4	MEow
HalfCheetah-v4	SAC
LunarLander-v2	Oblique decision tree
CartPole-v1	Orthogonal decision tree
Mountain Car	Orthogonal decision tree
Cart Pole (OpenAI Gym)	Oblique decision tree
Ant-v2	AWR
HalfCheetah-v2	AWR
Hopper-v2	AWR
Humanoid-v2	AWR
Walker2d-v2	AWR

Libraries

Use these libraries to find OpenAI Gym models and implementations.

toni-sm/skrl

5 papers

421

Datasets

Subtasks

Acrobot

Most implemented papers

Proximal Policy Optimization Algorithms

labmlai/annotated_deep_learning_paper_implementations • 20 Jul 2017

We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent.

170
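
The excerpt above does not spell out the "surrogate" objective. A minimal sketch, assuming the clipped variant commonly associated with PPO, is given below; the function name `clipped_surrogate`, its arguments, and the default `clip_eps=0.2` are illustrative choices rather than anything taken from the listed repository.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch of samples.

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. advantages: advantage estimates.
    All names and the default clip_eps are assumptions for illustration.
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)  # probability ratio pi_new / pi_old
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

In training, this quantity would be maximized with stochastic gradient ascent over minibatches of sampled transitions; the pure-Python version here only evaluates it.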

Continuous control with deep reinforcement learning

ray-project/ray • 9 Sep 2015

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain.

158

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

haarnoja/sac • ICML 2018

Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks.

Addressing Function Approximation Error in Actor-Critic Methods

sfujim/TD3 • ICML 2018

In value-based reinforcement learning methods such as deep Q-learning, function approximation errors are known to lead to overestimated value estimates and suboptimal policies.
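
One common way to counter the overestimation described above is to bootstrap from the smaller of two critic estimates. The sketch below assumes that approach; `clipped_double_q_target` and its parameters are hypothetical names for illustration and are not taken from the listed repository.

```python
def clipped_double_q_target(reward, done, q1_next, q2_next, gamma=0.99):
    """One-step TD target using the minimum of two next-state value estimates.

    q1_next / q2_next: two independent critics' estimates for the next
    state-action pair (illustrative inputs). No bootstrapping when done.
    """
    bootstrap = 0.0 if done else min(q1_next, q2_next)
    return reward + gamma * bootstrap
```

Taking the minimum biases the target downward, which trades a possible underestimate for protection against the compounding overestimates mentioned in the excerpt.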

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

DartEnv/dart-env • 26 Feb 2018

The purpose of this technical report is two-fold.

Decision Transformer: Reinforcement Learning via Sequence Modeling

kzl/decision-transformer • NeurIPS 2021

In particular, we present Decision Transformer, an architecture that casts the problem of RL as conditional sequence modeling.
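
Conditional sequence modeling of this kind is usually conditioned on the return still to be collected from each timestep onward. As a small sketch under that assumption, the helper below computes undiscounted returns-to-go from a reward sequence; the name `returns_to_go` is chosen for this example and does not come from the listed repository.

```python
def returns_to_go(rewards):
    """Return, for each timestep t, the sum of rewards from t to the end."""
    out = [0] * len(rewards)
    running = 0
    # Walk backwards so each entry reuses the suffix sum computed after it.
    for t in range(len(rewards) - 1, -1, -1):
        running += rewards[t]
        out[t] = running
    return out
```

For instance, rewards `[1, 2, 3]` give returns-to-go `[6, 5, 3]`; each value would be paired with the state and action at the same timestep to form the model's input sequence.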

Deep Recurrent Q-Learning for Partially Observable MDPs

marload/DeepRL-TensorFlow2 • 23 Jul 2015

Deep Reinforcement Learning has yielded proficient controllers for complex tasks.

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

google/trax • 1 Oct 2019

In this paper, we aim to develop a simple and scalable reinforcement learning algorithm that uses standard supervised learning methods as subroutines.

Deep Reinforcement Learning for Playing 2.5D Fighting Games

elvisyjlin/lf2gym • 5 May 2018

Deep reinforcement learning has shown its success in game playing.

Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

ruizhaogit/mep • 21 May 2019

This objective encourages the agent to maximize the expected return, as well as to achieve more diverse goals.
