The Next Input updates (1,127 published updates)

Browse every published The Next Input update in a calm card overview with images, dates, and direct access to each article.

Jukebox

Read paper

Improving verifiability in AI development

Read paper

OpenAI Microscope

Browse Microscope

OpenAI standardizes on PyTorch

We are standardizing OpenAI’s deep learning framework on PyTorch.

Scaling laws for neural language models

Read paper

Dota 2 with large scale deep reinforcement learning

Read paper

Deep double descent

Read paper

Procgen Benchmark

Procgen Benchmark consists of 16 unique environments designed to measure both sample efficiency and generalization in reinforcement learning. This benchmark is ideal for evaluating generalization since distinct training and test sets can be generated in each environment. This benchmark is also well-suited to evaluate sample efficiency, since all environments pose diverse and compelling challenges for RL agents. The environments’ intrinsic diversity demands that agents learn robust policies; overfitting to narrow regions in state space will not suffice. Put differently, the ability to generalize becomes an integral component of success when agents are faced with ever-changing levels.

Safety Gym

To study constrained RL for safe exploration, we developed a new set of environments and tools called Safety Gym. By comparison to existing environments for constrained RL, Safety Gym environments are richer and feature a wider range of difficulty and complexity.

Benchmarking safe exploration in deep reinforcement learning

Read paper

GPT-2: 1.5B release

Solving Rubik’s Cube with a robot hand

Read paper / Watch all videos

Showing 925 to 936 of 1,127 updates.

Gemini is coming