The Next Input updates 994 published updates 994 gepubliceerde updates

The Next Input updates The Next Input-updates

Browse every published The Next Input update in a calm card overview with images, dates, and direct access to each article. Bekijk alle gepubliceerde The Next Input-updates in een rustig kaartenoverzicht met beelden, datums en directe toegang tot elk artikel.

The Next Input update The Next Input-update

The Next Input
30 Jan 2020 30 jan. 2020

OpenAI standardizes on PyTorch OpenAI standardizes on PyTorch

We are standardizing OpenAI’s deep learning framework on PyTorch. We are standardizing OpenAI’s deep learning framework on PyTorch.

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
23 Jan 2020 23 jan. 2020

Scaling laws for neural language models Scaling laws for neural language models

Read paper(opens in a new window) Read paper(opens in a new window)

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
13 Dec 2019 13 dec. 2019

Dota 2 with large scale deep reinforcement learning Dota 2 with large scale deep reinforcement learning

Read paper(opens in a new window) Read paper(opens in a new window)

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
5 Dec 2019 5 dec. 2019

Deep double descent Deep double descent

Read paper(opens in a new window) Read paper(opens in a new window)

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
3 Dec 2019 3 dec. 2019

Procgen Benchmark Procgen Benchmark

Procgen Benchmark consists of 16 unique environments designed to measure both sample efficiency and generalization in reinforcement learning. This benchmark is ideal for evaluating generalization since distinct training and test sets can be generated in each environment. This benchmark is also well-suited to evaluate sample efficiency, since all environments pose diverse and compelling challenges for RL agents. The environments’ intrinsic diversity demands that agents learn robust policies; overfitting to narrow regions in state space will not suffice. Put differently, the ability to generalize becomes an integral component of success when agents are faced with ever-changing levels. Procgen Benchmark consists of 16 unique environments designed to measure both sample efficiency and generalization in reinforcement learning. This benchmark is ideal for evaluating generalization since distinct training and test sets can be generated in each environment. This benchmark is also well-suited to evaluate sample efficiency, since all environments pose diverse and compelling challenges for RL agents. The environments’ intrinsic diversity demands that agents learn robust policies; overfitting to narrow regions in state space will not suffice. Put differently, the ability to generalize becomes an integral component of success when agents are faced with ever-changing levels.

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
21 Nov 2019 21 nov. 2019

Safety Gym Safety Gym

To study constrained RL for safe exploration, we developed a new set of environments and tools called Safety Gym. By comparison to existing environments for constrained RL, Safety Gym environments are richer and feature a wider range of difficulty and complexity. To study constrained RL for safe exploration, we developed a new set of environments and tools called Safety Gym. By comparison to existing environments for constrained RL, Safety Gym environments are richer and feature a wider range of difficulty and complexity.

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
21 Nov 2019 21 nov. 2019

Benchmarking safe exploration in deep reinforcement learning Benchmarking safe exploration in deep reinforcement learning

Read paper(opens in a new window) Read paper(opens in a new window)

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
5 Nov 2019 5 nov. 2019

GPT-2: 1.5B release GPT-2: 1.5B release

Title: GPT-2: 1.5B release Title: GPT-2: 1.5B release

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
15 Oct 2019 15 okt. 2019

Solving Rubik’s Cube with a robot hand Solving Rubik’s Cube with a robot hand

Read paper(opens in a new window)Watch all videos(opens in a new window) Read paper(opens in a new window)Watch all videos(opens in a new window)

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
11 Oct 2019 11 okt. 2019

OpenAI Scholars 2020: Applications open OpenAI Scholars 2020: Applications open

We are now accepting applications for our third class of OpenAI Scholars. We are now accepting applications for our third class of OpenAI Scholars.

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
19 Sep 2019 19 sep. 2019

Fine-tuning GPT-2 from human preferences Fine-tuning GPT-2 from human preferences

Title: Fine-tuning GPT-2 from human preferences Title: Fine-tuning GPT-2 from human preferences

Open article → Open artikel →

The Next Input update The Next Input-update

The Next Input
17 Sep 2019 17 sep. 2019

Emergent tool use from multi-agent interaction Emergent tool use from multi-agent interaction

Title: Emergent tool use from multi-agent interaction Title: Emergent tool use from multi-agent interaction

Open article → Open artikel →

Showing 817 to 828 of 994 updates. Je bekijkt 817 tot 828 van 994 updates.

Gemini komt eraan