The Next Input updates

993 published updates

Browse every published The Next Input update in a calm card overview with images, dates, and direct access to each article.

The Next Input update

The Next Input
16 Aug 2017

More on Dota 2

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to beating the top pros and has continued to improve since then. Supervised deep learning systems can only be as good as their training datasets, but in self-play systems, the available data improves automatically as the agent gets better.

11 Aug 2017

Dota 2

3 Aug 2017

Gathering human feedback

27 Jul 2017

Better exploration with parameter noise

20 Jul 2017

Proximal Policy Optimization

17 Jul 2017

Robust adversarial inputs

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like.

5 Jul 2017

Hindsight Experience Replay

1 Jul 2017

Teacher–student curriculum learning

28 Jun 2017

Faster physics in Python

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

13 Jun 2017

Learning from human preferences

8 Jun 2017

Learning to cooperate, compete, and communicate

5 Jun 2017

UCB exploration via Q-ensembles

Showing 913 to 924 of 993 updates.