The Next Input updates
Browse every published The Next Input update in a calm card overview with images, dates, and direct access to each article.
The Next Input update
OpenAI Baselines: DQN
View code
The Next Input update
Robots that learn
We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.
The Next Input update
Roboschool
We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.
The Next Input update
Equivalence between policy gradients and soft Q-learning
The Next Input update
Stochastic Neural Networks for hierarchical reinforcement learning
Read paper
The Next Input update
Unsupervised sentiment neuron
We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.
The Next Input update
Spam detection in the physical world
We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.
The Next Input update
Evolution strategies as a scalable alternative to reinforcement learning
Read paper / Read documentation
The Next Input update
One-shot imitation learning
Read paper
The Next Input update
Distill
We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).
The Next Input update
Learning to communicate
The Next Input update
Emergence of grounded compositional language in multi-agent populations
Read paper
Showing 925 to 936 of 993 updates.