OpenAI updates OpenAI-updates
Browse every published OpenAI update in a calm card overview with images, dates, and direct access to each article. Bekijk alle gepubliceerde OpenAI-updates in een rustig kaartenoverzicht met beelden, datums en directe toegang tot elk artikel.
OpenAI update OpenAI-update
Gathering human feedback Gathering human feedback
View code(opens in a new window) View code(opens in a new window)
OpenAI update OpenAI-update
Better exploration with parameter noise Better exploration with parameter noise
Read code(opens in a new window)Read paper(opens in a new window) Read code(opens in a new window)Read paper(opens in a new window)
OpenAI update OpenAI-update
Proximal Policy Optimization Proximal Policy Optimization
View code(opens in a new window)Read paper(opens in a new window) View code(opens in a new window)Read paper(opens in a new window)
OpenAI update OpenAI-update
Robust adversarial inputs Robust adversarial inputs
We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like. We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like.
OpenAI update OpenAI-update
Hindsight Experience Replay Hindsight Experience Replay
OpenAI update OpenAI-update
Teacher–student curriculum learning Teacher–student curriculum learning
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
Faster physics in Python Faster physics in Python
We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research. We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.
OpenAI update OpenAI-update
Learning from human preferences Learning from human preferences
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
Learning to cooperate, compete, and communicate Learning to cooperate, compete, and communicate
View code(opens in a new window)Read paper(opens in a new window) View code(opens in a new window)Read paper(opens in a new window)
OpenAI update OpenAI-update
UCB exploration via Q-ensembles UCB exploration via Q-ensembles
OpenAI update OpenAI-update
OpenAI Baselines: DQN OpenAI Baselines: DQN
View code(opens in a new window) View code(opens in a new window)
OpenAI update OpenAI-update
Robots that learn Robots that learn
We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once. We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.
Showing 865 to 876 of 918 updates. Je bekijkt 865 tot 876 van 918 updates.