OpenAI updates OpenAI-updates
Browse every published OpenAI update in a calm card overview with images, dates, and direct access to each article. Bekijk alle gepubliceerde OpenAI-updates in een rustig kaartenoverzicht met beelden, datums en directe toegang tot elk artikel.
OpenAI update OpenAI-update
Attacking machine learning with adversarial examples Attacking machine learning with adversarial examples
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult. Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.
OpenAI update OpenAI-update
Adversarial attacks on neural network policies Adversarial attacks on neural network policies
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
Team update Team update
Title: Team update Title: Team update
OpenAI update OpenAI-update
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
Read paper(opens in a new window)(opens in a new window) Read paper(opens in a new window)(opens in a new window)
OpenAI update OpenAI-update
Faulty reward functions in the wild Faulty reward functions in the wild
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function. Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.
OpenAI update OpenAI-update
Universe Universe
We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications. We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.
OpenAI update OpenAI-update
OpenAI and Microsoft OpenAI and Microsoft
We’re working with Microsoft to start running most of our large-scale experiments on Azure. We’re working with Microsoft to start running most of our large-scale experiments on Azure.
OpenAI update OpenAI-update
#Exploration: A study of count-based exploration for deep reinforcement learning #Exploration: A study of count-based exploration for deep reinforcement learning
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
On the quantitative analysis of decoder-based generative models On the quantitative analysis of decoder-based generative models
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
RL²: Fast reinforcement learning via slow reinforcement learning RL²: Fast reinforcement learning via slow reinforcement learning
Read paper(opens in a new window) Read paper(opens in a new window)
OpenAI update OpenAI-update
Variational lossy autoencoder Variational lossy autoencoder
Read paper(opens in a new window) Read paper(opens in a new window)
Showing 889 to 900 of 918 updates. Je bekijkt 889 tot 900 van 918 updates.