Provide.ai - We Provide AI To Companies

Faulty reward functions in the wild

OpenAI News / December 21, 2016

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

Uncategorised

Experiments in Handwriting with a Neural Network

Distill / December 6, 2016

Several interactive visualizations of a generative model of handwriting. Some are fun, some are serious.

Research

Universe

OpenAI News / December 5, 2016

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.