Provide.ai - We Provide AI To Companies

Uncategorised

World Models Experiments

大トロ / June 9, 2018

GitHub

In this article I will give step-by-step instructions for reproducing the experiments in the World Models article (pdf). The reference TensorFlow implementation is on GitHub.

Other people have implemented World Models independently. The…

Research

GamePad: A learning environment for theorem proving

OpenAI News / June 2, 2018

Company

OpenAI Fellows Fall 2018

OpenAI News / May 30, 2018

We’re now accepting applications for the next cohort of OpenAI Fellows, a program which offers a compensated 6-month apprenticeship in AI research at OpenAI.

Research

We’re releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly released game count from around 70 Atari games and 30 Sega games to over 1,000 games across a variety of backing emulators…

Research

AI and compute

OpenAI News / May 16, 2018

We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with a 3.4-month doubling time (by comparison, Moore’s Law had a 2-year doubling period)…
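
To put the two doubling times quoted above on the same scale, here is a minimal sketch that converts a doubling time into a yearly growth factor using 2^(12 / doubling time in months). The function name `annual_growth_factor` is hypothetical and chosen for illustration; only the 3.4-month and 2-year figures come from the excerpt.

```python
def annual_growth_factor(doubling_time_months: float) -> float:
    """Return the multiplicative growth over 12 months for a given doubling time."""
    if doubling_time_months <= 0:
        raise ValueError("doubling time must be positive")
    return 2.0 ** (12.0 / doubling_time_months)


# Doubling times taken from the excerpt: 3.4 months for the largest AI
# training runs, 24 months (2 years) for the Moore's Law comparison.
compute_growth = annual_growth_factor(3.4)
moore_growth = annual_growth_factor(24.0)

print(f"3.4-month doubling: about {compute_growth:.1f}x per year")
print(f"24-month doubling:  about {moore_growth:.2f}x per year")
```

Under these assumptions, a 3.4-month doubling time compounds to roughly an order of magnitude per year, while a 2-year doubling period compounds to well under 2x per year.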

Uncategorised

Implementing Deep Reinforcement Learning Models with Tensorflow + OpenAI Gym

Posts on Lil'Log / May 5, 2018

The full implementation is available in lilianweng/deep-reinforcement-learning-gym.
In the previous two posts, I introduced the algorithms of many deep reinforcement learning models. Now it is time to get our hands dirty and practice how to im…

Safety & Alignment

AI safety via debate

OpenAI News / May 3, 2018

We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.

Research

Evolved Policy Gradients

OpenAI News / April 18, 2018

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test…

Machine Learning & Statistics, programming

The Batch Normalization layer of Keras is broken

Vasilis Vryniotis / April 17, 2018

UPDATE: Unfortunately my Pull-Request to Keras that changed the behaviour of the Batch Normalization layer was not accepted. You can read the details here. For those of you who are brave enough to mess with custom implementations, you can find the code…

Research

Gotta Learn Fast: A new benchmark for generalization in RL

OpenAI News / April 10, 2018