Provide.ai - We Provide AI To Companies

Research

Equivalence between policy gradients and soft Q-learning

OpenAI News / April 21, 2017

Research

Stochastic Neural Networks for hierarchical reinforcement learning

OpenAI News / April 10, 2017

Research

Unsupervised sentiment neuron

OpenAI News / April 6, 2017

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

Uncategorised

Why Momentum Really Works

Distill / April 4, 2017

We often think of optimization with momentum as a ball rolling down a hill. This isn’t wrong, but there is much more to the story.

Research

Spam detection in the physical world

OpenAI News / April 1, 2017

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

Research

Evolution strategies as a scalable alternative to reinforcement learning

OpenAI News / March 24, 2017

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL…

Uncategorised

Research Debt

Distill / March 22, 2017

Science is a human activity. When we fail to distill and explain research, we accumulate a kind of debt…

Research

One-shot imitation learning

OpenAI News / March 21, 2017

Company

Distill

OpenAI News / March 20, 2017

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

Research

Learning to communicate

OpenAI News / March 16, 2017

In this post we’ll outline new OpenAI research in which agents develop their own language.