Provide.ai - We Provide AI To Companies

Proximal Policy Optimization

OpenAI News / July 20, 2017

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinfor…

Research

Robust adversarial inputs

OpenAI News / July 17, 2017

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple…

Uncategorised

Predict Stock Prices Using RNN: Part 1

Posts on Lil'Log / July 8, 2017

This is a tutorial for how to build a recurrent neural network using Tensorflow to predict stock market prices. The full working code is available in github.com/lilianweng/stock-rnn. If you don’t know what is recurrent neural network or LSTM cel…

Research

Hindsight Experience Replay

OpenAI News / July 5, 2017

Research

Teacher–student curriculum learning

OpenAI News / July 1, 2017

Research

Faster physics in Python

OpenAI News / June 28, 2017

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

Uncategorised

An Overview of Deep Learning for Curious People

Posts on Lil'Log / June 21, 2017

(The post was originated from my talk for WiMLDS x Fintech meetup hosted by Affirm.)
I believe many of you have watched or heard of the games between AlphaGo and professional Go player Lee Sedol in 2016. Lee has the highest rank of nine dan and many w…

Safety & Alignment

Learning from human preferences

OpenAI News / June 13, 2017

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collabor…

Research

Learning to cooperate, compete, and communicate

OpenAI News / June 8, 2017

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum—the difficulty of the environment is determined by the skill of …

Research

UCB exploration via Q-ensembles

OpenAI News / June 5, 2017