Provide.ai - We Provide AI To Companies

Better exploration with parameter noise

OpenAI News / July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Uncategorised

Predict Stock Prices Using RNN: Part 2

Posts on Lil'Log / July 22, 2017

In this Part 2 tutorial, I would like to continue the topic of stock price prediction and to endow the recurrent neural network that I built in Part 1 with the capability of responding to multiple stocks. In order to distinguish the patterns assoc…

Research

Proximal Policy Optimization

OpenAI News / July 20, 2017

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably to or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinfor…

Research

Robust adversarial inputs

OpenAI News / July 17, 2017

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple…

Uncategorised

Predict Stock Prices Using RNN: Part 1

Posts on Lil'Log / July 8, 2017

This is a tutorial on how to build a recurrent neural network using TensorFlow to predict stock market prices. The full working code is available at github.com/lilianweng/stock-rnn. If you don’t know what a recurrent neural network or LSTM cel…

Research

Hindsight Experience Replay

OpenAI News / July 5, 2017

Research

Teacher–student curriculum learning

OpenAI News / July 1, 2017

Research

Faster physics in Python

OpenAI News / June 28, 2017

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

Uncategorised

An Overview of Deep Learning for Curious People

Posts on Lil'Log / June 21, 2017

(This post originated from my talk for the WiMLDS x Fintech meetup hosted by Affirm.)
I believe many of you have watched or heard of the games between AlphaGo and professional Go player Lee Sedol in 2016. Lee has the highest rank of nine dan and many w…

Safety & Alignment

Learning from human preferences

OpenAI News / June 13, 2017

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collabor…