Provide.ai - We Provide AI To Companies

Gathering human feedback

OpenAI News / August 3, 2017

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforceme…

Uncategorised

How to Explain the Prediction of a Machine Learning Model?

Posts on Lil'Log / August 1, 2017

The machine learning models have started penetrating into critical areas like health care, justice systems, and financial industry. Thus to figure out how the models make the decisions and make sure the decisioning process is aligned with the ethnic r…

Research

Better exploration with parameter noise

OpenAI News / July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Uncategorised

Predict Stock Prices Using RNN: Part 2

Posts on Lil'Log / July 22, 2017

In the Part 2 tutorial, I would like to continue the topic on stock price prediction and to endow the recurrent neural network that I have built in Part 1 with the capability of responding to multiple stocks. In order to distinguish the patterns assoc…

Research

Proximal Policy Optimization

OpenAI News / July 20, 2017

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinfor…

Research

Robust adversarial inputs

OpenAI News / July 17, 2017

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple…

Uncategorised

Predict Stock Prices Using RNN: Part 1

Posts on Lil'Log / July 8, 2017

This is a tutorial for how to build a recurrent neural network using Tensorflow to predict stock market prices. The full working code is available in github.com/lilianweng/stock-rnn. If you don’t know what is recurrent neural network or LSTM cel…

Research

Gathering human feedback

How to Explain the Prediction of a Machine Learning Model?

Better exploration with parameter noise

Predict Stock Prices Using RNN: Part 2

Proximal Policy Optimization

Robust adversarial inputs

Predict Stock Prices Using RNN: Part 1

Hindsight Experience Replay

Teacher–student curriculum learning

Faster physics in Python