Provide.ai - We Provide AI To Companies

How AI training scales

OpenAI News / December 14, 2018

We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex tasks tend to have noisier gradients, increasingly large batch sizes are like…

Research

Quantifying generalization in reinforcement learning

OpenAI News / December 6, 2018

We’re releasing CoinRun, a training environment which provides a metric for an agent’s ability to transfer its experience to novel situations and has already helped clarify a longstanding puzzle in reinforcement learning. CoinRun strikes a desirable ba…

ai, deep-learning, Machine Learning

Machine Learning Memes

Unknown / December 1, 2018

A periodically updated list of my favorite Deep Learning memes. Enjoy!

Content warning: may contain crude humor.

Caption: The Gary Marcus/Yoshua Bengio debate. (Thanks to Jackie Kay for sending me this.)

…

Uncategorised

Meta-Learning: Learning to Learn Fast

Posts on Lil'Log / November 30, 2018

[Updated on 2019-10-01: thanks to Tianhao, this post has been translated into Chinese!]

Machine Learning, python

Model evaluation, model selection, and algorithm selection in machine learning

Sebastian Raschka, PhD / November 10, 2018

This final article in the series *Model evaluation, model selection, and algorithm selection in machine learning* presents overviews of several statistical…

Research

Spinning Up in Deep RL

OpenAI News / November 8, 2018

We’re releasing Spinning Up in Deep RL, an educational resource designed to let anyone learn to become a skilled practitioner in deep reinforcement learning. Spinning Up consists of crystal-clear examples of RL code, educational exercises, documentatio…

Research

Learning concepts with energy functions

OpenAI News / November 7, 2018

We’ve developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, and furthest, expressed as sets of 2D points. Our model learns these concepts after only five demonstrat…

Research

Plan online, learn offline: Efficient learning and exploration via model-based control

OpenAI News / November 5, 2018

Research

Reinforcement learning with prediction-based rewards

OpenAI News / October 31, 2018

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Reveng…

Safety & Alignment

Learning complex goals with iterated amplification

OpenAI News / October 22, 2018

We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled dat…