Meta-Learning: Learning to Learn Fast
[Updated on 2019-10-01: thanks to Tianhao, we have this post translated in Chinese!]