Provide.ai - We Provide AI To Companies

Safety & Alignment

Benchmarking safe exploration in deep reinforcement learning

OpenAI News / November 21, 2019

Uncategorised

Self-Supervised Representation Learning

Posts on Lil'Log / November 10, 2019

[Updated on 2020-01-09: add a new section on Contrastive Predictive Coding].

[Updated on 2020-04-13: add a “Momentum Contrast” section on MoCo, SimCLR and CURL.]

[Updated on 2020-07-08: add a “Bisimulation” section on DeepMDP…

finance, leverage

Robinhood, Leverage, and Lemonade

Unknown / November 6, 2019

DISCLAIMER: NO INVESTMENT OR LEGAL ADVICE. The Content is for informational purposes only; you should not construe any such information or other material as legal, tax, investment, financial, or other advice. Investing involves risk; please consult a fin…

Research

GPT-2: 1.5B release

OpenAI News / November 5, 2019

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of GPT-2 models. While there have been larger language models rele…

Uncategorised

Computing Receptive Fields of Convolutional Neural Networks

Distill / November 4, 2019

Detailed derivations and open-source code to analyze the receptive fields of convnets.

Artificial Intelligence, Featured, self-driving vehicles, Technology

The Future Impacts of Driverless Cars

Elizabeth Robert / October 29, 2019

The way we drive is changing. From futuristic Hollywood movies to sci-fi novels, the idea of driverless cars has been around for some time now, but it has never felt as close to becoming reality as it does today. The UK’s De…

Uncategorised

Learning to Predict Without Looking Ahead

大トロ / October 29, 2019

Rather than hardcoding forward prediction, we try to get agents to learn that they need to predict the future.

Redirecting to learningtopredict.github.io, where the article resides.

Research

Solving Rubik’s Cube with a robot hand

OpenAI News / October 15, 2019

We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automat…

Company

OpenAI Scholars 2020: Applications open

OpenAI News / October 11, 2019

We are now accepting applications for our third class of OpenAI Scholars.

Uncategorised

The Paths Perspective on Value Learning

Distill / September 30, 2019

A closer look at how Temporal Difference Learning merges paths of experience for greater statistical efficiency.