AI safety via debate
We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.
We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test…
UPDATE: Unfortunately, my pull request to Keras that changed the behavior of the Batch Normalization layer was not accepted. You can read the details here. For those of you who are brave enough to mess with custom implementations, you can find the code…
[Updated on 2018-06-30: add two new policy gradient methods, SAC and D4PG.]
[Updated on 2018-09-30: add a new policy gradient method, TD3.]
[Updated on 2019-02-09: add SAC with automatically adjusted temperature.]
[Updated on 2019-06-26: Thanks to …
We’re launching a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience.
Can agents learn inside of their own dreams?
Redirecting to worldmodels.github.io, where the article resides.
On March 3rd, we hosted our first hackathon with 100 members of the artificial intelligence community.