Provide.ai - We Provide AI To Companies

Learning with opponent-learning awareness

OpenAI News / September 13, 2017

Datumbox Machine Learning Framework v0.8.1 released

Vasilis Vryniotis / August 31, 2017

The Datumbox v0.8.1 has been released! Download it now from Github or Maven Central Repository. What is new? The main focus of version 0.8.1 is to resolve various bugs, update the depedencies and improve the code architecture of the framework. Here are…

Uncategorised

From GAN to WGAN

Posts on Lil'Log / August 20, 2017

[Updated on 2018-09-30: thanks to Yoonju, we have this post translated in Korean!]

[Updated on 2019-04-18: this post is also available on arXiv.]
Generative adversarial network (GAN) has shown great results in many generative tasks to replicate the r…

Research

OpenAI Baselines: ACKTR & A2C

OpenAI News / August 18, 2017

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement …

Research

More on Dota 2

OpenAI News / August 16, 2017

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to bea…

Research

Dota 2

OpenAI News / August 11, 2017

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards build…

Research

Gathering human feedback

OpenAI News / August 3, 2017

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforceme…

Uncategorised

How to Explain the Prediction of a Machine Learning Model?

Posts on Lil'Log / August 1, 2017

The machine learning models have started penetrating into critical areas like health care, justice systems, and financial industry. Thus to figure out how the models make the decisions and make sure the decisioning process is aligned with the ethnic r…

Research

Better exploration with parameter noise

OpenAI News / July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Uncategorised

Predict Stock Prices Using RNN: Part 2

Posts on Lil'Log / July 22, 2017

In the Part 2 tutorial, I would like to continue the topic on stock price prediction and to endow the recurrent neural network that I have built in Part 1 with the capability of responding to multiple stocks. In order to distinguish the patterns assoc…