August 2020 gwern.net newsletter
with an essay on sidenotes; links on human competence, efficient-computing/hardware-overhangs; no reviews.
A collection of articles and comments with the goal of understanding how to design robust, general-purpose self-organizing systems.
Training an end-to-end differentiable, self-organising cellular automaton to classify MNIST digits.
In this blog post, I (briefly) review Christoph Molnar’s *Interpretable Machine Learning Book*. Then, I write about two classic generalized…
Links on the Uighurs, authoritarianism, negative emissions, AI overhang; 1 movie & 2 anime reviews
Although most popular and successful model architectures are designed by human experts, this does not mean we have explored the entire network architecture space and settled on the best option. We would have a better chance to find the optim…
The Datumbox Framework v0.8.2 has been released! Download it now from GitHub or the Maven Central Repository. What is new? Version 0.8.2 is a limited incremental release that focuses on resolving bugs and updating the dependencies of the framework. Her…
The first chapter (draft) of the Introduction to Deep Learning book, which is based on my lecture notes and slides.
Last week at ICML 2020, Mikael Henaff, Akshay Krishnamurthy, John Langford and I presented a paper on HOMER, a new reinforcement learning (RL) algorithm that solves three key problems in RL: (i) global exploration, (ii) decoding latent dynamics, and (iii) optimizing a given reward function. Our ICML poster is here. The paper is a bit mathematically heavy in nature so this …
Our third class of OpenAI Scholars presented their final projects at a virtual Demo Day, showcasing their research results from the past five months.