Research

Image GPT

We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequences can generate coherent image completions and samples. By establishing a correlation between sample quality an…

Uncategorised

May Gwern.net Newsletter

Link compilation newsletter with anime GAN updates, links on AI scaling, discussion of GPT-3, and 1 book review.

Research

AI and efficiency

We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has been decreasing by a factor of 2 every 16 months. Compared to 2012, it now takes 44 times less…
