AI Safety Needs Social Scientists
If we want to train AI to do what humans want, we need to study humans.
If we want to train AI to do what humans want, we need to study humans.
We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involved. Properly aligning advanced AI systems with human values requires resolving many uncertain…
<!–
–>
<!–
–>
<!–
–>
<!–Evolved Biped Walker.
–>
PlaNet learns a world model from image inputs only and successfully leverages it for planning in latent space.
<!––>
GitHub
Redirecting to planetrl.github.io, where the article resides.
We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension, machine translation, questi…
Some thoughts on the interesting BagNet paper (accepted at ICLR 2019) currently being circulated around the Machine Learning Twitter Community.
Disclaimer: I wasn’t a reviewer of this paper for ICLR. I think it was worthy of acceptance to the c…
[Updated on 2019-02-14: add ULMFiT and GPT-2.]
[Updated on 2020-02-29: add ALBERT.]
[Updated on 2020-10-25: add RoBERTa.]
[Updated on 2020-12-13: add T5.]
[Updated on 2020-12-30: add GPT-3.]
[Updated on 2021-11-13: add XLNet, BART and ELECTRA; Also up…
A PDF version of this post can be found here.
Chinese translation by Xiaoyi Yin
Notions of uncertainty are tossed around in conversations around AI safety, risk management, portfolio optimization, scientific measurement, and insurance. Here are a few …
In Part 3, we have reviewed models in the R-CNN family. All of them are region-based object detection algorithms. They can achieve high accuracy but could be too slow for certain applications such as autonomous driving. In Part 4, we only focus on fas…
Our first cohort of OpenAI Fellows has concluded, with each Fellow going from a machine learning beginner to core OpenAI contributor in the course of a 6-month apprenticeship.