epicurus - Provide.ai

Uncategorised

Reinforcement Learning, Agency and Taste

epicurus / May 12, 2026

This started off as an entry for Dwarkesh’s blog post contest, specifically an answer to his first question on why intuitions about slowdowns in reinforcement learning (RL) progress have either not come true or have had mixed success. His 1000 word lim…

Author name: epicurus

Reinforcement Learning, Agency and Taste