Omspatel - Provide.ai

Artificial Intelligence, finance, Machine Learning, nlp, reinforcement-learning

How Small Can You Go? Testing DeepSeek-R1’s RL Technique on Tiny Financial Models

Omspatel / April 17, 2026

I trained 1.5B, 3B, and 7B models to answer questions about earnings reports using GRPO, the same reinforcement learning behind…Continue reading on Medium »

Author name: Omspatel

How Small Can You Go? Testing DeepSeek-R1’s RL Technique on Tiny Financial Models