Reasoning Models

ai, AI research, attention, large-language-models, LLMs, Open Source, pytorch, Reasoning Models

Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention

Sebastian Raschka, PhD / May 16, 2026

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

Agents, large-language-models, LLMs, python, Reasoning Models

Components of A Coding Agent

Sebastian Raschka, PhD / April 4, 2026

How coding agents use tools, memory, and repo context to make LLMs work better in practice

ai, AI research, attention, large-language-models, LLMs, Open Source, Reasoning Models

A Visual Guide to Attention Variants in Modern LLMs

Sebastian Raschka, PhD / March 22, 2026

From MHA and GQA to MLA, sparse attention, and hybrid architectures

ai, deep-learning, llm, Machine Learning, Reasoning Models

The State of Reinforcement Learning for LLM Reasoning

Sebastian Raschka, PhD / April 19, 2025

A lot has happened this month, especially with the releases of new flagship models like GPT-4.5 and Llama 4. But you might have noticed that reactions to…