Components of a Coding Agent
How coding agents use tools, memory, and repo context to make LLMs work better in practice