Recommendations for Getting the Most Out of a Technical Book
This short article compiles a few notes I have previously shared when readers asked how to get the most out of my books on building large language models from scratch…
After I shared my Big LLM Architecture Comparison a few months ago, which focused on the main transformer-based LLMs, I received a lot of questions with…
The DGX Spark for local LLM inference and fine-tuning was a pretty popular discussion topic recently. I got to play with one myself, primarily working…
Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples
Previously, I compared the most notable open-weight architectures of 2025 in The Big LLM Architecture Comparison. Then, I zoomed in and discussed the…
OpenAI just released their new open-weight LLMs this week: gpt-oss-120b and gpt-oss-20b, their first open-weight models since GPT-2 in 2019. And yes, thanks…
It has been seven years since the original GPT architecture was developed. At first glance, looking back at GPT-2 (2019) and forward to DeepSeek-V3 and…
The latest in LLM research with a hand-curated, topic-organized list of over 200 research papers from 2025.
KV caches are one of the most critical techniques for efficient LLM inference in production. They are an important component for compute-efficient…
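To make the idea concrete, here is a minimal, illustrative sketch (not taken from the article itself) of how a KV cache avoids recomputation during autoregressive decoding: each new token's key and value vectors are appended once and reused at every later step, so attention only ever computes the new query against the stored history. The `attention` and `KVCache` names are hypothetical, chosen for this example.

```python
import math

def attention(query, keys, values):
    # Scaled dot-product attention for a single query vector over the
    # cached keys/values (plain Python lists, one vector per past token).
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]   # numerically stable softmax
    total = sum(exps)
    weights = [e / total for e in exps]
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

class KVCache:
    """Stores the keys and values of previously generated tokens.

    Without the cache, step t would recompute keys/values for all t
    previous tokens; with it, each step appends exactly one new pair.
    """
    def __init__(self):
        self.keys, self.values = [], []

    def step(self, query, key, value):
        # Append this step's key/value, then attend over the full history.
        self.keys.append(key)
        self.values.append(value)
        return attention(query, self.keys, self.values)
```

A real implementation would maintain one cache per layer (and per attention head) and store tensors rather than lists, but the growth pattern is the same: memory scales with sequence length while per-step compute stays roughly constant.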
Why build an LLM from scratch? It’s probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…