deep-learning - Provide.ai

ai, deep-learning, llm, Machine Learning

LLM Research Papers: The 2025 List (January to June)

Sebastian Raschka, PhD / July 1, 2025

The latest in LLM research with a hand-curated, topic-organized list of over 200 research papers from 2025.

ai, deep-learning, llm, Machine Learning

Understanding and Coding the KV Cache in LLMs from Scratch

Sebastian Raschka, PhD / June 17, 2025

KV caches are one of the most critical techniques for efficient inference in LLMs in production. KV caches are an important component for compute-efficient…

ai, deep-learning, llm, Machine Learning

Coding LLMs from the Ground Up: A Complete Course

Sebastian Raschka, PhD / May 10, 2025

Why build an LLM from scratch? It’s probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…

ai, deep-learning, llm, Machine Learning, Reasoning Models

The State of Reinforcement Learning for LLM Reasoning

Sebastian Raschka, PhD / April 19, 2025

A lot has happened this month, especially with the releases of new flagship models like GPT-4.5 and Llama 4. But you might have noticed that reactions to…

ai, deep-learning, llm, Machine Learning

First Look at Reasoning From Scratch: Chapter 1

Sebastian Raschka, PhD / March 29, 2025

As you know, I’ve been writing a lot lately about the latest research on reasoning in LLMs. Before my next research-focused blog post, I wanted to offer…

ai, deep-learning, llm, Machine Learning

Inference-Time Compute Scaling Methods to Improve Reasoning Models

Sebastian Raschka, PhD / March 8, 2025

This article explores recent research advancements in reasoning-optimized LLMs, with a particular focus on inference-time compute scaling methods that have emerged…

ai, deep-learning, llm, Machine Learning

Understanding Reasoning LLMs

Sebastian Raschka, PhD / February 5, 2025

In this article, I will describe the four main approaches to building reasoning models, or how we can enhance LLMs with reasoning capabilities. I hope this…

ai, deep-learning, llm, Machine Learning

Noteworthy LLM Research Papers of 2024

Sebastian Raschka, PhD / January 23, 2025

This article covers 12 influential AI research papers of 2024, ranging from mixture-of-experts models to new LLM scaling laws for precision.

ai, deep-learning, llm, Machine Learning

Implementing A Byte Pair Encoding (BPE) Tokenizer From Scratch

Sebastian Raschka, PhD / January 17, 2025

This is a standalone notebook implementing the popular byte pair encoding (BPE) tokenization algorithm, which is used in models such as GPT-2 through GPT-4, Llama…

ai, deep-learning, llm, Machine Learning

LLM Research Papers: The 2024 List

Sebastian Raschka, PhD / December 29, 2024

I want to share my running bookmark list of many fascinating (mostly LLM-related) papers I stumbled upon in 2024. It’s just a list, but maybe it will come…