Machine Learning - Provide.ai

ai, deep-learning, llm, Machine Learning

Understanding Reasoning LLMs

Sebastian Raschka, PhD / February 5, 2025

In this article, I will describe the four main approaches to building reasoning models, or how we can enhance LLMs with reasoning capabilities. I hope this…

ai, deep-learning, llm, Machine Learning

Noteworthy LLM Research Papers of 2024

Sebastian Raschka, PhD / January 23, 2025

This article covers 12 influential AI research papers of 2024, ranging from mixture-of-experts models to new LLM scaling laws for precision.

ai, deep-learning, llm, Machine Learning

Implementing A Byte Pair Encoding (BPE) Tokenizer From Scratch

Sebastian Raschka, PhD / January 17, 2025

This is a standalone notebook implementing the popular byte pair encoding (BPE) tokenization algorithm, which is used in models like GPT-2 to GPT-4, Llama…

ai, deep-learning, llm, Machine Learning

LLM Research Papers: The 2024 List

Sebastian Raschka, PhD / December 29, 2024

I want to share my running bookmark list of many fascinating (mostly LLM-related) papers I stumbled upon in 2024. It’s just a list, but maybe it will come…

ai, community, computer-vision, Google AI, LiteRT, Machine Learning, On-device AI, Person Detection, Research, TensorFlow Lite

Introducing Wake Vision: A High-Quality, Large-Scale Dataset for TinyML Computer Vision Applications

TensorFlow Blog / December 5, 2024

Posted by Colby Banbury, Emil Njor, Andrea Mattia Garavagno, Vijay Janapa Reddi – Harvard University

TinyML is an exciting frontier in machine learning, enabling models to run on extremely low-power devices such as microcontrollers and edge dev…

ai, community, data-engineering, Deployment, Machine Learning, Model Development, Monitoring & Maintenance, Optimization, SocratiQ, Systems Engineering

MLSysBook.AI: Principles and Practices of Machine Learning Systems Engineering

TensorFlow Blog / November 19, 2024

Posted by Jason Jabbour, Kai Kleinbard and Vijay Janapa Reddi (Harvard University)

Everyone wants to do the modeling work, but no one wants to do the engineering.

If ML developers are like astronauts exploring new frontiers, ML systems enginee…

ai, deep-learning, llm, Machine Learning

Understanding Multimodal LLMs

Sebastian Raschka, PhD / November 3, 2024

There has been a lot of new research on the multimodal LLM front, including the latest Llama 3.2 vision models, which employ diverse architectural…

ai, deep-learning, llm, Machine Learning

Building A GPT-Style LLM Classifier From Scratch

Sebastian Raschka, PhD / September 21, 2024

This article shows you how to transform pretrained large language models (LLMs) into strong text classifiers. But why focus on classification? First…

ai, deep-learning, llm, Machine Learning

Building LLMs from the Ground Up: A 3-hour Coding Workshop

Sebastian Raschka, PhD / September 1, 2024

This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from…

ai, deep-learning, llm, Machine Learning

New LLM Pre-training and Post-training Paradigms

Sebastian Raschka, PhD / August 17, 2024

There are hundreds of LLM papers each month proposing new techniques and approaches. However, one of the best ways to see what actually works well in…