llm - Provide.ai

ai, deep-learning, llm, Machine Learning

Understanding Multimodal LLMs

Sebastian Raschka, PhD / November 3, 2024

There has been a lot of new research on the multimodal LLM front, including the latest Llama 3.2 vision models, which employ diverse architectural…

ai, deep-learning, llm, Machine Learning

Building A GPT-Style LLM Classifier From Scratch

Sebastian Raschka, PhD / September 21, 2024

This article shows you how to transform pretrained large language models (LLMs) into strong text classifiers. But why focus on classification? First…

ai, deep-learning, llm, Machine Learning

Building LLMs from the Ground Up: A 3-hour Coding Workshop

Sebastian Raschka, PhD / September 1, 2024

This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from…

ai, deep-learning, llm, Machine Learning

New LLM Pre-training and Post-training Paradigms

Sebastian Raschka, PhD / August 17, 2024

There are hundreds of LLM papers each month proposing new techniques and approaches. However, one of the best ways to see what actually works well in…

ai, deep-learning, llm, Machine Learning

Instruction Pretraining LLMs

Sebastian Raschka, PhD / July 20, 2024

This article covers a new, cost-effective method for generating data for instruction finetuning LLMs; instruction finetuning from scratch; pretraining LLMs…

ai, deep-learning, llm, Machine Learning

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?

Sebastian Raschka, PhD / June 2, 2024

This article covers three new papers related to instruction finetuning and parameter-efficient finetuning with LoRA in large language models (LLMs). I work…

ai, deep-learning, llm, Machine Learning

Developing an LLM: Building, Training, Finetuning

Sebastian Raschka, PhD / June 2, 2024

This is an overview of the LLM development process. This one-hour talk focuses on the essential three stages of developing an LLM: coding the architecture…

ai, deep-learning, llm, Machine Learning

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

Sebastian Raschka, PhD / May 12, 2024

What a month! We had four major open LLM releases: Mixtral, Meta AI’s Llama 3, Microsoft’s Phi-3, and Apple’s OpenELM. In my new article, I review and…

llm, Overviews

Financial Market Applications of LLMs

Richard Dewey / April 20, 2024

The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models (LLMs) that excel at modeling sequences of tokens that represent words or parts of words [2]. Amazingly, structural

ai, deep-learning, llm, Machine Learning

Using and Finetuning Pretrained Transformers

Sebastian Raschka, PhD / April 20, 2024

What are the different ways to use and finetune pretrained large language models (LLMs)? The three most common ways to use and finetune pretrained LLMs…