llm - Provide.ai

ai, deep-learning, llm, Machine Learning

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?

Sebastian Raschka, PhD / June 2, 2024

This article covers three new papers related to instruction finetuning and parameter-efficient finetuning with LoRA in large language models (LLMs). I work…

ai, deep-learning, llm, Machine Learning

Developing an LLM: Building, Training, Finetuning

Sebastian Raschka, PhD / June 2, 2024

This is an overview of the LLM development process. This one-hour talk focuses on the essential three stages of developing an LLM: coding the architecture…

ai, deep-learning, llm, Machine Learning

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

Sebastian Raschka, PhD / May 12, 2024

What a month! We had four major open LLM releases: Mixtral, Meta AI’s Llama 3, Microsoft’s Phi-3, and Apple’s OpenELM. In my new article, I review and…

llm, Overviews

Financial Market Applications of LLMs

Richard Dewey / April 20, 2024

The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models (LLMs) that excel at modeling sequences of tokens that represent words or parts of words [2]. Amazingly, structural

ai, deep-learning, llm, Machine Learning

Using and Finetuning Pretrained Transformers

Sebastian Raschka, PhD / April 20, 2024

What are the different ways to use and finetune pretrained large language models (LLMs)? The three most common ways to use and finetune pretrained LLMs…

data-science, deep-learning, llm, Machine Learning

Tips for LLM Pretraining and Evaluating Reward Models

Sebastian Raschka, PhD / March 31, 2024

It’s another month in AI research, and it’s hard to pick favorites. This month, I am going over a paper that discusses strategies for the continued…

llm, Perspectives

Car-GPT: Could LLMs finally make self-driving cars happen?

Jérémy Cohen / March 8, 2024

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?

Interpretability, llm, nlp

Do text embeddings perfectly encode text?

Jack Morris / March 5, 2024

‘Vec2text’ can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.

data-science, deep-learning, llm, Machine Learning

Research Papers in February 2024

Sebastian Raschka, PhD / March 3, 2024

Once again, this has been an exciting month in AI research. This month, I’m covering two new openly available LLMs, insights into small finetuned LLMs, and…

data-science, deep-learning, llm, Machine Learning

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

Sebastian Raschka, PhD / February 18, 2024

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pretrained model (for example, an LLM or vision transformer) to better suit a…