Tips for LLM Pretraining and Evaluating Reward Models
It’s another month in AI research, and it’s hard to pick favorites. This month, I am going over a paper that discusses strategies for the continued…
Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?
‘Vec2text’ can accurately invert text embeddings back into the original text, highlighting the urgent need to revisit security assumptions around embedded data.
Once again, this has been an exciting month in AI research. This month, I’m covering two new openly available LLMs, insights into small finetuned LLMs, and…
Low-rank adaptation (LoRA) is a machine learning technique that modifies a pretrained model (for example, an LLM or vision transformer) to better suit a…
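The core idea behind LoRA can be sketched in a few lines: a frozen pretrained weight matrix W is augmented with a trainable low-rank update B·A, scaled by alpha/r. The sketch below is illustrative only (the dimensions and variable names are hypothetical, not any particular library's API):

```python
import numpy as np

# Minimal LoRA sketch: the frozen weight W (d_out x d_in) is adapted by
# adding a low-rank update B @ A, where B is (d_out x r) and A is
# (r x d_in) with rank r << min(d_out, d_in). Only A and B are trained,
# which is why LoRA is so parameter-efficient.
rng = np.random.default_rng(0)

d_out, d_in, r = 8, 16, 2   # hypothetical dimensions
alpha = 4.0                 # LoRA scaling hyperparameter

W = rng.normal(size=(d_out, d_in))      # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01   # trainable, small random init
B = np.zeros((d_out, r))                # trainable, zero init

def lora_forward(x):
    # Frozen path plus scaled low-rank path.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# Because B starts at zero, the adapted model initially matches the
# pretrained model exactly; training then moves only A and B.
assert np.allclose(lora_forward(x), W @ x)
```

Note that the trainable parameter count is r·(d_in + d_out) instead of d_in·d_out, which is where the memory and storage savings come from.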
This article focuses on improving the modeling performance of LLMs by finetuning them using carefully curated datasets. Specifically, this article…
Large language models (LLMs) offer one of the most interesting opportunities for developing more efficient training methods. A few weeks ago, the NeurIPS…
Peak memory consumption is a common bottleneck when training deep learning models such as vision transformers and LLMs. This article provides a series of…
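One common way to lower peak memory is gradient accumulation (an assumption here, since the article's own list of techniques is truncated above): gradients are summed over small micro-batches so that activations for only one micro-batch live in memory at a time, while the resulting update matches full-batch training. A minimal numpy sketch with a toy linear model:

```python
import numpy as np

rng = np.random.default_rng(1)
w = rng.normal(size=4)  # toy linear model weights

def grad(w, X, y):
    # Gradient of mean squared error for y_hat = X @ w.
    return 2 * X.T @ (X @ w - y) / len(y)

X = rng.normal(size=(32, 4))
y = rng.normal(size=32)

# Full-batch gradient: all 32 samples' activations in memory at once.
g_full = grad(w, X, y)

# Accumulated gradient over 4 micro-batches of 8 samples each; only one
# micro-batch's activations need to be materialized at a time.
g_acc = np.zeros_like(w)
for Xb, yb in zip(np.split(X, 4), np.split(y, 4)):
    g_acc += grad(w, Xb, yb) / 4  # average of micro-batch gradients

# Both approaches yield the same update direction.
assert np.allclose(g_full, g_acc)
```

The equivalence holds because the mean-squared-error gradient over the full batch equals the average of the micro-batch gradients; the trade-off is extra forward/backward passes in exchange for a lower peak.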
Finetuning allows us to adapt pretrained LLMs in a cost-efficient manner. But which method should we use? This article compares different…
Posted by Wei Wei, Developer Advocate
Large language models (LLMs) are taking the world by storm, thanks to their powerful ability to generate text, translate languages, and answer questions in a coherent and informative way. At Google I/O 202…