Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch
Peak memory consumption is a common bottleneck when training deep learning models such as vision transformers and LLMs. This article provides a series of…
Training and using large language models (LLMs) is expensive due to their large compute requirements and memory footprints. This article will explore how…