Artificial Intelligence, deep-learning, large-language-models, Machine Learning, mixture-of-experts-ai

Mixture of Experts: From Intuition to Training Reality

This blog is based on what I learnt from the Stanford CS336N Lecture 4 on Mixture of Experts, along with the key papers explained in the…Continue reading on Medium ยป