cs.AI, cs.DC, cs.LG

Scalable Pretraining of Large Mixture of Experts Language Models on the Aurora Supercomputer

arXiv:2604.00785v1 Announce Type: cross
Abstract: Pretraining Large Language Models (LLMs) from scratch requires a massive amount of compute. Aurora is an exascale supercomputer with 127,488 Intel PVC (Ponte Vecchio) GPU tiles. In this work, we …