NRGPT: An Energy-based Alternative for GPT
arXiv:2512.16762v3 Announce Type: replace
Abstract: Generative Pre-trained Transformer (GPT) architectures are the most popular design for language modeling. Energy-based modeling is a different paradigm that views inference as a dynamical process ope…