cs.LG

Super Apriel: One Checkpoint, Many Speeds

arXiv:2604.19877v1 Announce Type: new
Abstract: We release Super Apriel, a 15B-parameter supernet in which every decoder layer provides four trained mixer choices — Full Attention (FA), Sliding Window Attention (SWA), Kimi Delta Attention (KDA), and …

cs.AI

Stabilising Generative Models of Attitude Change

arXiv:2604.19791v1 Announce Type: new
Abstract: Attitude change – the process by which individuals revise their evaluative stances – has been explained by a set of influential but competing verbal theories. These accounts often function as mechanism s…

cs.AI, cs.CY

Fairness Testing of Large Language Models in Role-Playing

arXiv:2411.00585v2 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) have become foundational in modern language-driven software applications, profoundly influencing daily life. A critical technique in leveraging their potential is r…

Scroll to Top