cs.AI, cs.LG, stat.ML

Sink vs. diagonal patterns as mechanisms for attention switch and oversmoothing prevention

arXiv:2605.08453v1 Announce Type: cross
Abstract: This paper studies the role of sinks and diagonal patterns as attention switch and anti-oversmoothing mechanisms. We analyze geometric conditions under which sinks can be represented, showing a necessa…