On the Mirage of Long-Range Dependency, with an Application to Integer Multiplication
arXiv:2603.29069v2 Announce Type: replace-cross
Abstract: Integer multiplication has long been considered a hard problem for neural networks, with the difficulty widely attributed to the O(n) long-range dependency induced by carry chains. We argue tha…