GeneZip: Region-Aware Compression for Long Context DNA Modeling
arXiv:2602.17739v3
Abstract: Long-context DNA models are limited by token-mixing cost and by how compression allocates representational budget across the genome. Existing approaches operate close to base-pair resolution, a…