ENSEMBITS: an alphabet of protein conformational ensembles
arXiv:2605.13789v2 Announce Type: replace-cross
Abstract: Protein structure tokenizers (PSTs) are workhorses in protein language modeling, function prediction, and evolutionary analysis. However, existing PSTs capture only the local geometry of static str…