DocAtlas: Multilingual Document Understanding Across 80+ Languages
arXiv:2605.12623v1 Announce Type: cross
Abstract: Multilingual document understanding remains limited for low-resource languages due to scarce training data and model-based annotation pipelines that perpetuate existing biases. We introduce DocAtlas, a…