Multi-domain Multi-modal Document Classification Benchmark with a Multi-level Taxonomy
arXiv:2605.10550v1 Announce Type: new
Abstract: Document classification forms the backbone of modern enterprise content management, yet existing benchmarks remain trapped in oversimplified paradigms — single domain settings with flat label structures…