BhashaSutra: A Task-Centric Unified Survey of Indian NLP Datasets, Corpora, and Resources
arXiv:2604.18423v1
Abstract: India’s linguistic landscape, spanning 22 scheduled languages and hundreds of marginalized dialects, has driven rapid growth in NLP datasets, benchmarks, and pretrained models. However, no dedicated surv…