/u/zriyansh - Provide.ai

20M+ Indian legal documents with citation graphs and vector embeddings – potential uses for legal NLP? [D]

/u/zriyansh / April 14, 2026

I've been working on structuring India's legal corpus for the past 2 years and wanted to share what I've built and hear from people working on legal NLP or low-resource Indian language models. The dataset is 20M+ Indian court cases from the Supreme Cour…
