VOYAGER: A Training Free Approach for Generating Diverse Datasets using LLMs
arXiv:2512.12072v2 Announce Type: replace
Abstract: Large language models (LLMs) are increasingly being used to generate synthetic datasets for the evaluation and training of downstream models. However, prior work has noted that such generated data la…