LocalLLaMA

Looking for recommendations for a small TTS model that can be fine tuned on a local language dataset.

Looking for recommendations for a small TTS model (<600M params) that can be fine tuned on a local language dataset. I have ~150 hours of very clean single speaker audio with accurate transcripts/pronunciation. Around 45000 text rows I’ve tried: • …