Hey r/LocalLLaMA !
I am back with a new model, and it's something special today 😃
It's Flare-TTS 28M, my first text to speech (TTS) model trained completely from scratch on a single A6000 GPU for ~24 hours, ~300 epochs and the full LJSpeech dataset!
Link to the HF model: https://huggingface.co/LH-Tech-AI/Flare-TTS-28M
Example result:
https://cdn-uploads.huggingface.co/production/uploads/697f2832c2c5e4daa93cece7/vluuHSnp9Ietk7Uk1-hvG.mpga
It speaks english, but it still sounds a bit robotish 😂
You can use if you want - it's free and open-source 😃
Have fun ❤️
[link] [comments]