This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/pheonis2 on 2024-10-13 03:08:41+00:00.


A new state-of-the-art open-source model, F5-TTS, was released just a few days ago! This cutting-edge model, boasting 335M parameters, is designed for English and Chinese speech synthesis. It was trained on an extensive dataset of 95,000 hours, utilizing 8 A100 GPUs over the course of more than a week.

HF Space:

Github:

Demo: https://swivid.github.io/F5-TTS/

Weights: https://huggingface.co/SWivid/F5-TTS