347 shaares
3 liens privés
3 liens privés
2 résultats
taggé
tts
A fast and local neural text-to-speech engine that embeds espeak-ng for phonemization.
Text-to-speech
Kyutai text-to-speech started as an internal tool we used during the development of Moshi. As part of our commitment to open science, we've since open-sourced two text-to-speech models:
Kyutai Pocket TTS, a tiny model with voice cloning, fast enough to run on CPU.
Kyutai TTS 1.6B, a streaming model used in Unmute, great for servers.