358 shaares
3 liens privés
3 liens privés
3 résultats
taggé
tts
Open source voice cloning studio with support for multiple TTS engines. Clone any voice, generate natural speech, and compose multi-voice projects — all running locally.
A fast and local neural text-to-speech engine that embeds espeak-ng for phonemization.
Text-to-speech
Kyutai text-to-speech started as an internal tool we used during the development of Moshi. As part of our commitment to open science, we've since open-sourced two text-to-speech models:
Kyutai Pocket TTS, a tiny model with voice cloning, fast enough to run on CPU.
Kyutai TTS 1.6B, a streaming model used in Unmute, great for servers.