KugelAudio - Real-time text-to-speech model you can self-host
KugelAudio
Real-time text-to-speech model you can self-host
Screenshots

Hunter's comment
Most natural real-time TTS with voice cloning and sub-60ms latency, on-prem or via API. Grammar-aware normalization reads phone numbers, IBANs, addresses, and medications naturally across 25+ languages, with word-level timestamps and IPA support. Adapters for LiveKit, Pipecat, and Vapi. Built by 4 in Berlin.
Link

This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com
Congratulations!
We have upvoted your post for your contribution within our community.
Thanks again and look forward to seeing your next hunt!
Want to chat? Join us on: