Product Thumbnail

KugelAudio

Real-time text-to-speech model you can self-host

API
Developer Tools
Artificial Intelligence

Hunted byGarry TanGarry Tan

Product upvotes vs the next 3

Waiting for data. Loading

Product comments vs the next 3

Waiting for data. Loading

Product upvote speed vs the next 3

Waiting for data. Loading

Product upvotes and comments

Waiting for data. Loading

Product vs the next 3

Loading

KugelAudio

Real-time text-to-speech model you can self-host

Most natural real-time TTS with voice cloning and sub-60ms latency, on-prem or via API. Grammar-aware normalization reads phone numbers, IBANs, addresses, and medications naturally across 25+ languages, with word-level timestamps and IPA support. Adapters for LiveKit, Pipecat, and Vapi. Built by 4 in Berlin.

Top comment

Hi PH 👋 Kajo from KugelAudio here. We're 4 people in Berlin, currently in SF for YC. Building for the future, which we believe will be conversational. Today we're shipping a real-time TTS model with voice cloning. If you only have a minute, here's what I'd want you to know. You can clone a voice from 30 to 60 seconds of audio. Drop in a short sample and you get a working voice immediately. We optimized for voice agents with latencies below 60ms (excl. network), input streaming and output streaming. We offer on premise support, run the model in your own cluster instead of calling our API. Useful when data needs to stay on your network. Adapters for LiveKit, Pipecat, and Vapi. SDKs in Python, JS, and Java. Free tier so you don't have to talk to us before trying it. Alex (our founding engineer) is in the thread today for the hard questions: model architecture, where the latency came from, what we broke before this version worked. Ask anything!

About KugelAudio on Product Hunt

Real-time text-to-speech model you can self-host

KugelAudio launched on Product Hunt on May 28th, 2026 and earned 100 upvotes and 10 comments, placing #16 on the daily leaderboard. Most natural real-time TTS with voice cloning and sub-60ms latency, on-prem or via API. Grammar-aware normalization reads phone numbers, IBANs, addresses, and medications naturally across 25+ languages, with word-level timestamps and IPA support. Adapters for LiveKit, Pipecat, and Vapi. Built by 4 in Berlin.

On the analytics side, KugelAudio competes within API, Developer Tools and Artificial Intelligence — topics that collectively have 1.1M followers on Product Hunt. The dashboard above tracks how KugelAudio performed against the three products that launched closest to it on the same day.

Who hunted KugelAudio?

KugelAudio was hunted by Garry Tan. A “hunter” on Product Hunt is the community member who submits a product to the platform — uploading the images, the link, and tagging the makers behind it. Hunters typically write the first comment explaining why a product is worth attention, and their followers are notified the moment they post. Around 79% of featured launches on Product Hunt are self-hunted by their makers, but a well-known hunter still acts as a signal of quality to the rest of the community. See the full all-time top hunters leaderboard to discover who is shaping the Product Hunt ecosystem.

For a complete overview of KugelAudio including community comment highlights and product details, visit the product overview.