kokoro mark

Kokoro Text to Speech

Kokoro-82M is a lightweight text-to-speech (TTS) model from hexgrad that converts written text into spoken audio. It’s a good fit when you want simple, fast speech synthesis from text and prefer costs that scale with the amount of text you generate (pricing is based on text length).

Speed

Price

Inputs

Text

Outputs

Audio

Lab

Hexgrad

Kokoro Text to Speech sample outputs

Captured directly from our eval suite. Click any tile to inspect the full render.

Launch your first agent today

Build without code. Share without limits.