NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Show HN: Three new Kitten TTS models – smallest less than 25MB (github.com)
altruios 1 minutes ago [-]
One of the core features I look for is expressive control.

Either in the form of the api via pitch/speed/volume controls, for more deterministic controls.

Or in expressive tags such as [coughs], [urgently], or [laughs in melodic ascending and descending arpeggiated gibberish babbles].

the 25MB model is amazingly good for being 25MB. How does it handle expressive tags?

ilaksh 5 minutes ago [-]
Thanks for open sourcing this.

Is there any way to do a custom voice as a DIY? Or we need to go through you? If so, would you consider making a pricing page for purchasing a license/alternative voice? All but one of the voices are unusable in a business context.

great_psy 7 minutes ago [-]
Thanks for working on this!

Is there any way to get those running on iPhone ? I would love to have the ability for it to read articles to me like a podcast.

Tacite 4 minutes ago [-]
Is it English only?
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 16:40:52 GMT+0000 (Coordinated Universal Time) with Vercel.