One of the core features I look for is expressive control.
Either in the form of the api via pitch/speed/volume controls, for more deterministic controls.
Or in expressive tags such as [coughs], [urgently], or [laughs in melodic ascending and descending arpeggiated gibberish babbles].
the 25MB model is amazingly good for being 25MB. How does it handle expressive tags?
ilaksh 5 minutes ago [-]
Thanks for open sourcing this.
Is there any way to do a custom voice as a DIY? Or we need to go through you? If so, would you consider making a pricing page for purchasing a license/alternative voice? All but one of the voices are unusable in a business context.
great_psy 7 minutes ago [-]
Thanks for working on this!
Is there any way to get those running on iPhone ? I would love to have the ability for it to read articles to me like a podcast.
Tacite 4 minutes ago [-]
Is it English only?
Rendered at 16:40:52 GMT+0000 (Coordinated Universal Time) with Vercel.
Either in the form of the api via pitch/speed/volume controls, for more deterministic controls.
Or in expressive tags such as [coughs], [urgently], or [laughs in melodic ascending and descending arpeggiated gibberish babbles].
the 25MB model is amazingly good for being 25MB. How does it handle expressive tags?
Is there any way to do a custom voice as a DIY? Or we need to go through you? If so, would you consider making a pricing page for purchasing a license/alternative voice? All but one of the voices are unusable in a business context.
Is there any way to get those running on iPhone ? I would love to have the ability for it to read articles to me like a podcast.