Show HN: I made a tool using AI to convert articles to audioarticle.audio
nceasy 8 days ago [-]
Hey nice one! I'm trying to innovate in the STT and TTS space as well (completely different field), feel free to contact me in my profile email if you want to exchange some knowledge :). I hear some bugs in portuguese, but I'm guessing the trained model has some issues. Congrats and Good luck with your tool! Ps: give users some preview of the speakers voice so we can test it before convert audio, it can save you some resources.
Paul_Grsl 8 days ago [-]
Hello, we can meet on twitter: grsl_en :) thanks for your comment, it's still a v1 in beta but I will continue to improve it. And yes as you can see in the ui there are buttons for preview but not yet functional :)
trabant00 9 days ago [-]
It does not handle apostrophes good. It says "we L L" instead of we'll or "Andrea S" instead of Andrea's. It also has some problem with pacing around dashes, three points and quotes. It speeds up a lot and connects the words before and after those. Overall would not use it in this state to turn articles into podcasts or something like that.
capableweb 9 days ago [-]
What about something that can do the opposite? Like converting video and/or audio to articles?

Most of the content I consume fits best (for me) in article format, so I can read it at my own speed, but some really good information can only (annoyingly) be found in videos or podcasts.

roryisok 8 days ago [-]
I've been using whisper from openAI to transcribe stuff this month, and its incredibly accurate. would be a good base for something like this
Paul_Grsl 9 days ago [-]
This is a good idea, I just started the project :) So this is a feature that can be added in the future.

Here is the roadmap for the future: - Audio sharing - Convert Text To Audio - Convert PDF To Audio - Convert Photo To Audio - Chrome extension - to convert while browsing - Mobile App - to manage audios everywhere, simply

and adding the possibility to do the opposite is also a great idea!

cblavier 9 days ago [-]
Working on this very topic right now! But specific to Podcast audio content

https://readable.fm/

veb 8 days ago [-]
oh mate I so agree and as a deaf person this would be a godsend. way too much shit is in videos or podcasts. please just let me read...
kretaceous 9 days ago [-]
I got super psyched to try this. Always wanted a good TTS extension or app.

However I cannot get it to work. I've logged in, input an article but nothing happens after I click "Convert article to audio" or preview.

Linux Mint/Firefox 107/Chrome

Edit:

I checked devtools and it shows a 500 error with the message "Something is broken. Please let us know what you did"

The link I was trying to convert was:

https://www.daemonology.net/blog/2020-09-20-On-the-use-of-a-...

westcort 9 days ago [-]
Paul_Grsl 8 days ago [-]
Except that the quality of the audio rendering is... not crazy.
westcort 8 days ago [-]
It defaults to whatever your browser defaults to. It is easy to change that setting in Chrome to a more natural voice: https://support.google.com/chromebook/answer/11221616?hl=en
Paul_Grsl 8 days ago [-]
I'm looking into how this can happen. Feel free to sign up to come back later and convert your article! I can't wait for you to use Article.Audio
akuji1993 9 days ago [-]
Hey, your tool is not working great with german articles. It can't pronounce Umlauts and also has trouble with some pretty standard simple words.

I used this article as a test: https://www.saarbruecker-zeitung.de/saarland/landespolitik/s...

Paul_Grsl 9 days ago [-]
I will look at why the charactere that Umlauts are not well pronounced. Do you have examples of other words? I will investigate :)

Danke für dein Feedback, das hilft mir wirklich weiter .

cauners 9 days ago [-]
On the same topic, in Latvian all the characters with diacritics are stripped out completely. For example, "iedzīšana" is pronounced as "iedzana", making the audio pretty funny, though hard to understand.
Paul_Grsl 8 days ago [-]
I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!
nkoren 9 days ago [-]
Huh, wow, really cool! For the most part it's excellent (in English), however I notice that quotation marks (both single and double) are handled strangely. The leading quotation mark is pronounced as a short A, and the trailing as a long A. This can be rather confusing! But otherwise, I'm incredibly impressed by the results.
Paul_Grsl 8 days ago [-]
Thank you very much for your comment!! It's still a beta version and I have to improve some pronunciations! Especially for the special characters... don't hesitate to signup to be kept informed :)
wazoox 8 days ago [-]
It's certainly that all UTF-8 characters (like UTF-8 fancy quotes and double-quotes) aren't properly interpreted, everything seems to go through as ISO8859-15.
malshe 8 days ago [-]
I also noticed the same issue with quotation marks. But other than that this is a really nice application
schreon 9 days ago [-]
Nice! The german version breaks Umlauts though. Apparently the preprocessing converts e.g. "ä" to "ae", "ö" to "oe" and so on and the text2voice model subsequently pronounces them as e.g. "a-e", instead of what "ä" would actually sound.
Paul_Grsl 9 days ago [-]
Indeed we have a problem with special characters in German (and also Polish… :( ) I am investigating why they are not pronounced correctly.

I studied German at school but not enough.

Vielen Dank für Ihren Kommentar, er hilft mir, das Tool zu verbessern :)

hutrdvnj 9 days ago [-]
Could you make a paid TTS engine App on Android and iOS?
Paul_Grsl 8 days ago [-]
Yes it's in the roadmap. and also a chrome extension :) would you be interested ?
hutrdvnj 8 days ago [-]
Yes, I currently use the Read Aloud Android App and it allows me to use any installed TTS. The Google free network TTS voices are quite okay, but I know that there are better premium voices unfortunately I didn't found any high quality human like TTS in the play store as of yet.
tiffanyh 8 days ago [-]
Dumb question: how is “AI” used for text-to-speech?
Paul_Grsl 8 days ago [-]
Text-to-Speech (TTS) technology uses artificial intelligence (AI) translate written information in a given language into a sound, voice or speech with a human accent.to learn the AI had to learn with many parameters, so that the pronunciation improves from version to version :)
aronatom 8 days ago [-]
Tested icelandic! sound really good except for ignoring all special icelandic character such as ð, ó, á ,ö and so on
Paul_Grsl 8 days ago [-]
Thanks for your comment! I'm actually working on this point to fix it as FAST as possible! sorry for that...
roland_szabo 8 days ago [-]
I had the same issue with hungarian language.
Paul_Grsl 8 days ago [-]
I'm working on solving these pronunciation problems for the special characters if I unlock one everything else will follow. Don't hesitate to sign up to be kept informed!
aronatom 8 days ago [-]
I will! Great stuff
judex 9 days ago [-]
Great work! I'm testing if I could use it in my project. It would be good to be able to just paste some text.
Paul_Grsl 8 days ago [-]
Thank you so much! What's your project?

Here is the roadmap for the future: · Audio sharing ----> FOR YOU · Convert Text To Audio · Convert PDF To Audio · Convert Photo To Audio · Chrome extension - to convert while browsing · Mobile App - to manage audios everywhere, simply

jerpint 9 days ago [-]
I had this idea a few months ago, obviously I never got around to executing it. I’m glad someone else did
Paul_Grsl 9 days ago [-]
That famous moment when you have the idea of a new side project, you buy the domain name and then... you have a new idea (loop).

This time I developed it, I'm happy with this 1st version (which must be improved).

Anyway, thanks for your comment

And you, why didn't you develop it in the end?

jerpint 8 days ago [-]
I can’t remember the list of thousands of excuses I came up with :)
Paul_Grsl 8 days ago [-]
Haha. destroy this list and go for it :D
5amdotis 8 days ago [-]
I would like to have a tool that does it the other way around. Audio to a somewhat cohesive article.
wazoox 8 days ago [-]
I can only get "200 internal server error" entries in the dev console :)
Paul_Grsl 8 days ago [-]
Wow... :( Still now ?
wazoox 8 days ago [-]
It works on Chromium though. However it has trouble with UTF-8 obviously: it interprets "é" as "é" i.e. ISO8859-15.

I see that for "French" it selects "Alain" as a voice in Chromium, but "Joe" in Firefox. However in Firefox I must first change the language, it then select another voice, and switching back to French, it switches properly to Alain then it works. It probably doesn't initialise properly the voice selector when loading the page in Firefox (if I don't change the language first, I can't select any voice, the selector isn't working).

Same problem with accents as Chromium though :)

herinskd 9 days ago [-]
It works amazingly fast, are you using GPUs and QNNX to reach such performance?
Paul_Grsl 8 days ago [-]
Thanks for the great feedback. I think we can still improve the result, it's only a v1 in beta. I use nothing very advanced for rendering, a stack and tools rather simple. :)
zachthewf 8 days ago [-]
I believe it's using Microsoft TTS voices (at least for some of them)
Paul_Grsl 8 days ago [-]
That's right we use Azure TTS! :)
phas0ruk 8 days ago [-]
Nice, what’s the tech stack?
Paul_Grsl 8 days ago [-]
Simple: PHP (Symfony), JS (Vanilla), HTML, TailwindCSS :)
loriverkutya 9 days ago [-]
Not usable with Hungarian.
Paul_Grsl 8 days ago [-]
I'm looking into it, a Polish user is having the same problems. Don't hesitate to sign up so I can let you know when it's fixed :)

Köszönöm a hozzászólásodat, ez segít nekem! :)

fieryskiff11 9 days ago [-]