Very impressive, especially that it works with other languages than English, too!
It not only correctly split apart and found the etymological roots of the German word "Rindfleischettiketierungsüberwachungsübertragungsgesetz" (law governing the transfer of the responsibility of controlling the labeling of beef) but also Bavarian / Austrian dialect like "Oachkatzlschwoaf" (squirrel tail) or Kukeruz (corn). It also works for French and Japanese. Wow.
I think that by "can guess fake words" in the title OP means that the tool can come up with a plausible guess for the etymology, even for fake words. Unfortunately, the more common reading of that phrase is that it can tell fake words from real ones
Cthulhu_ 1 minutes ago [-]
[delayed]
dizhn 32 minutes ago [-]
Which it should since it knows all the words.
andrelaszlo 2 hours ago [-]
Agree. It does a good job at coming up with a plausible meaning of novel words like "multiarborality" → "the state of relating to many trees" but it doesn't indicate that this is a "fake" word even though I just made it up.
TeMPOraL 2 hours ago [-]
Doesn't look fake to me. English is not a closed-world language, as far as I know.
Between things like "verbing" and "nouning", and the cultural acceptance of doing them in casual speech, I'd say English is a great language because you get to "invent" new words on the fly, and your interlocutors know what you mean.
In this sense, even if no one before ever said or wrote "multiarborality", it's pretty clear what it means (as long as you don't misread it), and IMHO it's perfectly fine to derive its etymology by deconstructing it back to "common" words and pulling etymology on those, recursively.
latexr 15 minutes ago [-]
> I'd say English is a great language because you get to "invent" new words on the fly, and your interlocutors know what you mean.
I bet you could do that with most languages. I don’t see why English would be especially great at it.
andrelaszlo 1 hours ago [-]
I agree with both comments, I just wanted to find a word that an AI wouldn't have heard before, not a word that is "fake".
"Flonkers: A made-up word, likely humorous". Aren't all words made up? Edit: This one, unlike my previous example, is actually in use - flonkers: an animal that looks fat but is actually just fluffy.
nhinck3 35 minutes ago [-]
turboencabulation, hydrocoptic. I bet you could look up a bunch of sci-fi for other examples of completely made up words
stavros 1 hours ago [-]
If you had used that word in a sentence, eg "what I like about this park is its multiarborality", I would have immediately understood what you meant.
sandworm101 2 hours ago [-]
>> even though I just made it up
One person inventing a word that they have never heard before doesn't negate the possibility of that word being in common use somewhere.
"An expansion of the 2020 Theodore Sturgeon Memorial Award winning story. Arboreality is a finalist for the Philip K. Dick Award and the winner of the 2023 Ursula K. Le Guin Prize for Fiction."
msds 4 hours ago [-]
It doesn't really detect stringing together latinate nonsense, so all of my coinages got perfectly coherent definitions. On the flip side, I guess if you're erudite and willing to butcher other languages for sport, nothing has ever stopped you from making up perfectly quasiaryiumaryesque words.
"to reverse the process of reversing the removal of water"
There is clearly no such word, but a human would probably infer such a meaning.
I'm amazed at how unamazed we all seem to have become at such feats. We need more deunnamazing
Oarch 36 minutes ago [-]
I'd often casually wondered where the word scaffolding comes from. Its explanation seems sort of plausible.
rvense 3 hours ago [-]
Fyllipig is derived from "fyllan" and "pigge" and means "descriptive of something that is both full/abundant and pig-like."
Terryjambled means "mixed up in a confused or disorderly state, and covered with or resembling terry cloth."
Refugglemander means to "to manipulate electoral district boundaries in a way that impacts refugees."
I'm sorry, OP. This just isn't very good.
InsideOutSanta 1 hours ago [-]
It's funny; I'm looking at your examples and coming to the opposite conclusion. I feel like it is very good because it provides explanations for unusual or novel words that are similar to what a human might conclude.
stavros 1 hours ago [-]
What etymologies would you have expected instead?
kleiba 2 hours ago [-]
It actually knows about "Lisa the Iconoclast". Cool.
Oarch 38 minutes ago [-]
It's a perfectly cromulent tool
thih9 3 hours ago [-]
When I asked it to deconstruct "Babbage"[1] I got "Derived from Babba's place", Some others:
- phonenose: The ability to detect sounds or voices through the nose
- legpc: Acronym for Laptop Easy Personal Computer
- gitls: A command in Git to list files
- housefreezing: The action of hardening a house with cold
- uncleftish beholding[2]: The act of viewing something that is whole
In any case it's fun to play with and the UI is nice too.
Note, the title looks editorialized, it's currently "A AI etymology deconstructor – can guess fake words", but the website says just "deconstructor.".
> You've deconstructed 5 words! Enter your email to continue your language journey and get notified about new features.
Yeah, no. I tried the same word five times to test for inconsistencies. I don’t appreciate you weren’t straightforward with the limits and am not going to give you my email to “continue my language journey”. The fact you’re already using euphemisms like that makes me strongly distrust what you’d do with an email address.
Fission 3 hours ago [-]
This is fun, but what I think would be even more interesting and potentially actually useful would be to generate a word from multiple ideas — to help express complex ideas in one word. Like Frühstücksbrotübersättigung
jonplackett 3 hours ago [-]
I love this kind of thing. It’s weird how many words we use without having any idea what they mean.
Words always end up getting broken with speech in a different way to the meaning.
Eg I put in pandemonium which you say as pande-moniun but I just learned is pan-demon-ium which makes a lot more sense!
Very cool. Well done.
Timwi 2 hours ago [-]
Try “helicopter” for a real mind-blow.
stavros 1 hours ago [-]
I hate how everyone always breaks it up as "heli" and "copter" when it's "helico" and "pter".
tsuru 2 hours ago [-]
It did not like supercalifragilisticexpialidocious
InsideOutSanta 1 hours ago [-]
I had the same idea, and it concluded that it meant "fantastic," which seems correct.
fliglr 2 hours ago [-]
I'll no longer misunderestimate AI
youdont 3 hours ago [-]
It really struggled with Antidisestablishmentarianism
4 days ago [-]
novemp 4 hours ago [-]
"Can guess fake words"? It told me "shitbutt" was a real word.
3 hours ago [-]
Timwi 2 hours ago [-]
Very cool to use initially, but then completely locks you out with a modal in your face demanding an email address. That is despicable and you should be ashamed.
Rendered at 10:58:08 GMT+0000 (Coordinated Universal Time) with Vercel.
It not only correctly split apart and found the etymological roots of the German word "Rindfleischettiketierungsüberwachungsübertragungsgesetz" (law governing the transfer of the responsibility of controlling the labeling of beef) but also Bavarian / Austrian dialect like "Oachkatzlschwoaf" (squirrel tail) or Kukeruz (corn). It also works for French and Japanese. Wow.
Between things like "verbing" and "nouning", and the cultural acceptance of doing them in casual speech, I'd say English is a great language because you get to "invent" new words on the fly, and your interlocutors know what you mean.
In this sense, even if no one before ever said or wrote "multiarborality", it's pretty clear what it means (as long as you don't misread it), and IMHO it's perfectly fine to derive its etymology by deconstructing it back to "common" words and pulling etymology on those, recursively.
I bet you could do that with most languages. I don’t see why English would be especially great at it.
I don't even know what a fake word is.
https://deconstructor.ayush.digital/w/flonkers
"Flonkers: A made-up word, likely humorous". Aren't all words made up? Edit: This one, unlike my previous example, is actually in use - flonkers: an animal that looks fat but is actually just fluffy.
One person inventing a word that they have never heard before doesn't negate the possibility of that word being in common use somewhere.
https://books.google.ca/books/about/Arboreality.html?id=S95Y...
"An expansion of the 2020 Theodore Sturgeon Memorial Award winning story. Arboreality is a finalist for the Philip K. Dick Award and the winner of the 2023 Ursula K. Le Guin Prize for Fiction."
https://deconstructor.ayush.digital/w/Deundehydrate
"to reverse the process of reversing the removal of water"
There is clearly no such word, but a human would probably infer such a meaning.
I'm amazed at how unamazed we all seem to have become at such feats. We need more deunnamazing
Terryjambled means "mixed up in a confused or disorderly state, and covered with or resembling terry cloth."
Refugglemander means to "to manipulate electoral district boundaries in a way that impacts refugees."
I'm sorry, OP. This just isn't very good.
- phonenose: The ability to detect sounds or voices through the nose
- legpc: Acronym for Laptop Easy Personal Computer
- gitls: A command in Git to list files
- housefreezing: The action of hardening a house with cold
- uncleftish beholding[2]: The act of viewing something that is whole
In any case it's fun to play with and the UI is nice too.
Note, the title looks editorialized, it's currently "A AI etymology deconstructor – can guess fake words", but the website says just "deconstructor.".
[1]: https://en.wikipedia.org/wiki/Charles_Babbage
[2]: https://en.wikipedia.org/wiki/Uncleftish_Beholding
Yeah, no. I tried the same word five times to test for inconsistencies. I don’t appreciate you weren’t straightforward with the limits and am not going to give you my email to “continue my language journey”. The fact you’re already using euphemisms like that makes me strongly distrust what you’d do with an email address.
Words always end up getting broken with speech in a different way to the meaning.
Eg I put in pandemonium which you say as pande-moniun but I just learned is pan-demon-ium which makes a lot more sense!
Very cool. Well done.