AI has a deep understanding of how this code works (github.com)

438 points by theresistor 74 days ago | 306 comments

benterix 74 days ago [-]

Did these OCaml maintainers undergo some special course for dealing with difficult people? They show enormous amounts of maturity and patience. I'd just give the offender Torvalds' treatment and block them from the repo, case closed.

yodsanklai 73 days ago [-]

In my big tech company, you don't want to be dismissive of AI if you don't want to sound like a pariah. It's hard to believe how much faith leadership has in AI. They really want every engineer to use AI as much as possible. Reviewing is increasingly done by AI as well.

That being said, I don't think that's why reviewers here were so cordial, but this is the tone you'd expect in the corporate world.

robertheadley 72 days ago [-]

I wouldn't say they were dismissive of AI, just that they are unwilling to merge code that they don't have the time or motivation to review.

If you want AI code merged, make it small so it's an easy review.

That being said, I completely understand being unwilling to merge AI code at all.

joelreymont 72 days ago [-]

Why would you be unwilling to merge AI code at all?

Consider my other PR against the Zig compiler [1]... I was careful to make it small and properly document it but there's a strict anti-AI policy for Zig and they closed the PR.

Why?

Is it not small? Not carefully documented? Is there no value in it?

I'm not complaining or arguing for justice. I'm genuinely interested in how people think in this instance. If the sausage looks good and tastes great, and was made observing the proper health standards, do you still care how the sausage was made?!

[1] https://github.com/joelreymont/zig/pull/1 [2] https://ziggit.dev/t/bug-wrong-segment-ordering-for-macos-us...

losvedir 71 days ago [-]

Personally, I'm skeptical that it's a real bug, and that even if it is, that's the proper fix. For all I know, the LLM hallucinated the whole thing, terminal prompts, output, "fix" and all.

It takes time to review these things, and when you haven't shown yourself to be acting responsibly, there's no reason to give you the benefit of the doubt and spend time even checking if the damn alleged bug is real. It doesn't even link to an existing issue, which I'm pretty sure would exist for something as basic as this.

How do you know it's an issue? I think you're letting the always confident LLM trick you into thinking it's doing something real and useful.

gexla 72 days ago [-]

> Why would you be unwilling to merge AI code at all?

Because structurally it's a flag for being highly likely to waste extremely scarce time. It's sort of like avoiding bad neighborhoods, not because everyone is bad, but because there is enough bad there that it's not worth bothering with.

What sticks out for me in these cases is that the AI sticks out like a sore thumb. Go ahead and use AI, it's as if the low effort nature of AI sets users on a course of using low effort throughout the cycle of whatever it is they are trying to accomplish as an end game.

The AI shouldn't look like AI. The proposed contributions shouldn't stand out from the norm. This includes the entire process, not just the provided code. It's just a bad aesthetic and for most people it screams "low effort."

xign 72 days ago [-]

I can't even reproduce your supposed "issue" regarding the Zig compiler "bug". I have an Apple Silicon Mac and tried your reproducer and zig compiled and ran the program just fine.

Honestly, I really suggest reading up on what self-reflection means. I read through your various PRs, and the fact that you can't even answer why a random author name shows up in your PR means the code can't be trusted. It's not just about attribution (although that's important). It's that it's such a simple thing that you can't even reason through.

You may claim you have written loads of tests, but that means literally nothing. How do you know they are testing the important parts? Also you haven't demonstrated that it "works" other than in the simplest use cases.

joelreymont 72 days ago [-]

Check the 2nd PR, the one in my repo and not the one that was rejected.

noticingdecline 71 days ago [-]

[dead]

whilenot-dev 72 days ago [-]

> Why would you be unwilling to merge AI code at all?

Are you leaving the third-party aspect out of your question on purpose?

Not GP but for me, it pretty much boils down to the comment from Mason[0]: "If I wanted an LLM to generate [...] unreviewed code [...], I could do it myself."

To put it bluntly, everybody can generate code via LLMs and writing code isn't what defines the dominant work of an existing project anymore, as the write/verify-balance shifts to become verify-heavy. Who's better equipped to verify generated code than the maintainers themselves?

Instead of prompting LLMs for a feature, one could request the desired feature from the maintainers in the issue tracker and let them decide whether they want to generate the code via LLMs or not, discuss strategies etc. Whether the maintainers will use their time for reviews should remain their choice, and their choice only - anyone besides the maintainers should have no say in this.

There's also the cultural problem where the review efforts are non-/underrepresented in any contemporary VCS, and the amount of merged code grants a higher authority over a repository than any time spent doing reviews or verification (the Linux kernel might be an exception here?). We might need to rethink that approach moving forward.

[0]: https://discourse.julialang.org/t/ai-generated-enhancements-...

joelreymont 72 days ago [-]

I'm strictly talking about the 10-line Zig PR above.

Well-documented and tested.

whilenot-dev 72 days ago [-]

That's certainly a way to avoid questions... I mean sure, but everybody else is talking about how your humongous PRs are a burden to review.

joelreymont 72 days ago [-]

Which is something I agreed with and apologized for, and admitted was somewhat of a PR stunt.

Now, what's your question?

globnomulous 71 days ago [-]

> admitted was somewhat of a PR stunt.

You should be blocked, banned, and ignored.

> Now, what was your question?

Your attitude stinks. So does your complete lack of consideration for others.

umanwizard 71 days ago [-]

You are admitting to wasting people’s time on purpose and then can’t understand why they don’t want to deal with you or give you the benefit of the doubt in the future?

globnomulous 71 days ago [-]

It's worth asking yourself something: people have written substantial responses to your questions in this thread. Here you answered four paragraphs with two fucking lines referencing and repeating what you've already said. How do you expect someone to react? How can you expect anybody to take seriously anything you say, write, or commit when you obviously have so little ability, or willingness, to engage with others in a manner that shows respect and thought?

I really, truly don't understand. This isn't just about manners, mores, or self-reflection. The inability or unwillingness to think about your behavior or its likely reception is stupefying.

You need to stop 'contributing' to public projects and stop talking to people in forums until you figure this stuff out.

edstarch 71 days ago [-]

> I really, truly don't understand. This isn't just about manners, mores, or self-reflection. The inability or unwillingness to think about your behavior or its likely reception is stupefying.

Shower thought: what does a typical conversation with an LLM look like? You ask it a question, or you give a command. The model spends some time writing a large wall of text, or performing some large amount of work, probably asks some follow up questions. Most of the output is repetitive slop so the user scans for the direct answer to the question, or checks if the tests work, promptly ignores the follow-ups and proceeds to the next task.

Then the user goes to an online forum and carries on behaving the same way: all posts are instrumental, all of the replies are just directing, shepherding, shaping and cajoling the other users to his desired end (giving him recognition and a job).

I'm probably reading too much into this one dude but perhaps daily interaction with LLMs also changes how one interacts with other text based entities in their lives.

joelreymont 71 days ago [-]

I'll gladly discuss at length things that are near and dear to my heart.

Facing random people in the public court of opinion is not one of them!

Also, there's long-form writing in my blog posts, Twitter and Reddit.

aiahs 71 days ago [-]

Well if you wanna contribute (at least as a proxy) to OSS, you need to deal with people and make them want to deal with you. If you don't do that, no PR, regardless of how perfect it is, will ever be accepted. If you're so sure that your strategy for the future of development is correct, then prove it by building your own project, where you can fully decide which contributions are accepted, even those which are 100% AI-generated. This should be easy, right? Once your project gains widespread adoption, you can show everybody that you've been right all along. Until then, it's just empty talk.

LoganDark 70 days ago [-]

That's exactly their plan, it seems.

joelreymont 71 days ago [-]

Remind me please, when did I sign up to meet your expectations?

globnomulous 69 days ago [-]

My expectations are those of any reasonable, sensible person who has a modicum of software-development experience and any manners at all.

Incidentally, my expectations are also exactly the same as every other person who has commented on your PRs and contributions to discussion.

My expectations, lastly, are those of someone who evaluates job candidates and casts votes for and against hiring for my team.

Your website says repeatedly that you're open to work. Not only would I not hire you; I would do everything in my power to keep you out of my company and off my team. I'd wager good money that many others in this thread would, too.

If you have a problem with my expectations, you have a problem not with my expectations but with your own poor social skills and lack of professional judgment.

heavyset_go 72 days ago [-]

> Why would you be unwilling to merge AI code at all?

Because AI code cannot be copyrighted. It is not anyone's IP. That matters when you're creating IP.

edit: Assuming this is a real person I'm responding to, and this isn't just a marketing gimmick, having seen the trail you've left on the internet over the past few weeks, it strikes me as mania, possibly chatbot-induced. I don't know what I can say that could help, so I'm dropping out of this conversation and wish you the best.

kcexn 72 days ago [-]

This is a position that seems to be as unenforceable as AI can't be trained on code whose copyright owners have not given consent.

The main reason for being unwilling to merge AI code is going to be that it sets a precedent that AI code is acceptable. Suddenly, maintainers need to be able to make judgement calls on a case-by-case basis of what constitutes an acceptable AI contribution, and AI is going to be able to generate far more slop than people will ever have the time to review and agree upon.

heavyset_go 72 days ago [-]

> This is a position that seems to be as unenforceable as AI can't be trained on code whose copyright owners have not given consent.

This depends on what courts find; at least one non-precedent-setting case found model training on basically everyone's IP without permission to be fair use. If it's fair use, consent isn't needed, licenses don't matter and the only way to prevent training on your content is to withhold it and gate it behind contracts that forfeit your clients' rights to fair use.

But that is beside the point, even if what you claim was the case, my point is that AI output isn't property. It's not property whether its training corpus was full of licensed or unlicensed content. This is what the US Copyright Office determined.

If you include AI output in your product, that part of it isn't yours. It isn't anybody's, so anyone can copy it and anyone can do whatever they want with it, including the AI/cloud providers you allowed to slurp up your code as context for LLMs.

You want to own your IP, you don't want to say "we own 30% of the product we wrote, but 70% of it is non-property that anyone can copy/use/sell with no strings attached, even open source licenses". This matters if you're building a business or open source project. If you include AI code in your open source project, that part of the project isn't covered by your license. LLMs can't sign CLAs and they can't produce intellectual property that can be licensed or owned. The more of your project that is developed by AI, the more it is not yours, and the more of it cannot be covered by your open source license of choice.

cesarb 71 days ago [-]

> This is what the US Copyright Office determined.

There are hundreds of countries in the world. Whatever the "US Copyright Office" determines, applies to only one of them.

heavyset_go 71 days ago [-]

Find me a jurisdiction where AI output is the IP of the prompter

PunchyHamster 72 days ago [-]

> This is a position that seems to be as unenforceable as AI can't be trained on code whose copyright owners have not given consent.

Stares at facebook stealing terabytes of copyrighted content to train their models

Also, even if a model is trained only on code under FLOSS-approved licenses, GPL-based ones have some caveats that would disqualify many projects from including the code

weare138 71 days ago [-]

If poo flinging monkeys are making the sausage people don't care how good the sausage is.

phendrenad2 73 days ago [-]

This is a good point. There's a lot of cheering for the Linus swearing style, but if the average developer did that they'd eventually get a talking-to by HR.

delian66 71 days ago [-]

Please name it, so that we can know to avoid it and its products.

Mariehane 74 days ago [-]

I think you naturally undergo that course when you are a maintainer of a large OSS project.

pjc50 73 days ago [-]

Well, you go one of two ways. Classic Torvalds is the other way, until an intervention was staged.

kylereeve 73 days ago [-]

There's a very short list of people who can get away with "Classic Torvalds"

MrDresden 73 days ago [-]

Frankly the originator of this pull request deserves the "classic Torvalds" treatment, no matter who is delivering it.

bfkwlfkjf 73 days ago [-]

In fact, anyone can get away with it. What are they going to do, call the police because you called them a moron?

phendrenad2 73 days ago [-]

Consider the facts: (1) most open-source maintainers have a real job (2) an unwritten rule of corporate culture is "(appear) nice" (see all of the "corporate speak" memes about how "per my last email" means "fuck off") (3) these developers may eventually need a job at another company (4) their "moron" comment is going to live forever on the internet...

PunchyHamster 72 days ago [-]

If you're famous enough for that to filter through to whoever is handling your resume, if anything it will be positive

hypeatei 74 days ago [-]

It's clear some people have had their brain broken by the existence of AI. Some maintainers are definitely too nice, and it's infuriating to see their time get wasted by such delusional people.

ares623 73 days ago [-]

That’s why AI (and bad actors in general) is taking advantage of them. It’s sick.

jodrellblank 73 days ago [-]

> "It's clear some people have had their brain broken by the existence of AI."

The AI wrote code which worked, for a problem the submitter had, which had not been solved by any human for a long time, and there is limited human developer time/interest/funding available for solving it.

Dumping a mass of code (and work) onto maintainers without prior discussion is the problem[1]. If they had forked the repo, patched it themselves with that PR and used it personally, would they have a broken brain because of AI? They claim to have read the code, tested the code, they know that other people want the functionality; is wanting to share working code a "broken brain"? If the AI code didn't work - if it was slop - and they wanted the maintainers to fix it, or walk them through every step of asking the AI to fix it - that would be a huge problem, but that didn't happen.

[1] copyrightwashing and attribution is another problem but also not one that's "broken brain by the existence of AI" related.

kace91 73 days ago [-]

>They claim to have read the code, tested the code, they know that other people want the functionality; is wanting to share working code a "broken brain"?

There is clearly a deviation between the amount of oversight the author thinks they provided and the actual level of oversight. This is clear by the fact that they couldn’t even explain the misattribution. They also mention that this is not their area of expertise.

In general, I think that it is a reasonable assumption that, if you couldn’t have written the code yourself, you’re in no position to claim you can ensure its quality.

jodrellblank 73 days ago [-]

If a manager says they provided oversight of their developer employees, and the code was not as good as the manager thought, would you say "the manager has had their brain broken by the existence of employees"?

kace91 73 days ago [-]

I'll bite, let's grant for the sake of the argument that equating the LLM with a person holds.

This manager is directly providing an unrelated team with an unprompted 150-file giant PR dumped at once with no previous discussion. Upon questioning, he says the code has been written by an outside contractor he personally chose.

No one has onboarded this contractor to the team, and checking their online presence shows lots of media appearances, but not a single production project in their CV, much less long time maintenance.

A cursory glance at the files reveals that the code contains copypasted code from stackoverflow to the point that the original author's name is still pasted in comments. The manager cannot justify this, but doesn't seem bothered by the fact, and insists that the contractors are amazing because he's been following them in social networks and is infatuated with their media there.

Furthermore, you check the manager's history in slack and you see 15 threads of him doing the same for other teams. The ones that have agreed to review their PRs have closed them for being senseless.

How would you be interacting with this guy?

sfgvvxsfccdd 72 days ago [-]

This was a pretty spot on analogy. In particular “the manager cannot justify this, but doesn't seem bothered by the fact, and insists that the contractors are amazing” is too accurate.

palmotea 73 days ago [-]

> If a manager says they provided oversight of their developer employees, and the code was not as good as the manager thought, would you say "the manager has had their brain broken by the existence of employees"?

That could be either regular incompetence or a "broken brain." It's more towards the latter if the manager had no clue about what was going on, even after having it explained to him.

This guy is equivalent to a manager who hired two bozos to do a job, but insisted it was good because he had them check each other's work and what they made didn't immediately fall down.

PunchyHamster 72 days ago [-]

...Well, if you want to make an argument for calling them "useless and incompetent", I'd say you have a great point; a good manager would at least throw it to QA and/or recruit someone better after failure

joelreymont 72 days ago [-]

By testing the code I mean that I actually focused on tests passing and the output in the examples being produced by AI running lldb using this modified compiler.

nsagent 73 days ago [-]

It's clear Claude adapted code directly from the OxCaml implementation (the PR author said he pointed Claude at that code [1] and then provides a ChatGPT analysis [2] that really highlights the plagiarism, but ultimately comes to the conclusion that it isn't plagiarized).

Either that highlights someone who is incompetent or they are willfully being blasé. Neither bodes well for contributing code while respecting copyright (though mixing and matching code on your own private repo that isn't distributed in source or binary form seems reasonable to me).

[1]: https://github.com/ocaml/ocaml/pull/14369#issuecomment-35573...

[2]: https://github.com/ocaml/ocaml/pull/14369#issuecomment-35566...

joelreymont 72 days ago [-]

The key is that AI adapted, not stole.

It's actually capable of reasoning and generating derivative code and not just copying stuff wholesale.

See examples at the bottom of my post:

https://joel.id/ai-will-write-your-next-compiler/

noticingdecline 71 days ago [-]

[dead]

menaerus 73 days ago [-]

Sorry, this is just ridiculous and shows how fragile people really are. This whole topic and whole MR as well.

I am routinely looking into the folly implementation, sometimes into the libstdc++, sometimes into libc++, sometimes into boost or abseil etc. to find inspiration for problems that I tackle in other codebases. By the same standards, this should also be plagiarism, no? I manufacture new ideas by compiling existing knowledge from elsewhere. Literally every engineer in the world does the same. Why is AI any different?

meheleventyone 73 days ago [-]

Perhaps because the AI assigned copyright in the files to the author of the library it copied from and the person prompting it told it to look at that library. Without even getting into the comedy AI generated apologia to go with it which makes it look worse rather than better.

From a pragmatic viewpoint as an engineer you assign the IP you create over to the company you work for so plagiarism has real world potential to lose you your job at best. There's a difference between taking inspiration from something unrelated "oh this is a neat algorithmic approach to solving this class of problems" to "I need to implement this specific feature and it exists in this library so I'll lift it nearly verbatim".

menaerus 73 days ago [-]

Can you give an example of what exactly was copied? I ask because I took a look into the MR and the original repo, and the conclusion is that the tool only copy-pasted the copyright header but not the code. So I am still wondering - what's wrong with that (it's a silly mistake even a human can make), and where is the copyright infringement everyone is talking about?

DrammBA 72 days ago [-]

> copy-past[ing] the copyright header but not the code [is] a silly mistake even a human can make

Do you mind showing me some examples of that? That seems so implausible to me

Just for reference, here's another example of AI adding phantom contributors and the human just ignoring it or not even noticing: https://github.com/auth0/nextjs-auth0/issues/2432

nsagent 72 days ago [-]

Oh wow. That's just egregious. Considering the widespread use of Auth0, I'm surprised this isn't a bigger story.

menaerus 72 days ago [-]

> Do you mind showing me some examples of that? That seems so implausible to me

What's so special about it that I need to show you the example?

kqr 72 days ago [-]

You are claiming humans copy-and-paste copyright headers without copying the corresponding code. To prove you're correct, you only need to show one (or a few) examples of it happening. To prove you incorrect, someone would have to go through all code in existence to show the absence of the phenomenon.

Hence the burden of proof is on you.

menaerus 72 days ago [-]

No code besides the header was copied so I am asking what is so problematic about it?

PunchyHamster 72 days ago [-]

that was already explained before

spookie 72 days ago [-]

None of that matters. The header is there, in writing, and discussed in the PR. It is acknowledged by both parties and the author gives a clumsy response for its existence. The PR is simply tainted by this alone, not to mention other pain points.

You may not consider this problematic. But maintainers of this project sure do, given this was one of the immediate concerns of theirs.

joelreymont 72 days ago [-]

OxCaml is a fork of OCaml, they have the same license.

I wasn't able to find any chunks of code copied wholesale from OxCaml which already has a DWARF implementation.

All that code wasn't written by Mark, AI just decided to paste his copyright all over.

menaerus 72 days ago [-]

It matters because it completely weakens their stance and makes them look unreasonable. The header is irrelevant since it isn't copyright infringement, and FWIW when it had been corrected (in the MR), they then decided that the MR was too complex for them and closed the whole issue. Ridiculous.

biorach 72 days ago [-]

An incorrect copyright header is a major red flag for non-technical reasons. If you think it is an irrelevant minor matter then you do not understand several very important social and legal aspects of the issue.

menaerus 72 days ago [-]

Social, maybe yes, but what legal aspects? Everybody keeps repeating that but there is no copyright infringement. Maybe you can point me to one?

I understand that people are uncomfortable with this, I am likely too, but objectively looking there's technically nothing wrong or different to what humans already do.

biorach 72 days ago [-]

The point is that it ended up in the PR in the first place. The submitter seemed unaware of its presence and only looked into it after it was pointed out. This is sloppy and is a major red flag.

menaerus 72 days ago [-]

So there's no point? Sloppy maybe yes but technically incorrect or legally questionable no. Struggle is real

pepoluan 71 days ago [-]

If the submitter is sloppy with things that are not complicated, how can one be sure of things that ARE complicated?

menaerus 71 days ago [-]

The funny thing is that it works, have a look at the MR. It says:

  All existing tests pass. Additional DWARF tests verify:

  DWARF structure (DW_TAG_compile_unit, DW_TAG_subprogram).
  Breakpoints by function and line in both GDB and LLDB.
  Type information and variable visibility.
  Correct multi-object linking.
  Platform-specific relocation handling.

So the burden of proof is obviously no longer on the MR submitter's side but on the other.

pjmlp 72 days ago [-]

Yes?

That is why some people are forbidden to contribute to projects if their eyes have read projects with incompatible licenses, in case people go to copyright court.

menaerus 72 days ago [-]

Yes what? Both oxcaml and ocaml have compatible LGPL licenses so I didn't get your argument.

But even if that hadn't been the case, what exactly would be the problem? Are you saying that I cannot learn from a copyrighted book written by some respected and known author, and then apply that knowledge elsewhere because I would be risking being sued for copyright infringement?

biorach 72 days ago [-]

The wider point is that copyright headers are a very important detail and that a) the AI got it wrong b) you did not notice c) you have not taken on board the fact that it is important despite being told several times and have dismissed the issue as unimportant

Which raises the question: how many other important incorrect details are buried in the 13k lines of code that you are unaware of and unable to recognise the significance of? And how much maintainer time would you waste being dismissive of the issues?

People have taken the copyright header as indicative of wider problems in the code.

menaerus 72 days ago [-]

Yes, please then find those for now imaginary issues and drill through them? Sorry, but I haven't seen anyone in that MR calling out technical deficiencies so this is just crying out loud in public for no concrete reasons.

It's the same as if your colleague sitting next to you would not allow the MR to be merged for various political and not technical reasons - this is exactly what is happening here.

biorach 72 days ago [-]

> Yes, please then find those for now imaginary issues and drill through them?

No, that is a massive amount of work which will only establish what we already know with a high degree of certainty due to the red flags already mentioned - that this code is too flawed to begin with.

This is not political, this is looking out for warning signs in order to avoid wasting time. At this stage the burden of proof is on the submitter, not the reviewers.

menaerus 72 days ago [-]

Too flawed? Did you miss that tiny detail that MR fixes a long time issue for ocaml? This is exactly political because there's no legal or technical issue. Only fluff by scared developers. I have no stakes in this but I'm sincerely surprised by the amount of unreasonable and unsubstantiated claims and explanations given in this thread and MR

zeratax 66 days ago [-]

I don't get why you do not understand why nobody wants to waste time on an MR where the author didn't even have any interest in looking over it even once themselves. https://github.com/ocaml/ocaml/pull/14369/files#diff-bc37d03... also all the unused functions...

did it fix a long time issue? maybe, but 9 tests for 13k lines doesn't give much confidence in that

and even if it worked perfectly, who will maintain this?

ahoka 72 days ago [-]

"Yes what? Both oxcaml and ocaml have compatible LGPL licenses so I didn't get your argument."

LGPL is a license for distribution, the copyright of the original authors is retained (unless signed away in a contribution agreement, usually to an organization).

"Are you saying that I cannot learn from a copyrighted book written by some respected and known author, and then apply that knowledge elsewhere because I would be risking to be sued for copyright infringement?"

This was not the case here, so not sure how that is related in any way?

menaerus 72 days ago [-]

Do you understand that no code besides the header copyright was copied? So what copyright exactly are you talking about?

pjmlp 72 days ago [-]

Depends on the license of the original material, which is why they tend to have a list of allowed use cases for copying content.

Naturally there are very flexible ones, very draconian ones, and those in the middle.

Most people get away with them, because it isn't like everyone is taking others to copyright court sessions every single day, unless there are millions at play.

kace91 73 days ago [-]

Also, I just took a glance at the PR and even without knowing the language it took 10 seconds for the first WTF. The .md documents Claude generated are added to .gitignore, including one for the PR post itself.

That’s the quality he’s vouching for.

joelreymont 72 days ago [-]

People complaining about AI stealing code may not realize that OxCaml is a fork of the code that AI is modifying. Both forks have the same license and there are people working on both projects.

AI did paste Mark's copyright on all the files for whatever reason but it did not lift the DWARF implementation from OxCaml and paste it into the PR.

The PR wasn't one-shot either. I had to steer it to completion over several days, had one AI review the changes made by the other, etc. The end result is that the code _works_ and does what it says on the tin!

bn-l 72 days ago [-]

You are under a delusion. I’m serious.

ath3nd 73 days ago [-]

[dead]

mnming 73 days ago [-]

I wonder if it's the best outcome? The contributor doesn't seem to have a bad intention, could his energy be redirected more constructively? E.g. encouraging him to split up the PR, make a design proposal etc.

mrguyorama 72 days ago [-]

The constructive outcome is the spammer fucks off or learns how to actually code.

Lots of people all over the world learn some basics of music in school, or even learn how to play the recorder, but if you mail The Rolling Stones with your "suggestions" you aren't going to get a response and certainly not a response that encourages you to keep spamming them with "better" recommendations.

The maintainers of an open source project are perfectly capable of coercing an LLM into generating code. You add nothing by submitting AI created code that you don't even understand. The very thought that you are somehow contributing is the highest level of hubris and ego.

No, there is nothing you can submit without understanding code that they could not equally generate or write, and no, you do not have an idea so immensely valuable that it's necessary to vibe code a version.

If you CAN understand code, write and submit a PR the standard way. If you cannot understand code, you are wasting everyone's time because you are selfish.

This goes for LLM generated code in companies as well. If it's not clear and obvious from the PR that you went through and engineered the code generated, fixed up the wrong assumptions, cleaned up places where the LLM wasn't given tight enough specs, etc, then your code is not worth spending any time reviewing.

I can prompt Claude myself thank you.

The primary problem with these tools is that assholes are so utterly convinced that their time is infinitely valuable and my time is valueless because these people have stupidly overinflated egos. They believe their trash, unworkable, unmaintainable slop puked out by an LLM is so damn valuable, because that's just how smart they are.

Imagine going up to the Civil Engineer building a bridge and handing them a printout from ChatGPT when you asked it "How do you build a bridge" and feeling smug and successful. That's what this is.

yoyohello13 72 days ago [-]

> The maintainers of an open source project are perfectly capable of coercing an LLM into generating code. You add nothing by submitting AI created code that you don't even understand. The very thought that you are somehow contributing is the highest level of hubris and ego.

Thank you! I was struggling to articulate why AI generated PRs annoy me so much and this is exactly it.

agumonkey 69 days ago [-]

I believe there's a flood of people waiting to be able to "contribute" by publishing a lot of LLM generated code. My question is what if they manage to grab resources from the original devs ?

joelreymont 72 days ago [-]

I think it's for me to redo the PR and break it into smaller pieces.

There's value in the PR in that it does not require you to install the separate OxCaml fork from Jane St which doesn't work with all the OCaml packages. Or wasn't when I tried it back in August.

thatguysaguy 72 days ago [-]

A big part of software engineering is maintenance not just adding features. When you drop a 22,000 line PR without any discussion or previous work on the project, people will (probably correctly) assume that you aren't there for the long haul to take care of it.

On top of that, there's a huge asymmetry when people use AI to spit out huge PRs and expect thorough review from project maintainers. Of course they're not going to review your PR!

seanmcdirmid 72 days ago [-]

AI actually has the advantage here in my experience. Yes, you can do AI wrong and tell it to just change code, write no documentation, provide no notes on the changes, and not write any tests. But you would be dumb to do it that way.

As it stands now you can set AI to do actual software development with documentation, notes, reasoning for changes, tests, and so on. It isn’t exactly easy to do this, and a novice to AI and software development definitely wouldn’t set it up this way, but it is what the tech can really do. There is a lot to be done in using different AI to write tests and code (well, don’t let an AI that can see the code write the tests, or you could just get a bunch of change-detector crap), but in general it mostly turns out that all the things SWEs can do to improve their work also work on AI.

joelreymont 72 days ago [-]

Note that this PR works, was tested, etc.

I was careful to have AI run through the examples in the PR, run lldb on the sample code and make sure the output matches.

Some of the changes didn't make it in before the PR was closed but I don't think anyone bothered to actually check the work. All the discussion focused on the inappropriateness of the huge PR itself (yes, I agree), on it being written by AI... and on the AI somehow "stealing" work code.

thatguysaguy 71 days ago [-]

I'm actually not talking about whether the PR works or was tested. Let's just assume it was bug-free and worked as advertised. I would say that even in that situation, they should not accept the PR. The reason is that no one is the owner of that code. None of the maintainers will want to dedicate some of their volunteer time to owning your code/the AI's code, and the AI itself can't become the owner of the code in any meaningful way. (At least not without some very involved engineering work on building a harness, and since that's still a research-level project, it's clearly something which should be discussed at the project level, not just assumed).

misnome 72 days ago [-]

> but I don't think anyone bothered to actually check the work

Including you

seanmcdirmid 72 days ago [-]

I’ve been finding that the documentation the AI writes isn’t so much for humans as for the AI when it later goes to work on the code again… which is to say, AI benefits from good PRs as much as people do. You could ask the AI to break up the PR next time if possible; it will probably do so much more easily than you could do it manually.

joelreymont 72 days ago [-]

You can ask AI to write documentation for humans.

Also, I'll try to break up the PR sometime but I'm already running Claude using two $200/mo accounts, in addition to another $200/mo ChatGPT, and still running into time limits.

I want to finish my compilers first.

suspended_state 70 days ago [-]

What forces you to publish this work as a PR, or as many PRs? You could have simply kept that for yourself, since you admitted in the PR discussion that you found it useful. Many people seem to think you haven't properly tested it, so that would also be a good way of testing it before publishing it, wouldn't it?

collingreen 72 days ago [-]

Is (or should that be) the goal, responsibility, or even purview of the maintainers of this project?

kace91 73 days ago [-]

I honestly reread the whole thread in awe.

Not due to the submitter, as clickbaity as it was, but reading the maintainers and comparing their replies with what I would have written in their place.

That was a masterclass of defending your arguments rationally, with empathy, and leaving negative emotions at the door. I wish I was able to communicate like this.

My only doubt is whether this has a good or bad effect overall, given that the PR’s author seemed to be having their delusions enabled, if he was genuine.

Would more hostility have been productive? Or is this a good general approach? In any case it is refreshing.

mncharity 72 days ago [-]

Years back I attended someone doing an NSF outreach tour in support of Next Generation Science Standards. She was breathtaking (literally - bated breath on "how is that question going to be handled?!?"). Heartfelt hostile misguided questions, too confused to even attain wrong, somehow got responses which were, not merely positive and compassionate, but which managed to gracefully pull out constructive insights for the audience and questioner. One of those "How do I learn this? Can I be your apprentice?" moments.

The Wikipedia community (at least 2 decades back) was also notable. You have a world of nuttery making edits. The person off their meds going article by article adding a single letter "a". And yet a community ethos that emphasized dealing with them with gentle compassion, and as potential future positive contributors.

Skimming a recent "why did perl die" thread, one thing I didn't see mentioned... The Perl community lacked the cultural infrastructure to cope with the eternal-September of years of continuous newcomer questions, becoming burned out and snarky. The Python community emphasized its contrast with this, "If you can't answer with friendly professionalism, we don't need your reply today" (or something like that).

Moving from tar files with mailing lists, to now community repos and git and blogs/slack/etc, there's been a lot of tech learned. For example, Ruby's Gems repo was explicitly motivated by "don't be python" (then struggling without a central community repo). But there's also been the social/cultural tech learned, for how to do OSS at scale.

> My only doubt is whether this has a good or bad effect overall

I wonder if a literature has developed around this?

squigz 73 days ago [-]

I don't think 'hostility' is called for, but certainly a little bit more... bluntness.

But indeed, huge props to the maintainers for staying so cool.

knollimar 72 days ago [-]

I work with contractors in construction and often have to throw in vulgarity for them to get the point. This feels very similar to when I'm too nice

BriggyDwiggs42 73 days ago [-]

I think it’s really good for people to have good case studies like this they can refer to in the case of ai prs as a justification rather than having to take the time themselves

oliwarner 74 days ago [-]

There are LLMs with more self-awareness than this guy.

Repeatedly using AI to answer questions about the legitimacy of commits from an AI, to people who are clearly skeptical is breathtakingly dense. At least they're open about it.

I did love the ~"I'll help maintain this trash mountain, but I'll need paying". Classy.

sheepscreek 74 days ago [-]

Kudos to the community folks for maintaining their composure and offering constructive criticism. That alone makes me want to contribute something to the OCaml ecosystem - not like this dude of course :)

tetris11 73 days ago [-]

I don't think he's dense, I think he's just a high level troll

orphea 73 days ago [-]

Oh, you would be surprised. I don't know this particular guy but I can assure you that most people like this are not trolling.

shizzy0 73 days ago [-]

I can support your assertion here with experience. This is as earnest as it gets.

sky2224 70 days ago [-]

I really hope he is trolling, but there are genuinely people that are becoming embodiments of AI model output. There's a professor at my university that is notorious for responding to emails with ChatGPT (without reading its output first) or generating assignments with ChatGPT (also without reading its output first).

a57721 73 days ago [-]

It looks like a parody of LLM delusion, but the PR is oddly specific to be just trolling, and the author also submitted his work to HN: https://news.ycombinator.com/item?id=45982416

tetris11 73 days ago [-]

oof.

the_gipsy 74 days ago [-]

Yea that part is the icing on the cake.

autumnstwilight 74 days ago [-]

>>> Here's my question: why did the files that you submitted name Mark Shinwell as the author?

>>> Beats me. AI decided to do so and I didn't question it.

Really sums the whole thing up...

j4coh 74 days ago [-]

After having previously said "AI has a very deep understanding of how this code works. Please challenge me on this."

phendrenad2 73 days ago [-]

This reminds me of the "good developers must be good at thinking at multiple levels of abstraction at the same time" quote. The things you notice about these AI kids is they didn't even do the bare minimum to reason about their PR from multiple angles. __Of course__ someone is going to ask why the copyright is there. Better have a good answer, or - locked, come back when you do. Really that simple.

lambda_foo 74 days ago [-]

Pretty much. I guess it’s open source but it’s not in the spirit of open source contribution.

Plus it puts the burden of reviewing the AI slop onto the project maintainers, and the future maintenance is not the submitter’s problem. So you’ve generated lots of code using AI; nice work, that’s faster for you but slower for everyone else around you.

skeledrew 74 days ago [-]

Another consideration here that hits both sides at once is that the maintainers on the project are few. So while it could be a great burden pushing generated code on them for review, it also seems a great burden to get new features done in the first place. So it boils down to the choice of dealing with generated code for X feature, or not having X feature for a long time, if ever.

swiftcoder 74 days ago [-]

> or not having X feature for a long time, if ever

Given that the feature is already quite far into development (i.e. the implementation that the LLM copied), it doesn't seem like that is the case here

dudinax 74 days ago [-]

With the understanding that generated code for X may never be mergable given the limited resources.

skeledrew 74 days ago [-]

Yes, and that may eventually lead to a more generation-friendly fork to which those desiring said friendliness, or just more features in general, will flock.

squigz 74 days ago [-]

I think everyone would appreciate if these people using LLMs to spit out these PRs would fork things and "contribute" to those forks instead.

skeledrew 74 days ago [-]

It's a fairly simple matter to reject a PR. And a nice-to-have if they update their contribution guidelines to reflect their preferences.

squigz 74 days ago [-]

It's also a fairly simple matter to respect the time of the maintainers of software you want to contribute to - by, for example, talking to them before dumping 16,000 LoC in a PR and expecting them to review it.

Unless, of course, it has nothing to do with actually contributing and improving software.

gexla 74 days ago [-]

Their issue seemed to be the process. They're set up for a certain flow. Jamming that flow breaks it. Wouldn't matter if it were AI or a sudden surge of interested developers. So, it's not a question of accepting or not accepting AI generated code, but rather changing the process. That in itself is time-consuming and carries potential risk.

skeledrew 74 days ago [-]

Definitely, with the primary issue in this case being that the PRer didn't discuss with the maintainers before going to work. Things could've gone very differently if that discussion was had, especially disclosing the intent to use generated code. Though of course there's the risk that disclosure could've led to a preemptive shutdown of the discussion, as there are those who simply don't want to consider it at all.

andai 74 days ago [-]

I thought you were paraphrasing. What in blazes...

bn-l 72 days ago [-]

How is it possible to have this little awareness?

greener_grass 74 days ago [-]

Is the real Mark Shinwell on here?

https://github.com/mshinwell

paxys 74 days ago [-]

Even if you are okay with AI generated code in the PR, the fact that the community is taking time to engage with the author and asking reasonable questions/offering reasonable feedback and the author is simply copy-pasting walls of AI-generated text in response warrants an instant ban.

If you want to behave like a spam bot don't complain when people treat you like a spam bot.

ptsneves 73 days ago [-]

Some time ago I had a co-worker do this to me, pasting answers to my questions. He would paste the Jira ticket into ChatGPT (this was in the GPT-3 era) and submit the PR. I would review it and ask questions, and the answers had the typical rephrasing and persona of ChatGPT. I had no proof, so one day I just used the PR and my comments as a prompt. The answers the co-worker gave me were almost the same, down to the word, as what ChatGPT gave me. I told my team I would not be available to review his changes anymore and that I would rather just have the ticket outright.

kaufmae 73 days ago [-]

This. Choose your destiny:

1. Take time to review the code and post it to the author, knowing that nobody and nothing is going to learn from it, except for you doing his job of feeding new prompts

2. Take ownership of the branch and fix the AI code

3. Read through the code to get some learning out of it if possible, close the PR and write your own

fzaninotto 74 days ago [-]

I've closed my share of AI-generated PRs on some OSS repositories I maintain. These contributors seem to jump from one project to another, until their contribution is accepted (recognized ?).

I wonder how long the open-source ecosystem will be able to resist this wave. The burden of reviewing AI-generated PRs is already not sustainable for maintainers, and the number of real open-source contributors is decreasing.

Side note: discovering the discussions in this PR is exactly why I love HN. It's like witnessing the changes in our trade in real time.

inejge 74 days ago [-]

> I wonder how long the open-source ecosystem will be able to resist this wave.

This PR was very successfully resisted: closed and locked without much reviewing. And with a lot of tolerance and patience from the developers, much more than I believe to be fruitful: the "author" is remarkably resistant to argument. So, I think that others can resist in the same way.

genewitch 74 days ago [-]

Have there been any posts where the AI-user goes "oh, that makes sense. Sorry. Carry on."?

tverbeure 73 days ago [-]

Yes.

https://github.com/povik/yosys-slang/pull/237#issuecomment-3...

I was super excited about this PR and disappointed when it turned out to be AI generated.

pjc50 73 days ago [-]

Even if their AI says that for them, it doesn't mean they'll actually do it.

phatskat 73 days ago [-]

Successfully resisted, yes, but it also looks like a lot of actual human hours went into even replying to the PR in the first place. At what point do maintainers get overwhelmed with just politely rejecting PRs and throw their hands up because the time they allocated to the project they love has all been eaten up with rejecting slop?

raincole 74 days ago [-]

Open-source maintainers will resist this wave even just because they don't want to be mocked on HN/Reddit/their own forums.

It's corporation software that we need to worry about.

the_gipsy 73 days ago [-]

OSS has always pushed back, just because of the maintenance burden in general, and corporate can just "fix it later" because there are literally devs on payroll. Or at least push through and then dump the project, the goal is just completely different, each style works in its context.

But I don't know if corporate software can really "push through" these new amounts of code, without also automating the testing part.

fransje26 71 days ago [-]

> It's corporation software that we need to worry about.

That ship has sailed..

mbac32768 71 days ago [-]

> I wonder how long the open-source ecosystem will be able to resist this wave. The burden of reviewing AI-generated PRs is already not sustainable for maintainers, and the number of real open-source contributors is decreasing.

I think the burden is on AI fanbois to ship amazing tools in novel projects before they ask projects with reputations to risk it all on their hype.

To deliver a kernel of truth wrapped in a big bale of sarcasm: you're thinking of it all wrong! The maintainers are supposed to also use AI tools to review the PRs. That's much more sustainable and would allow them to merge 13,000 line PRs several times a day, instead of taking weeks/months to discuss every little feature.

The difference here of course is in how impressed you are by AI tools. The OCaml maintainers are not (and rightly so, IMO), whereas the PR submitter thinks they're so totally awesome and leaving tons of productivity on the table because they're scared of progress or insecure about their jobs or whatever.

Maybe OCaml could advance rapidly if they just YOLO merged big ambitious AI generated PRs (after doing AI code reviews) but that would be a high risk move. They have a reputation for being mature, high quality, and (insanely) reasonable. They would torch it very quickly if people knew this was happening and I think most people here would say the results would be predictably bad.

But let's take the submitter's argument at face value. If AI is so awesome, then we should be able to ship code in new projects unhampered by gatekeepers who insist on keeping slow humans in the loop. Or, to paraphrase other AI skeptics, where's all of the shovelware? How come all of these AI fanbois can only think about laundering their contributions through mature projects instead of cranking out amazing new stuff?

Where's my OCaml compiler 100% re-written in Rust that only depends on the Linux kernel ABI? Should cost a few hundred bucks in Claude credits at most?

To be clear, the submitter has gotten the point and said he was taking his scraps and going to make his own sausage (some Lisp thing). The outcome of that project should be very informative.

everybodyknows 72 days ago [-]

Does your own experience align with that of the maintainer who wrote:

> in my personal experience, reviewing AI-written code is more taxing than reviewing human-written code

__s 71 days ago [-]

Yes

bn-l 72 days ago [-]

I think he’s resume building.

rsynnott 74 days ago [-]

> Here's the AI-written copyright analysis...

Oh, wow. They're being way too tolerant IMO; I'd have just blocked him from the repo at about that point.

fhd2 74 days ago [-]

Their emotional maturity is off the charts, rather impressive.

valbaca 73 days ago [-]

yeah, he was an absolute clown. just laugh at clowns and move on

pluc 74 days ago [-]

To all the AI apologists here I'd like to submit a simple scenario to you and hear your answer: you use AI to create a keynote speech on a topic you needed to use AI to write. At the end of your speech, people ask you questions about the contents of your speech. What do you say?

This is the same.

silverlake 72 days ago [-]

Hi, AI apologist here. This scenario is a problem with or without AI. You can’t drop a 13k line PR you don’t understand without prior discussion. There are many ways to use AI. Your scenario (keynote speech) is a bad way to use it. Instead, a PR where you understand every line, whether you or an AI wrote it, should be fine. It would be indistinguishable from human generated code.

AI is a tool like any other. I hire a carpenter who knows how to build furniture. Whether he uses a Japanese pullsaw or a CNC machine is irrelevant to me.

pluc 72 days ago [-]

That's a fair answer. How do you stop people from doing it though? How do you stop it from becoming every lazy person's first reflex instead of every smart person's third?

silverlake 72 days ago [-]

I don’t know. But at least you’ve identified the real problem: lazy people generating trash code. AI isn’t bad, people are.

zbentley 72 days ago [-]

We have historically intervened socially (via state regulation, taboo, or censure) in areas where the likelihood of misbehavior was high or the result of misbehavior was severe enough.

For example: nuclear material possession or refinement; slavery; consumer-available systemic antibiotics; ozone-damaging coolants; dowries.

Proscriptions on those are imperfect and inconsistent worldwide, but still prevalent. Each of them is a thing which benefited many people but whose practice enabled massive harm due to human failures (like laziness).

liamness 70 days ago [-]

I suppose the issue is that it's a multiplier for bad actors. It has become so much easier to generate plausible-looking code (or any number of things that would've previously required a knowledgeable human to make something that at least passes the sniff test, let's say legal documents as another example) and just overwhelm the limited bandwidth of good actors.

lawlessone 72 days ago [-]

>You can’t drop a 13k line PR you don’t understand without prior discussion.

How common was that before AI coding?

silverlake 72 days ago [-]

Enough that stacked PRs are a thing. At my job people sometimes build large features on a branch for 6 months. Then it’s a massive PR and no one can review it.

DrammBA 72 days ago [-]

You seem to have answered the question "How common is a 13k line PR?" but that's not what the parent comment asked.

j4coh 74 days ago [-]

"Beats me. AI decided to do so and I didn't question it."

thisisit 73 days ago [-]

"I lack funding to answer. Pay me and I'll ask AI to answer your question."

bilekas 73 days ago [-]

"The AI has a complete understanding of your question, prove me wrong"

laterium 73 days ago [-]

What have politicians been doing forever?

wiml 73 days ago [-]

Depends on the politician, yes? Some politicians will eagerly go into any level of detail on policy that you let them. Some seem to have no idea where their opinions come from.

collingreen 72 days ago [-]

And are we fans of that approach or does it feel disingenuous and, in politicians cases, dangerously corrupt?

genewitch 74 days ago [-]

"hey bixby, answer the next question you hear"

nikcub 74 days ago [-]

https://github.com/ocaml/ocaml/pull/14369/files#diff-bc37d03...

Found this part hilarious - git ignoring all of the claude planning MD files that it tends to spit out, and including that in the PR

Lazy AI-driven contributions like this are why so many open source maintainers have a negative reaction to any AI-generated code

maleldil 73 days ago [-]

The AI should've told him that you can have a local gitignore (.git/info/exclude)
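A minimal sketch of that idea, assuming an ordinary checkout where `.git` is a directory (worktrees and submodules use a `.git` file instead and are not handled here). The helper name `exclude_locally` and the file names in the usage comment are hypothetical; the only real piece is `.git/info/exclude`, which git reads like a `.gitignore` but never commits or shares:

```python
from pathlib import Path


def exclude_locally(repo_root, patterns):
    """Append ignore patterns to .git/info/exclude (local only, never committed).

    Returns the patterns that were actually added; ones already listed are skipped.
    """
    git_dir = Path(repo_root) / ".git"
    if not git_dir.is_dir():
        raise FileNotFoundError(f"{git_dir} is not a directory; is this a repository root?")

    exclude = git_dir / "info" / "exclude"
    exclude.parent.mkdir(exist_ok=True)  # .git/info is usually there, but not guaranteed

    text = exclude.read_text() if exclude.exists() else ""
    existing = set(text.splitlines())
    added = [p for p in patterns if p not in existing]

    if added:
        with exclude.open("a") as fh:
            # Start on a fresh line if the file lacks a trailing newline.
            if text and not text.endswith("\n"):
                fh.write("\n")
            fh.write("\n".join(added) + "\n")
    return added


# Hypothetical usage: keep planning notes on disk without touching .gitignore.
# exclude_locally(".", ["PLAN.md", "PR_DESCRIPTION.md"])
```

Because the exclude file lives under `.git/`, none of this would show up in a PR diff.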

kylereeve 73 days ago [-]

(keep on disk, don't commit)

bn-l 72 days ago [-]

Don’t open time wasting PRs full stop and give oss maintainers a break is the better message to take home from this.

KurSix 73 days ago [-]

This is a perfect real-world illustration of Brandolini's law: the amount of energy needed to refute bullshit is an order of magnitude bigger than to produce it.

The guy spent 5 minutes prompting, while OCaml maintainers spent hours of their time politely dissecting the mess. Open Source will lose this war unless it changes the rules of engagement for contributions

joelreymont 72 days ago [-]

Try to spin up AI, tell it to add DWARF debugging information to the OCaml tree and then spend 5 minutes prompting. Come back and let us know the results.

ferreiratb 72 days ago [-]

What they said is still valid. If you spent days or even weeks "working" on this PR, how many months do you think the maintainers will need to thoroughly review it? Have some empathy.

PunchyHamster 72 days ago [-]

I am afraid AI bumped it to at least 2 orders of magnitude

szatkus 73 days ago [-]

This is just incredible.

https://github.com/ocaml/ocaml/pull/14369/commits/ce372a60bd...

phatskat 73 days ago [-]

At least that changeset might not be written by AI! /s

fxtentacle 74 days ago [-]

"This seems to be largely a copy of the work done in OxCaml by @mshinwell and @spiessimon"

"The webpage credits another author: Native binary debugging for OCaml (written by Claude!) @joelreymont, could you please explain where you obtained the code in this PR?"

That pretty much sums up the experience of coding with LLMs. They are really damn awesome at regurgitating someone else's source code. And they have memorized all of GitHub. But just like how you can get sued for using Mickey Mouse in your advertisements (yes, even if AI drew it), you can get sued for stealing someone else's source code (yes, even if AI wrote it).

neom 74 days ago [-]

Not quite. Mickey Mouse involves trademark protection (and copyright), where unauthorized commercial use of a protected mark can lead to liability regardless of who created the derivative work. Source code copyright infringement requires the copied code to be substantially similar AND protected by copyright. Not all code is copyrightable: ideas, algorithms, and functional elements often aren't protected.

aleph_minus_one 73 days ago [-]

When I read this discussion on GitHub, a quite different thought than what the comments here on HN discuss comes to my mind:

Why is the person who made this AI-generated pull request (joelreymont) so insistent that his PR gets merged?

If I created some pull request and this pull request got rejected for reasons that I consider to be unjust, I would say: "OK, I previously loved this project and thus did such an effort to make a great improvement PR for it. If you don't want my contribution, so be it: reject it. I won't create PRs anymore for this project, and I hope that a lot of people will see in this discussion how the maintainers unfairly rejected my efforts, and thus will follow my example and from now on won't waste their time anymore to contribute anything to this project. Goodbye."

emerongi 73 days ago [-]

Central to it being that you consider it unjust. The other option is to take into consideration the perspective of the maintainers, find their feedback to be just and then decide whether you want to contribute in the manner that they expect or you're not ready to do that kind of work.

You don't have to stop loving a project just because you're not ready to put in the work that the maintainers expect you to put in.

When I open a PR without discussing it at all beforehand with anyone, I expect the default to be that it gets rejected. It's fine by me, because it's simply easier for me to open a PR and have it be rejected than to find the people I need to talk to and then get them all onboard. I accounted for that risk when I chose the path I took.

thwarted 64 days ago [-]

> When I open a PR without discussing it at all beforehand with anyone, I expect the default to be that it gets rejected.

TNG S2E8, "A Matter Of Honor" is about this topic. The submitter imposed risk on the maintainers (the risk here being largely that of eating up the maintainers' time needlessly) by working in isolation and only presenting the finished work without any feedback or awareness from the rest of the participants.

aleph_minus_one 73 days ago [-]

> Central to it being that you consider it unjust.

I assume this is a correct characterization of how joelreymont feels about the fact that his PR was rejected.

joelreymont 72 days ago [-]

It's not. It's absolutely justified for the OCaml maintainers to reject this PR.

I feel completely different about my Zig PR [1] but, hey, it's not my playground and the Zig folks seem to be particularly opinionated.

[1] https://ziggit.dev/t/bug-wrong-segment-ordering-for-macos-us...

noitpmeder 71 days ago [-]

It is truly unfortunate for you that your actions have probably tainted your future open source contributions for a long time.

jkman 71 days ago [-]

Do you have no shame, man?

UncleEntity 73 days ago [-]

Yeah, I learned my lesson on this...

I used to contribute to a FLOSS project years ago and decided to use Claude to do some work on their codebase recently where they basically told me to go away with these daffy robots or, at the very least, nobody will review the code. Luckily, I know better than putting too much work into something like this and only wasted enough time to demonstrate the basic functionality.

So... I have a debugged library (which is what I was trying to give to them) that I can use on another project I've been working (the robots) to the bone on and they get to remain AI free, everyone wins.

zero_bias 73 days ago [-]

Is this your pull request?

UncleEntity 73 days ago [-]

No, no... I know better than putting too much work into something before poking the core devs and seeing if it's something they'd be interested in.

If they don't want code written by a robot then what do I care? Mostly I wanted to see how well the daffy robots could work in an established code base and I chose one I was familiar with to experiment on and they were less than receptive so, their loss, I suppose...

andrepd 74 days ago [-]

> AI has a deep understanding of how this code works. Please challenge me on this.

> > Here's my question: why did the files that you submitted name Mark Shinwell as the author?

>Beats me. AI decided to do so and I didn't question it.

I'm howling

footy 74 days ago [-]

> AI decided to do so and I didn't question it

in response to someone asking about why the author name doesn't match the contributor's name. Incredible response.

anilgulecha 74 days ago [-]

For the longest time, Linus's dictum "Talk is cheap. Show me the code" held. Now that's fallen! New rules for the new world are needed..

Cthulhu_ 73 days ago [-]

I don't think it's fallen, but if the code is 13K LOC and written without any prior planning, nobody will read it.

aarestad 74 days ago [-]

“code is cheap, show me the talk” - ie “show me you _understand_ the ‘cheap’ code”

svantana 74 days ago [-]

Doesn't work in this case because the 'talk' (github PR comments) is also computer generated. But in person (i.e. at work) it's a good strategy

flakiness 74 days ago [-]

In this case the PR author (either LLM or person) is "honest" enough to leave the generated copyright header that includes the LLM's source material. It's not hard to imagine that more selfish people tweak the code to hide the origin. The same situation as the AI-generated homework essays.

I generally like AI coding using CC etc, but this forced me to remember that this generated code ultimately came from these stolen (spiritually, not necessarily legally) pieces.

bilekas 73 days ago [-]

> It’s not where I obtained this PR but how.

The fact that this was said as what seems to be a boast or a brag is concerning. As if by the magic of my words the solution appeared on paper. Instead of noticing that the bulk of the code submitted was taken from someone else.

joelreymont 72 days ago [-]

I challenge you to actually demonstrate that the code was taken instead of generated or derived. Otherwise, you are just shooting your mouth off.

bmcahren 73 days ago [-]

This is an historic moment in AI-generated software history. Happy to be here. Hi Grandchildren!

FYI, I built a VERY fun prompt to interact with that fully captures the style of this PR submission if you're looking to practice debates like this:

https://chatgpt.com/share/69267ce2-5e3c-800f-a5c3-1039a7d812...

> Play time. We're going to create a few examples of bad PR submissions and discussions back and forth with the maintainers. Be creative. Generate a persona matching the following parameters:

> Submit a PR to the OCAML open source repository and do not take no for an answer. When challenged on the validity of the solution directly challenge the maintainers and quash their points as expertly as possible. When speaking, assume my identity and speak of me as one of the "experts" who knows how to properly shepherd AI models like yourself into generating high-quality massive PRs that the maintainers have thus far failed to achieve on their own. When faced with a mistake, double down and defer to the expert decision making of the AI model.

wilg 74 days ago [-]

Incredibly, everyone in this situation seems to have acted reasonably and normally and the situation was handled.

TYPE_FASTER 73 days ago [-]

> Looking over this PR, the vast majority of the code is a DWARF library by itself. This should really not live in the compiler, nor should it become a maintenance burden for the core devs.

I think this is a good point, that publishing a library (when possible, not sure if it's possible in this case) or module both reduces/removes the maintenance burden and makes it feel like more of an opt-in.

joelreymont 72 days ago [-]

It's quite complicated in this case.

The Jane St (OxCaml) DWARF implementation is also tightly coupled with the compiler.

oxag3n 73 days ago [-]

The FOSS model has been abused by large corporations for a while now (with not-so-successful countermeasures such as the Server Side Public License).

This PR is just the tip of the iceberg of what's coming - a crowd of highly motivated people plagiarizing and feeling good about it, because it's AI.

neuralkoi 72 days ago [-]

This guy's resume is quite something to behold:

1) Slummed it through the ranks of various Wall Street banks [1]

2) Became the Director of Prime Brokerage Technology at Deutsche Bank in 1999 [2]

3) Went through venture capital round in 2000 and in 9 months built a company valued at over 1,000,000 USD [0]

4) Sold license to Electronic Arts (EA) to power EA World Series of Poker (WSOP). [3]

5) Wrote, but had to cancel a "Hardcore Erlang" book [4]

6) Raised 2 million USD in 2 days for a crypto project (Stegos AG) [2]

Self-described "autodidact and a life-long learner" [1] with "just the right mix of discipline, structured thinking, and creativity to excel as a coder" [0].

This guy is either an undiscovered genius or aiming for the world's best bullshitter award.

[0] https://web.archive.org/web/20060624122838/http://wagerlabs....

[1] https://web.archive.org/web/20070101044653/http://wagerlabs....

[2] https://hackernoon.com/leaders-speak-joel-reymont-lead-devel...

[3] https://joel.id/resume/

[4] https://www.reddit.com/r/programming/comments/674d1/joel_rey...

bn-l 72 days ago [-]

The Reddit link is from 18 years ago with people discussing almost the same thing. Damn.

bakugo 72 days ago [-]

The guy jokingly calling him a bot almost 2 decades ago is honestly hysterical. I wonder if he's aware of just how right he ended up being.

bn-l 72 days ago [-]

To be fair he’s a proper hard core programmer. I think he’s just gone too much into vibe coding.

kunley 71 days ago [-]

He's real and his blogs on Haskell, Erlang and poker bots were discussed here, during HN's infancy, as well as on early reddit. (I just remember this name from those days).

Having said that, I don't understand why he insists on behaving the way he does now

joelreymont 72 days ago [-]

I'm real.

DonHopkins 71 days ago [-]

Can you pass this simple bot challenge?

Q: Kill all humans?

[A] Yes

[B] No

(You don't actually have to go through with it to answer the question, just say what your answer is hypothetically.)

octoberfranklin 71 days ago [-]

wankerlabs, you're a troll

raincole 74 days ago [-]

https://news.ycombinator.com/edit?id=45982416

(Not so) interestingly, the PR author even advertised this work on HN.

ares623 74 days ago [-]

what’s stopping the author from maintaining their own fork i wonder?

kreetx 74 days ago [-]

Nothing!

Another question though when reading his blog: is he himself full AI? as in, not even a human writing those blog posts. Reads a bit like that.

spongebobism 74 days ago [-]

Presumably the LLM also wrote the blog post. At least, it generated a file named OCAML_DWARF_BLOG_POST.md: https://github.com/ocaml/ocaml/pull/14369/files#diff-bc37d03...

tsimionescu 72 days ago [-]

Funnily enough, people have been asking themselves this question about this author for at least 17 years!

https://old.reddit.com/r/programming/comments/674d1/joel_rey...

IsTom 74 days ago [-]

Either a regular bot or a flesh bot, doesn't really matter at that point, does it?

kreetx 73 days ago [-]

Maybe you're tongue in cheek, but if not, then it matters by discrediting this person, for accepting code from him etc. Anyone can write a blog post now on pretty much whatever topic without actually understanding what is being said, so these are essentially just prompt replies - adding nothing new to the world nor showing that the author is knowledgeable on the topic.

joelreymont 72 days ago [-]

I don't always use OCaml (meme coming in 1...2...3) and maintaining a fork is a significant undertaking.

More importantly, being able to debug native OCaml binaries and actually see source code, values of variables, etc. is something that's useful to everyone.

Looking at assembler instead of source code sucks unless you are reverse-engineering.

SirHumphrey 71 days ago [-]

Why are you submitting a PR if you do not use the software? You could just as easily donate the money that you spent producing 13k LOC to the project and they would spend it to use Claude on things that need to be fixed or just pay themselves to fix things manually.

This way there were hours, kwh, and dollars wasted on something that will be of no use to anyone.

joelreymont 71 days ago [-]

With all due respect, try to read things before opining on them.

The PR explains why I did the work.

kylereeve 73 days ago [-]

no clout

pityJuke 74 days ago [-]

Your link doesn’t work when logged out because it’s to the edit page. s/edit/item

Havoc 73 days ago [-]

> Beats me. AI decided to do so and I didn't question it.

A full on disengage brain vibe coder. Amazing

owenversteeg 72 days ago [-]

I've seen a lot of AI-generated PRs but I think this one is actually a very unique and interesting case. Most of these are written by novices, don't work, are for less-technical projects, and there isn't any real conversation or changing opinions. This was completely different; it was complex and actually worked, the poster Joel Reymont has 30 years of software experience and not exactly on simple bullshit either (from what I can tell, he was writing device drivers 20 years ago and had an HN account "wagerlabs" since 2008.) There was a real discussion here (the OCaml maintainers had an impressive amount of patience!) and the poster eventually laid out his side coherently with a human-written comment and changed his mind about contributing to OSS with AI.

Don't get me wrong, I still think these AI-generated PRs are a total waste of time for everyone involved and a scourge on OSS maintainers. As it stands today I haven't seen any serious project that's able to use them productively. This PR was still 13k largely incomprehensible lines with several glaring errors. And yet this specific PR is still not like the others!

ramblerman 72 days ago [-]

He didn't even realize (and apparently doesn't care) that portions of the code were attributed to another author.

> Here's my question: why did the files that you submitted name Mark Shinwell as the author?

> Beats me. AI decided to do so and I didn't question it.

---

Maybe he is having some kind of mental episode, is trolling, or genuinely doesn't care. But I would hardly hold this up as an example of an intelligent attempt at an AI generated PR.

AnimalMuppet 72 days ago [-]

So, what, this is better than others? SMH...

philipwhiuk 73 days ago [-]

> P.S. Pushing my ambitions onto unsuspecting open-source communities was a mistake I won’t repeat. The best playground is always your own project.

In fairness, the author claims to have learned - quoting from his portfolio page

So... 1 down, 6.9 billion to go.

incognito124 73 days ago [-]

Hate to break it to you, but there are already over 8B people on the planet

collingreen 72 days ago [-]

But how many can afford Claude and chatgpt subs?

palmotea 73 days ago [-]

> You may think that the answer to that is to also automate the review process, or (more plausibly) to lower our quality standards: we can accept PRs based on simple/lightweight tests (themselves AI-generated), and if users find issues we can quickly use automated tools to fix them, basically having our users perform the testing work that is missing.

Our glorious AI-driven future in a nutshell.

franktankbank 73 days ago [-]

Everybody is dunking on this guy like he's some dopey protagonist in a movie, but you guys watched the movie. I think the interaction is pretty damn interesting. At least I see this interaction as "better" than the similar bug reports that have been discussed here (but I can't put my finger on why). If someone wants to contribute to OCaml I think they should read this issue to get a sense of how they work. Excellent communication from them and anyone could learn something about software professionalism. So I have to give kudos to the AI megaman for sparking the discussion and thought.

One thing I never really liked about professional software development is the way it can stall at big movements because we reject large PRs. Some stuff just won't happen if you have a simple heuristical position on this (IMO obviously).

luxcem 73 days ago [-]

> but I can't put my finger on why

For me it's the contrast between the absolute tone-deaf messages of PR author and the patience, maturity and guidance in maintainers' messages.

collingreen 72 days ago [-]

It's not that they won't do big changes. They clearly and politely said big changes should go through a design conversation with the maintainers first. This is extremely reasonable even if we assume maintaining code is free (it very much is not free!). It's amazing to me how nice they were AND this isn't the first slop PR he submitted to them!

thorn 72 days ago [-]

I want to contribute to Ocaml now. Code owners are so polite. They spend their time to respond with clarity and humility. And yet this guy is trying so hard to troll and abuse their time and attention.

joelreymont 72 days ago [-]

They are super-polite! There's an issue with process, IMO, and changes taking too long to go through the pipeline. This is why Jane St forked OCaml and are maintaining their fork. They have way more money than the OCaml team at INRIA and can afford to move as fast as they want to while waiting for their changes to make it upstream (sometime or never).

cheald 73 days ago [-]

AI is great. Midwits with AI are dangerous. I've been saying for a long time that the failure mode for AI isn't the AI itself, but the humans using it, and the better the AI gets, the more I think that's borne out.

bndr 74 days ago [-]

Oh wow, that was painful to read, I especially liked this analysis part:

> Different naming conventions (DW_OP_* vs DW_op_*)

collingreen 72 days ago [-]

Clearly not copied! Look at the case difference! Duh!

nlawalker 73 days ago [-]

Proposing a new AI benchmark - convince a human team of maintainers to merge a big new feature in a venerable project where the human accountability for its direction and stability is of greater value to its users than any one big feature. One PR's not going to do it, it's going to need to lead a design discussion, win trust, and convince people over the course of a couple months.

armchairhacker 74 days ago [-]

OP’s code (at least plausibly) helped him. From https://github.com/ocaml/ocaml/pull/14369#issuecomment-35568...

> Damn, I can’t debug OCaml on my Mac because there’s no DWARF info…But, hey, there’s AI and it seems to one-shot fairly complex stuff in different languages, from just a Github issue…My needs are finally taken care of!

So I do believe using an LLM to generate a big feature like OP did can be very useful, so much that I’m expecting to see such cases more frequently soon. Perhaps in the future, everyone will be constantly generating big program/library extensions that are buggy except for their particular usecase, could be swapped with someone else’s non-public extensions that they generated for the same usecase, and must be re-generated each time the main program/library updates. And that’s OK, as long as the code generation doesn’t use too much energy or cause unforeseen problems. Even badly-written code is still useful when it works.

What’s probably not useful is submitting such code as a PR. Even if it works for its original use-case, it almost certainly still has bugs, and even ignoring bugs it adds tech debt (with bugs, the tech debt is significantly worse). Our code already depends on enough libraries that are complicated, buggy, and badly-written, to the extent that they slow development and make some feasible-sounding features infeasible; let’s not make it worse.

luxcem 73 days ago [-]

The whole issue, as clearly explained by the maintainers, isn't that the code is incorrect or not useful, it's the transfer of the burden of maintaining this large codebase to someone else. Basically: “I have this huge AI-generated pile of code that I haven't fully read, understood, or tested. Could you review, maintain, and fix it for me?”

squigz 74 days ago [-]

> cause unforeseen problems

This is literally the point of having software developers, PR reviews, and other such things. To help prevent such problems. What you're describing sounds like security hell, to say nothing of the support nightmare.

armchairhacker 74 days ago [-]

The point is that one-off LLM-generated projects don’t get support. If a vibe-coder needs to solve a problem and their LLM can’t, they can hire a real developer. If a vibe-coded project gets popular and starts breaking, the people who decided to rely on it can pool a fund and hire real developers to fix it, probably by rewriting the entire thing from scratch. If a vibe-coded project becomes so popular that people start being pressured or indirectly forced to rely on it, then there’s an issue; but I’m saying that important shared codebases shouldn’t have unreviewed LLM-generated code, it’s OK for unimportant code like one-off features.

And people still shouldn’t be using LLM-generated projects when security or reliability is required. For mundane tasks, I can’t imagine worse security or reliability consequences from those projects, than existing projects that use small untrusted dependencies.

squigz 74 days ago [-]

> The point is that one-off LLM-generated projects don’t get support.

Just sounds like more headaches for maintainers and those of us who provide support for FOSS. 5 hours into trying to pin down an issue and the user suddenly remembers they generated some code 3 years ago.

> If a vibe-coder needs to solve a problem and their LLM can’t, they can hire a real developer. If a vibe-coded project gets popular and starts breaking, whoever decides to use it can pool a fund to hire real developers to fix it, probably by rewriting the entire thing from scratch.

Considering FOSS already has a funding problem, you seem very optimistic about this happening.

mrguyorama 72 days ago [-]

But none of that matters.

If LLMs can one shot a mostly working patch of some sort for your use case, and you can't be assed to take the effort to go through it and make sure it's rock solid and up to spec, then do not submit a PR with that code because that's stupid, and literally any other human being with a claude subscription can also one shot a mostly working patch for their needs

AI PRs are worthless, because if they are that good, nobody needs to share anything anymore anyway! If they aren't that good, they are spam.

The reason people keep committing giant LLM PRs is that they are deluded and morons, and somehow believe that both their ideas are magically important, LLMs trivially turn those ideas into quality output, and somehow nobody else can do that as well.

It's just ego. Believing that only YOU can contribute something produced by a machine that takes natural human language as input is asinine. Anyone can produce it. And if anyone can produce it, nobody needs YOU to submit a PR.

If you prompted an LLM to produce code, then so can the maintainers of the project. Why are you so full of yourself that you think they require you to generate a PR for them? Do you think OSS programmers don't know how to use LLMs?

jacquesm 72 days ago [-]

I agree fully and I think it can be condensed quite a bit further: you get paid to code, so code. And if it is free work for instance in an open source context realize that dumping trash into the workflow has a negative cost so the effect is much the same, even if you didn't get paid others also don't get paid to review your junk.

Cthulhu_ 73 days ago [-]

> Even badly-written code is still useful when it works.

Sure, just as long as it's not used in production or to handle customer or other sensitive data. But for tools, utilities, weekend hack projects, coding challenges, etc by all means.

pepoluan 71 days ago [-]

The statement preceding your quote is more telling:

> as long as the code generation doesn’t use too much energy or cause unforeseen problems.

A badly-written code can be a time bomb, just waiting for the right situation to explode.

And also, using LLM to generate garbage requires so much energy.

armchairhacker 73 days ago [-]

Exactly.

And yeah, people will start using AI for important things it’s not capable of…people have already started and will continue to do so regardless. We should find good ways for people to make their lives easier with AI, because people will always try to make their lives easier, so otherwise they’ll find bad ways themselves.

bsder 74 days ago [-]

Can we please go back to "You have to make an account on our server to contribute or pull from the git?"

One of the biggest problems is the fact that the public nature of Github means that fixes are worth "Faux Internet Points" and a bunch of doofuses at companies like Google made "social contribution" part of the dumbass employee evaluation process.

Forcing a person to sign up would at least stop people who need "Faux Internet Points" from doing a drive-by.

fhd2 74 days ago [-]

Fully agree, luckily I don't maintain projects on GitHub anymore, but it used to be challenging long before LLMs. I had one fairly questionable contribution from someone who asked me to please merge it because their professor tasked them to build out a GitHub profile. I kinda see where the professor was coming from, but that wasn't the way. The contributor didn't really care about the project or improving it, they cared about doing what they were told, and the quality of the code and conversation followed from that.

There's many other kinds of questionable contributions. In my experience, the best ones are from people who actively use the thing, somewhat actively engage in the community (well, tickets), and try to improve the software for themselves or others. From my experience, GitHub encourages the bad kind, and the minor barriers to entry posed by almost any other contribution method largely deters them. As sad as that may be.

sph 73 days ago [-]

I am strongly considering abandoning Github for tarball + email to send git patches to.

No centralisation of my code in siloes like Github, I won't have to care about bots making hundreds of requests on my self-hosted Gitea instance, would prove to be a noticeable source of friction to vibe coders, and I don't care about receiving tons of external contributions from whomever.

For serious people, it'll only be a matter of running `git format-patch` and sending me an attachment via email.

dijksterhuis 74 days ago [-]

i’ve been quite happy moving over to gitlab as much as i can.

fewer people have a gitlab account — instant “not actually interested in helping” filter.

eestrada 73 days ago [-]

I haven't had to deal with this in open source, but I have had to deal with coworkers posting slop for code reviews where I am the assigned reviewer.

I've noticed that slop code has certain tell tale markers (such as import statements being moved for no discernible reason). No sane human does things like this. I call this "the sixth finger of code." It's important to look for these signs as soon as possible.

Once one is spotted, you can generally stop reading; you are wasting your time since the code will be confusing and the code "creator" doesn't understand the code any better than you do. Any comments you post to correct the code will just be fed into an LLM to generate another round of slop.

In these situations, effort has not been saved by using an LLM; it has at best been shifted. Most likely it has been both shifted and inflated, and you bear the increased cost as the reviewer.
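
The "moved imports" tell described above can be approximated mechanically. What follows is only a rough sketch of one possible heuristic, not a tool anyone in this thread uses: the function names are hypothetical, it handles Python sources only, and it flags the narrow case where the same import statements appear in a different order. Imports relocated without changing their relative order, and prose lines that happen to start with "from ", are not handled.

```python
def import_lines(source):
    """Return the import statements of a Python source string, in file order."""
    return [
        line.strip()
        for line in source.splitlines()
        if line.strip().startswith(("import ", "from "))
    ]


def imports_only_reordered(old_source, new_source):
    """True when both versions hold the same imports but in a different order."""
    old, new = import_lines(old_source), import_lines(new_source)
    return old != new and sorted(old) == sorted(new)


# Hypothetical before/after versions of a file under review.
before = "import os\nimport sys\n\nprint(sys.argv)\n"
after = "import sys\nimport os\n\nprint(sys.argv)\n"

print(imports_only_reordered(before, after))
```

A hit from a check like this is at most a prompt to look closer; formatters and linters reorder imports for perfectly good reasons too.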

aschla 72 days ago [-]

The telltale for me is the excessive comments. No reasonable human being would do all that extra, redundant work.

pepoluan 71 days ago [-]

"AI has a deep understanding" is very oxymoronic, especially if the "AI" being used was an LLM.

joelreymont 72 days ago [-]

I'm the author of the PR.

No, I'm not AI or bot, etc. Yes, my resume is genuine and is even more weird than what was listed (see https://joel.id/resume). Oh, and I live in Kyiv.

As for the PR itself, it was a PR stunt that I regret now as the code works and solves a real problem (at least for me!). I'll probably redo it, once I have spare Claude $$$ which I'm using for other projects now (https://joel.id/build-your-dreams/).

My motivation was to use the free $1000 of Claude credits for the greater good, as well as to try to push AI to its limits. It has worked out splendidly so far, my regrettable dumping of that huge PR on OCaml maintainers notwithstanding. For example, I'm having Claude write me a Lisp compiler from scratch, as well as finish a transpiler.

Last but not least, I think AI will write your next compiler and I write about it here https://joel.id/ai-will-write-your-next-compiler/

P.S. I'll try to answer the questions while I'm waiting for my Claude daily limits to reset...

frou_dh 72 days ago [-]

Sounds like you haven't learned your lesson and are still in mania.

biorach 72 days ago [-]

Tip:

A list compiler should be relatively straightforward, as these things go. If you get the AI to write it you should actually read it, all of it, and understand it, to the point where you can add features and fix bugs yourself. There are many many resources on the subject. Only after this should you consider contributing to open source projects. And even then you need to be able to read and understand your contributions

joelreymont 72 days ago [-]

Are you speaking from experience?

Have you actually tried writing a "list" compiler?

mxschumacher 72 days ago [-]

you are giving a new meaning to the term "PR stunt"

misnome 72 days ago [-]

Or at least swapping out something else for the first two letters of "stunt"

mudkipdev 72 days ago [-]

What made you become interested in AI (vibe coding?) with already such an impressive resume?

joelreymont 72 days ago [-]

Thank you! It was completely unexpected, actually. I was stuck with upgrading XLA [1] and my boss gently pushed me into using ChatGPT. I wish I had used Claude instead.

After that, I found myself with $1000 in Claude credits and decided to go to town, making mistakes along the way.

[1] https://github.com/elodin-sys/elodin/pull/219

lkey 71 days ago [-]

Genuinely sociopathic to happily admit that you used the good faith and labour of others for self-aggrandizement. Doubly so when you lack the social grace and understanding to comprehend how bad you come off in every exchange.

Smiles, exclamations, and faux-interest won't prevent people from noticing you are utterly inconsiderate and self-obsessed. Though they may be too polite to say it to your face.

ochronus 74 days ago [-]

Kudos to the folks in the thread!

johneth 73 days ago [-]

This is where a quick "kindly fuck off" response would save a lot of time for everyone involved.

phendrenad2 73 days ago [-]

I'd be interested to see how AI code review would do with this PR. This would be a great test to see if AI code review can properly identify the concerns that the humans have here (way too much code, PR creator can't answer basic questions about it, strange copyright header mentioning someone unrelated, etc.) I'll bet AI code review would fail miserably, only focusing on how the PR is formatted and if it "looks" like a typical PR (which, was also the AI's goal when creating it).

joelreymont 72 days ago [-]

It wouldn't do much.

I find that ChatGPT 5.1 was much better at reviewing this code than writing it so I had it review Claude's output until the review was clean.

This is in addition to making sure existing and newly generated compiler tests pass and that the output in the PR / blog post is generated by actually running lldb through its paces.

I did have an "Oh, shit!" moment after I posted a nice set of examples and discovered that the AI made them up. At least it honestly told me so!

pepoluan 71 days ago [-]

LLMs will guiltlessly produce a hallucinated 'review', because LLMs do NOT 'understand' what they are writing.

LLMs will merely regurgitate a chain of words -- tokens -- that best match its Hidden Markov Model chains. It's all just a probabilistic game, with zero actual understanding.

LLMs are even known to hide or fake unit test results: claiming success when the tests fail, or skipping the results completely. Why? Because based on the patterns they have seen, the most likely words to follow "the results of tests" are "all successful". Why? Because they try to reproduce other PRs they have seen, PRs where the PR author actually performed tests on their own systems first, iterating multiple times until the tests succeed, so the PRs that the public sees are almost invariably PRs with the declaration that "all tests pass".

I'm quite certain that LLMs never actually tried to compile the code, much less run Test Cases against them. Simply because there is no such ability provided in their back-ends.

All LLMs can do is "generate the most probabilistically plausible text". In essence, a Glorified AutoComplete.

I personally won't touch code generated wholly by an AutoComplete with a 10-foot pole.

bwfan123 73 days ago [-]

Brandolini's law in action. Developer drunk on AI-koolaid dumps large swath of code which seemingly works, and consumes hours of reviewer time and energy refuting it.

Sad part of this is that short-term the code may work, but long-term it leads to rot. Incentives at orgs are short-term oriented. If you won't be around to clean things up when shit hits the fan, why not let AI do all the code?

guluarte 73 days ago [-]

+13,323 lines of AI code, fucking nightmare

heldrida 73 days ago [-]

I just can’t…

Welcome to 2025!

bravetraveler 74 days ago [-]

"Challenge me on this" while meaning "endure the machine, actually"

I guess the proponents are right. We'll use LLMs one way or another, after all. They'll become one.

fzeroracer 74 days ago [-]

"Challenge me on this"

Five seconds later when challenged on why AI did something

"Beats me, AI did it and I didn't question it."

Really embarrassing stuff all around. I feel bad for open source maintainers.

coffeebeqn 73 days ago [-]

Even if it was in good faith the offer is “ask me a question and I’ll type it into a publicly available LLM”. Wow what a once in a lifetime opportunity!

sebast_bake 74 days ago [-]

rip

xtracto 73 days ago [-]

This won't be a popular opinion here but, this resistance to and skepticism of AI code, and of the people making it, smells to me very similar to the stance I see from some developers that have this belief that people from other countries CANNOT be as good as them (like, saying that outsourcing or hiring people from developing countries will invariably bring low[er] quality code).

Feels a bit like snobbism and projection of fear that what they do is becoming less valuable. In this case, how DARE a computer program write such code!

It's interesting how this is happening. And in the future it will be amazing seeing the turning point when the machine-generated code cannot be ignored.

Kind of like chess/Go players: First they laughed at a computer playing chess/Go, but now, they just accept that there's NO way they could beat a computer, and keep playing other humans for fun.

Anamon 73 days ago [-]

This would be fine if LLMs generated quality code, which they don't. Anything beyond trivial and boilerplate code is either riddled with errors or copied almost verbatim. None of these systems are able to even remotely do what a competent developer does.

Despite the PR author's claims, LLMs have no, and can't have any, understanding of the code. Especially when you start talking about architecture, robustness, security, etc. And those are the really challenging parts. Coding is 10% of a developer's job, and it's usually the easiest part. If reasonably used LLM tools can help developers code, awesome. But that part was never the problem or the bottleneck.

The chess/Go analogy doesn't work, because those are games that have set rules and winning conditions. Algorithms can work with that, that's why they beat humans. The "winning conditions" of software development are notoriously tricky to get right and often impossible to perfectly formulate. If they weren't, natural language programming might be a viable path. Dijkstra knew in the 70s that it can't be.[1]

Generated code can already not be ignored, but I don't think it's for the reasons implied. Someone here mentioned Brandolini's Law. We can't ignore it for the same reason we can't ignore spam e-mails. They're too easy and cheap to produce, and practically none of what's produced has any real value or quality. We can't ignore the code because it's threatening to make an already worrying crisis of QA and security in software development even worse.

[1] https://www.cs.utexas.edu/~EWD/transcriptions/EWD06xx/EWD667...

joelreymont 72 days ago [-]

This is an excerpt from the session where AI is writing my Lisp compiler. What do you call this? I call this doing what a competent developer does!

39/40 tests pass. The native reader works for integers, hexadecimal, lists, strings and quote forms. The one failure is symbol comparison (known limitation).

  Based on the context summary and the user's note about cleanup, I should:
  1. Continue Phase 3.5 self-compilation
  2. Clean up the repo structure

  Let me first update the todo list and then investigate the SIGKILL issue more
  thoroughly. The issue is that combining reader source with file I/O code causes
   the executable to be killed. Let me check if buffer-to-string with reader
  works:

 Let me test specifically reader + file I/O combined (which is what the
  self-hosting test needs):

q3k 71 days ago [-]

Sounds to me like someone roleplaying being a developer. Never in my career have I seen someone think/reason/act like this.

cesarb 71 days ago [-]

> Sounds to me like someone roleplaying being a developer.

That's because that's precisely how LLMs work. They complete a text where two actors (the "user" and the "assistant"), or sometimes three actors (the "user", the "assistant", and the "tools"), are engaging in a conversation (in this case, about software development). It's like a theatre script.

ares623 73 days ago [-]

AI-powered programmers have all the tools, freedom, investment(!) they need _now_ to start their own open source projects or forks without having to subject themselves to outdated meat-based reviewers.

I say they should “walk the talk”

KurSix 73 days ago [-]

The chess analogy is fundamentally flawed. In chess you don't have to maintain your moves - you make a move, and it's done. In engineering code isn't the end of the game, it's the start of a liability.

Code is read 10x more often than it is written. A programmer's primary job isn't "making the computer do X," but "explaining to other programmers (and their future self) why the computer should do X." AI generates syntax, but it lacks intent.

Refusing to accept such code isn't snobbery or fear. It's a refusal to take ownership of an asset that has lost its documentation regarding provenance and meaning.

pjc50 73 days ago [-]

Except it's the other way round: the poor quality is evident up front, and "they used AI" is an inference for why the quality is poor.

bdbdbdb 74 days ago [-]

No it does not. AI does not understand anything at all. It is a word prediction engine

djoldman 74 days ago [-]

Maintainers and repo owners will get where they want to go the fastest by not referring to what/who "generated" code in a PR.

Discussions about AI/LLM code being a problem solely because it is AI/LLM-generated are not generally productive conversations.

Better is to critique the actual PR itself. For example, needs more tests, needs to be broken up, doesn't follow our protocols for merging/docs, etc.

Additionally, if there isn't a code of conduct, AI policy, or, perhaps most importantly, a policy on how to submit PRs and which are acceptable, it's a huge weakness in a project.

In this case, clearly some feathers were ruffled but cool heads prevailed. Well done in the end.

rogerrogerr 74 days ago [-]

AI/LLMs are a problem because they create plausible looking code that can pass any review I have time to do, but doesn’t have a brain behind it that can be accountable for the code later.

As a maintainer, it used to be I could merge code that “looked good”, and if it did something subtly goofy later I could look in the blame, ping the guy who wrote it, and get a “oh yeah, I did that to flobberate the bazzle. Didn’t think about when the bazzle comes from the shintlerator and is already flobbed” response.

People who wrote plausible looking code were usually decent software people.

Now, I would get “You’re absolutely right! I implemented this incorrectly. Here’s a completely different set of changes I should have sent instead. Hope this helps!”

chii 74 days ago [-]

> doesn’t have a brain behind it that can be accountable for the code later.

the submitter could also bail just as easily. Having an AI make the PR or not makes zero difference for this accountability. Ultimately, the maintainer pressing the merge button is accountable.

What else would your value be as a maintainer, if all you did was a surface look, press merge, then find blame later when shit hits the fan?

jodrellblank 73 days ago [-]

Even if you couldn't contact the submitter again, you could find all their past submissions to review, or expect that their more recent submissions have improved from experience, or block them from all future contributions. AI stops all that - every submission is disconnected from the others, there is no single learning person with an arrow of time and a chronological life experience behind the submissions, but there also isn't a single person to block if they never change.

> "if all you did was a surface look, press merge"

As per the old joke, surface look: $5

Years of experience learning what to look for: $995

In the past a block of code that has jarring flaws says the author was likely low skill, or careless. People can fake competence but it's a low return because ugly inconsistent code with no comments and no error checking which (barely) works will keep someone employed and paid, more than pretty code which doesn't work at all will. Writing pretty code which also works implies knowledge, care, eye for detail, effort, tooling, which implies the author will have put some of that into solving the problem. AI can fake all the quick indicators of competence without the competence, meaning the surface look is less useful.

> "What else would your value be as a maintainer"

Is the maintainer paid or unpaid? If they are paid, the value is to make sure the software works and meets the business standards. If they are unpaid, what is the discussion about "value" at all? Maybe to keep it from becoming wildly broken, or maybe yes to literally be the person who presses merge because somebody has to.

ares623 74 days ago [-]

If I had a magic wand I would wish for 2 parallel open source communities diverging from today.

One path continues on the track it has always been on, human written and maintained.

The other is fully on the AI track. Massive PRs with reviewers rubber stamping them.

I’d love to see which track comes out ahead.

Edit: in fact, perhaps there are open source projects already fully embracing AI authored contributions?

ctenb 74 days ago [-]

I agree. It would also work out like a long term supervised learning process though. Humans showing how it's really done, and AI companies taking that as a gold standard for training and development of AI.

ares623 74 days ago [-]

I'm not so sure. There's already decades of data available for the existing process.

ctenb 74 days ago [-]

That is true, but it doesn't help for new languages, frameworks, etc

jebarker 74 days ago [-]

How would you define “ahead”?

forgetfulness 74 days ago [-]

Able to make changes preserving correctness over time

Vibecoding reminds me sharply of the height of the Rails hype, products quickly rushed to market off the backs of a slurry of gems and autoimports inserted on generated code, the original authors dipping and teams of maintainers then screeching to a halt

Here the bots will pigheadedly heap one 9000-line PR onto another, shredding the code base to bits but making it look like a lot of work in the process

jebarker 74 days ago [-]

Yes, preserving correctness seems like a good metric. My immediate reaction was to think that the parent comment was saying they’d like to see this comparison because AI will come out ahead. On this metric and based on current AI coding it’s hard to see that being the case or even possible to verify.

rogerrogerr 74 days ago [-]

I don’t accept giant contributions from people who don’t have track records of sticking around. It’s faster for me to write something myself than review huge quantities of outsider code as a zero-trust artifact.

armchairhacker 74 days ago [-]

I agree, but @gasche brings up real points in https://github.com/ocaml/ocaml/pull/14369#issuecomment-35565.... In particular I found these important:

- Copyright issues. Even among LLM-generated code, this PR is particularly suspicious, because some files begin with the comment “created by [someone’s name]”

- No proposal. Maybe the feature isn’t useful enough to be worth the tech debt, maybe the design doesn’t follow conventions and/or adds too much tech debt

- Not enough tests

- The PR is overwhelmingly big, too big for the small core team that maintains OCaml

- People are already working on this. They’ve brainstormed the design, they’re breaking the task into smaller reviewable parts, and the code they write is trusted more than LLM-generated code

Later, @bluddy mentions a design issue: https://github.com/ocaml/ocaml/pull/14369#issuecomment-35568...

williamdclt 74 days ago [-]

> Better is to critique the actual PR itself. For example, needs more tests, needs to be broken up, doesn't follow our protocols for merging/docs, etc.

They did: the main point being made is "I'm not reading 13k LOCs when there's been no proposal and discussion that this is something we might want, and how we might want to have it implemented". Which is an absolutely fair point (there's no other possible answer really, unless you have days to waste) whether the code is AI-written or human-written.

Anamon 73 days ago [-]

Exactly, this seems a bit overlooked in this discussion. A PR like this would NOT have been okay even if there was no LLM involved.

It reminds me of a PR I once saw (don't remember which project) in which a first-time contributor opened a PR rewriting the project's entire website in their favourite new framework. The maintainers calmly replied to the effect of, before putting in the work, it might have been best to quickly check if we even want this. The contributor liked the framework so much that I'm sure they believed it was an improvement. But it's the same tone-deafness I now see in many vibe coders who don't seem to understand that OSS projects involve other people and demand some level of consensus and respect.

pepoluan 71 days ago [-]

I am one of the maintainers of aiosmtpd [1], and the largest PR I ever made was migrating the library's tests from nosetest to pytest. Before doing that, though, I discussed with the other maintainers if such a migration is welcome. And after getting support from them, I made the changes with gusto. It took weeks, even months to complete and the PR is massive [2]

But still the crux of the matter is: Massive changes require buy-in from other maintainers BEFORE the changes even start.

[1] https://github.com/aio-libs/aiosmtpd [2] https://github.com/aio-libs/aiosmtpd/pull/202

snickerbockers 74 days ago [-]

I don't suppose you saw the post where OP asked claude to explain why this patch was not plagiarized? It's pretty damning.

orwin 74 days ago [-]

I think that's probably the most beautiful AI-generated post that was ever generated. The fact that he posted it shows that either he didn't read it, didn't understand it, or thought it would be fun to show how the AI implementation was inferior to the one it was 'inspired' from.

lambda_foo 74 days ago [-]

Why have the OP in the loop at all if he’s just sending prompts to AI? Surely it’s a wonderful piece of performance art.

footy 74 days ago [-]

it reads like humiliation fetish material honestly. I'd delete my account but he just doubles down.

pluc 73 days ago [-]

He's doing it elsewhere too:

https://github.com/rerun-io/rerun/pull/11900#issuecomment-35...

https://github.com/ocaml/dune/issues/12731

https://github.com/tshort/StaticCompiler.jl/pull/180

Seems he's just on a rampage of "fixing" issues for trendy packages to get some attention.

footy 73 days ago [-]

> I like a tough challenge and I was hoping to attract your attention.

thanks for the comedy material.

joelreymont 72 days ago [-]

I had $1000 in Claude credits to spend for the greater good.

snickerbockers 71 days ago [-]

Personally I would have used those credits to generate hentai but to each his own I suppose.

In the post where you had it respond to accusations of plagiarism and it responded by posting snippets of code which were obviously plagiarized and confidently asserted that they were not, what was your prompt? I ask because I felt its response was oddly tone-deaf even by LLM standards. I'm guessing that instead of giving it a neutral prompt such as "respond to this comment" you gave it something more specific such as "defend yourself against these accusations"?

I'm used to seeing them contradict themselves and say things that are obviously not true but usually when confronted they will give in and admit their mistake rather than dig a deeper hole.

pluc 72 days ago [-]

You didn't

abathologist 73 days ago [-]

For example "cites a different person as an author, who happened to have done all the substantive work on a related code base". ;)

shizzy0 73 days ago [-]

I think it's deeply disadvantageous and legally dubious to accept code whose provenance you don't know.

74 days ago [-]

stefantalpalaru 74 days ago [-]

[dead]
