Technology


AI-generated code contains more bugs and errors than human output (www.techradar.com)

submitted 1 month ago by throws_lemy@reddthat.com to c/technology@lemmy.world


[–] Deestan@lemmy.world 40 points 1 month ago (18 children)

I've been coding for a while. I made an honest, eager attempt at building a real, functioning thing with all code written by AI: a Breakout clone using SDL2, with music.

The game should look good, play good, have cool effects, and be balanced. It should have an attractor screen, scoring, a win state and a lose state.

I also required the code to be maintainable, meaning I should be able to look at every single line and understand it well enough to defend its existence.

I did make it work. And honestly Claude did better than expected. The game ran well and was fun.

But: The process was shit.

I spent 2 days and several hundred dollars to babysit the AI, to get something I could have done in 1 day including learning SDL2.

Everything that turned out well, turned out well because I brought years of skill to the table, and could see when Claude was coding itself into a corner and tell it to break up code into modules, collate globals, remove duplication, pull out abstractions, etc. I had to detect all that and instruct it on how to fix it. Until I did, it was adding and re-adding bugs because it had made so much shittily structured code that it was confusing itself.

TL;DR: An LLM can write maintainable code if given full, constant attention by a skilled coder, at 40% of the coder's speed.

[–] justaman123@lemmy.world 1 points 1 month ago (9 children)

It would be really interesting to watch a video of this process. Though I'm certain it would be pretty difficult to pull off the editing.

[–] riskable@programming.dev 3 points 1 month ago (4 children)

You want to see someone using, say, VS Code to write something with, say, Claude Code?

There's probably a thousand videos of that.

More interesting: I watched someone who was super cheap trying to use multiple AIs to code a project because he kept running out of free credits. Every now and again he'd switch accounts and use up those free credits.

That was an amazing dance, let me tell ya! Glorious!

I asked him which one he'd pay for if he had unlimited money and he said Claude Code. He has the $20/month plan but only uses it in special situations because he'll run out of credits too fast. $20 really doesn't get you much with Anthropic 🤷

That inspired me to try out all the code assist AIs and their respective plugins/CLI tools. He's right: Claude Code was the best by a HUGE margin.

Gemini 3.0 is supposed to be nearly as good but I haven't tried it yet so I dunno.

Now that I've said all that: I am severely disappointed in this article because it doesn't say which AI models were used. In fact, the study authors don't even know what AI models were used. So it's 430 pull requests of random origin, made at some point in 2025.

For all we know, half of those could've been made with the Copilot gpt5-mini that everyone gets for free when they install the Copilot extension in VS Code.

[–] justaman123@lemmy.world 3 points 1 month ago (1 children)

It's more that I want to see experienced coders explaining the mistakes that AI coding typically makes. I have very little experience and see it as a good learning opportunity. You're probably right about there being tons of videos like that.

[–] riskable@programming.dev 3 points 1 month ago (1 children)

The mistakes it makes depend on the model and the language. GPT5 models can make horrific mistakes, though, where they randomly remove huge swaths of code for no reason. Every time it happens I'm like, "what the actual fuck?" Undoing the last change and trying again usually fixes it though 🤷

They all make horrific security mistakes quite often. Though, that's probably because they're trained on human code that is *also* chock full of security mistakes (former security consultant, so I'm super biased on that front haha).

[–] architect@thelemmy.club 1 points 1 month ago

Oh, GPT def does that lol.

Even replaces large bits with just a …

But I don’t use it to rewrite code. I use projects to load everything into it and just ask for pieces that I’ll edit and insert. There’s something about it that works with my ADHD in keeping track. It works well for me.
