this post was submitted on 14 May 2026
190 points (96.6% liked)
Technology
84646 readers
4308 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Have you tried giving it coding standards and other such preferences about how you like your code to be organized? I've found that coding agents can be quite adaptable to various styles, you can put stuff like "try to keep functions less than 100 lines long" or "include assertions validating all function inputs" into your coding agent's general instructions and it'll follow them.
For me, one of the things that's a huge fundamental improvement is telling the agent to create and run unit tests for everything. That way when it does mess up accidentally it can immediately catch the problem and usually fixes it in the same session without further intervention. Unit tests used to be more trouble than they were worth most of the time, now I love them.
I'll say that during a recent week where I was forced to use an LLM, I found Claude Opus to be extremely poor at referencing this guide: https://mywiki.wooledge.org/BashPitfalls
it took almost an hour to get Claude to write me a shell script which I considered to be of acceptable quality. It completely hallucinated about several of the points in that guide, requiring me to just go read the guide myself to verify that the language model was falsifying information. That same task would have taken me about 5 minutes.
I believe that GIGO applies here. 99% of shell scripts on the internet are unsafe and terrible (looking at you,
set -euo pipefail), and Claude is much more likely to generate god awful garbage because of the inherent bias present in the training data.And as for unit tests? Imo, anything other than property-based testing is irrelevant. If you're using something like Pydantic, you can auto-generate a LOT of your tests using the rich type annotations available in that library along with hypothesis. I tend to write a testing framework once, and then special case property tests for things that fall outside of my models. None of this is super helpful for big ugly codebases with a lot of inertia around practices, but that's not been my environment, thankfully.
You... just started writing unit tests?
No, I've used them plenty before. I just found them to generally be a huge hassle of minimal benefit. They became much more useful in the context of agentic coding, where you want the agent to be able to immediately realize "oh, this change I made causes these specific problems when it's run." The hassle is all on the agent, not on me.
I think we do very different development.
Could be. I'm a professional programmer whose usage runs the whole gamut - large applications with hundreds of programmers working on them for years, smaller apps that I make for my own use, and one-off scripts to do some particular task and then generally throw away afterwards.
I don't do unit tests for that last category, of course. I don't even use coding agents for those, generally speaking - a bit of back-and-forth in a chat interface is usually enough there.
Is this like a who's got a bigger portfolio situation? I'm not sure how to respond
I guess I've been developing for decades including consulting for Page 6, a stint in RD at Sony Music. One of my open source contributions was used as part of the backend for one of Obama's State of the Unions. I spend my time these days writing and maintaining multiple software stacks integrating across multiple platforms.
Since you brought up the notion that we might be doing different styles of development, I was giving you context as to the kinds of development that I do. Sounds like we might not be doing such different scales of development after all, but I couldn't have known that until you gave that information just now.
This isn't supposed to be some kind of duel or argument, I don't see the point of that. I'm just explaining my usage of coding agents and specifically unit tests in that context. Since that's what you were questioning.
I see it seemed more like a weird flex.
Anyways, I couldnt possibly deploy with any confidence a large project or honestly a small project I expected someone to rely on without layers of test. Unintended consequences of even a small change are just a reality. And with the expectation to move quick with large legacy systems, if you don't have tests that's a dangerous high wire act.
Well, I've seen large projects without extensive unit tests before. The main time I remember a big project with them before coding agents they were largely a checkbox that developers implemented with a grumble when first deploying a new system and then that were slowly disabled one by one as later changes broke them.
These were stand-alone projects, though, with a large QA department and without an expectation of future versions directly descended from them once deployed. If it worked then it worked, that was all that was needed at the end of the day.
never had a large qa team. And my experience has when we have qa resources, people move to the new feature so it's up to the developers to not break the critical features everyone forgets about until they break. And I've yet to meet a developer that has time to also be a full time qa resource
I meant my first sentence to be an apology for jumping to conclusions but it clearly isn't. It's late. Sorry for the snarky response.
Wow what a circlejerk this turned into.
Oh well, I guess that's what everything really is the whole time.
Sounds like you were writing bad unit tests and AI showed you how to do it right.
If so, it was project-wide across hundreds of devs.