Technology

59589 readers

2838 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

559

Microsoft Sued For AI Article Accusing Innocent Man of Sexual Misconduct (futurism.com)

submitted 5 months ago by catculation@lemmy.zip to c/technology@lemmy.world

51 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] dual_sport_dork@lemmy.world 226 points 5 months ago* (last edited 5 months ago) (40 children)

Say it with me again now:

For fact-based applications, the amount of work required to develop and subsequently babysit the LLM to ensure it is always producing accurate output is exactly the same as doing the work yourself in the first place.

Always, always, always. This is a mathematical law. It doesn't matter how much you whine or argue, or cite anecdotes about how you totally got ChatGPT or Copilot to generate you some working code that one time. The LLM does not actually have comprehension of its input or output. It doesn't have comprehension, period. It cannot know when it is wrong. It can't actually know anything.

Sure, very sophisticated LLM's might get it right some of the time, or even a lot of the time in the cases of very specific topics with very good training data. But its accuracy cannot be guaranteed unless you fact-check 100% of its output.

Underpaid employees were asked to feed published articles from other news services into generative AI tools and spit out paraphrased versions. The team was soon using AI to churn out thousands of articles a day, most of which were never fact-checked by a person. Eventually, per the NYT, the website's AI tools randomly started assigning employees' names to AI-generated articles they never touched.

Yep, that right there. I could have called that before they even started. The shit really hits the fan when the computer is inevitably capable of spouting bullshit far faster than humans are able to review and debunk its output, and that's only if anyone is actually watching and has their hand on the off switch. Of course, the end goal of these schemes is to be able to fire as much of the human staff as possible, so it ultimately winds up that there is nobody left to actually do the review. And whatever emaciated remains of management are left don't actually understand how the machine works nor how its output is generated.

Yeah, I see no flaws in this plan... Carry the fuck on, idiots.

[–] dependencyinjection@discuss.tchncs.de -1 points 5 months ago (13 children)

Simply false in my experience.

We use CoPilot at work and there is no babysitting required.

We are software developers / engineers and it’s saves countless hours writing boilerplate code, giving code blocks based on a comment, and sticking to our coding conventions.

Sure it isn’t 100% right, but the owner and lead engineer calculates it to be around 70% accurate and even if it misses the mark, we have a whole lot less key presses to make.

[–] HauntedCupcake@lemmy.world 17 points 5 months ago (9 children)

Using Copilot as a copilot, like generating boilerplate and then code reviewing it is still "babysitting" it. It's still significantly less effort than just doing it yourself though

[–] FarceOfWill@infosec.pub 10 points 5 months ago (2 children)

Until someone uses it for a little more than boilerplate, and the reviewer nods that bit through as it's hard to review and not something a human/the person who "wrote" it would get wrong.

Unless all the ai generated code is explicitly marked as ai generated this approach will go wrong eventually.

[–] admin@lemmy.my-box.dev 6 points 5 months ago

Unless all the ai generated code is explicitly marked as ai generated this approach will go wrong eventually.

Undoubtedly. Hell, even when you do mark it as such, this will happen. Because bugs created by humans also get deployed.

Basically what you're saying is that code review is not a guarantee against shipping bugs.

[–] HauntedCupcake@lemmy.world 1 points 5 months ago* (last edited 5 months ago)

Agreed, using LLMs for code requires you to be an experienced dev who can understand what it pukes out. And for those very specific and disciplined people it's a net positive.

However, generally, I agree it's more risk than it's worth

load more comments (6 replies)

load more comments (9 replies)

load more comments (35 replies)