this post was submitted on 08 May 2024

1724 points (99.2% liked)

Technology

85355 readers

4188 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

1724

Stack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPT (www.tomshardware.com)

submitted 2 years ago by misk@sopuli.xyz to c/technology@lemmy.world

457 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[–] Rooki@lemmy.world 342 points 2 years ago (3 children)

If this is true, then we should prepare to be shout at by chatgpt why we didnt knew already that simple error.

[–] snekerpimp@lemmy.world 250 points 2 years ago (6 children)

ChatGPT now just says “read the docs!” To every question

[–] Dave@lemmy.nz 196 points 2 years ago (1 children)

Hey ChatGPT, how can I ...

"Locking as this is a duplicate of [unrelated question]"

load more comments (1 replies)

[–] ekky@sopuli.xyz 55 points 2 years ago (1 children)

And then links to a similar sounding but ultimately totally unrelated site.

load more comments (1 replies)

load more comments (4 replies)

load more comments (2 replies)

[–] just_another_person@lemmy.world 226 points 2 years ago (24 children)

I got an email ban.

1609 hours logged 431 solved threads

load more comments (24 replies)

[–] Bell@lemmy.world 186 points 2 years ago (33 children)

Take all you want, it will only take a few hallucinations before no one trusts LLMs to write code or give advice

[–] sramder@lemmy.world 84 points 2 years ago (4 children)

[…]will only take a few hallucinations before no one trusts LLMs to write code or give advice

Because none of us have ever blindly pasted some code we got off google and crossed our fingers ;-)

[–] avidamoeba@lemmy.ca 85 points 2 years ago* (last edited 2 years ago) (3 children)

It's way easier to figure that out than check ChatGPT hallucinations. There's usually someone saying why a response in SO is wrong, either in another response or a comment. You can filter most of the garbage right at that point, without having to put it in your codebase and discover that the hard way. You get none of that information with ChatGPT. The data spat out is not equivalent.

load more comments (3 replies)

[–] Spedwell@lemmy.world 46 points 2 years ago (4 children)

We should already be at that point. We have already seen LLMs' potential to inadvertently backdoor your code and to inadvertently help you violate copyright law (I guess we do need to wait to see what the courts rule, but I'll be rooting for the open-source authors).

If you use LLMs in your professional work, you're crazy. I would never be comfortably opening myself up to the legal and security liabilities of AI tools.

load more comments (4 replies)

load more comments (31 replies)

[–] unreasonabro@lemmy.world 163 points 2 years ago (25 children)

See, this is why we can't have nice things. Money fucks it up, every time. Fuck money, it's a shitty backwards idea. We can do better than this.

[–] Colonel_Panic_@lemm.ee 47 points 2 years ago (2 children)

Hear me out. Bottle caps.

load more comments (2 replies)

load more comments (24 replies)

[–] cordlesslamp@lemmy.today 155 points 2 years ago (6 children)

So they pulled a "reddit"?

[–] sirboozebum@lemmy.world 98 points 2 years ago (5 children)

These companies don't realise their most engaged users generate a disproportionate amount of their content.

They will just go to their own spaces.

I think this a good thing in the long run, the internet will become decentralised again.

load more comments (5 replies)

[–] tearsintherain@leminal.space 145 points 2 years ago* (last edited 2 years ago) (5 children)

Reddit/Stack/AI are the latest examples of an economic system where a few people monetize and get wealthy using the output of the very many.

load more comments (5 replies)

[–] kibiz0r@midwest.social 141 points 2 years ago

First, they sent the missionaries. They built communities, facilities for the common good, and spoke of collaboration and mutual prosperity. They got so many of us to buy into their belief system as a result.

Then, they sent the conquistadors. They took what we had built under their guidance, and claimed we "weren't using it" and it was rightfully theirs to begin with.

[–] andrade@infosec.pub 133 points 2 years ago (10 children)

digging their own grave

load more comments (10 replies)

[–] neclimdul@lemmy.world 129 points 2 years ago (1 children)

Oh I didn't consider deleting my answers. Thanks for the good idea ~~Barbra~~ StackOverflow.

[–] TheGrandNagus@lemmy.world 51 points 2 years ago (10 children)

I'd be shocked if deleted comments weren't retained by them

load more comments (10 replies)

[–] zaphod@sopuli.xyz 96 points 2 years ago (7 children)

Letting corporations "disrupt" forums was a mistake.

load more comments (7 replies)

[–] bitchkat@lemmy.world 94 points 2 years ago (4 children)

Maybe we should replace Stack Overflow with another site where experts can exchange information? We can call it "Experts Exchange".

[–] bitfucker@programming.dev 90 points 2 years ago (8 children)

Expert Sex Change?

load more comments (8 replies)

[–] deddit@lemmy.world 69 points 2 years ago (1 children)

codidact ... Stack overflow had a mass exodus of mods a 2-3 years ago and a some of them made codidact.

load more comments (1 replies)

load more comments (2 replies)

[–] Churbleyimyam@lemm.ee 89 points 2 years ago (7 children)

At the end of the day, this is just yet another example of how capitalism is an extractive system. Unprotected resources are used not for the benefit of all but to increase and entrench the imbalance of assets. This is why they are so keen on DRM and copyright and why they destroy the environment and social cohesion. The thing is, people want to help each other; not for profit but because we have a natural and healthy imperative to do the most good.

There is a difference between giving someone a present and then them giving it to another person, and giving someone a present and then them selling it. One is kind and helpful and the other is disgusting and produces inequality.

If you're gonna use something for free then make the product of it free too.

An idea for the fediverse and beyond: maybe we should be setting up instances with copyleft licences for all content posted to them. I actually don't mind if you wanna use my comments to make an LLM. It could be useful. But give me (and all the other people who contributed to it) the LLM for free, like we gave it to you. And let us use it for our benefit, not just yours.

load more comments (7 replies)

[–] Agent641@lemmy.world 83 points 2 years ago (2 children)

Begun, the AI wars have.

Faces on T-shirts, you must print print. Fake facts into old forum comments, you must edit. Poison the data well, you must.

load more comments (2 replies)

[–] Jimmyeatsausage@lemmy.world 80 points 2 years ago (4 children)

You really don't need anything near as complex as AI...a simple script could be configured to automatically close the issue as solved with a link to a randomly-selected unrelated issue.

load more comments (4 replies)

[–] schnurrito@discuss.tchncs.de 80 points 2 years ago (2 children)

Messages that people post on Stack Exchange sites are literally licensed CC-BY-SA, the whole point of which is to enable them to be shared and used by anyone for any purpose. One of the purposes of such a license is to make sure knowledge is preserved by allowing everyone to make and share copies.

[–] kerrigan778@lemmy.world 106 points 2 years ago (8 children)

That license would require chatgpt to provide attribution every time it used training data of anyone there and also would require every output using that training data to be placed under the same license. This would actually legally prevent anything chatgpt created even in part using this training data from being closed source. Assuming they obviously aren't planning on doing that this is massively shitting on the concept of licensing.

load more comments (8 replies)

[–] 9point6@lemmy.world 64 points 2 years ago (1 children)

Share Alike

I can't wait to download my own version of the latest gpt model

load more comments (1 replies)

[–] 3volver@lemmy.world 78 points 2 years ago (4 children)

The enshittification is very real and is spreading constantly. Companies will leech more from their employees and users until things start to break down. Acceleration is the only way.

load more comments (4 replies)

[–] Fedizen@lemmy.world 71 points 2 years ago (6 children)

primary use for AI is self destructing your website.

load more comments (6 replies)

[–] tabular@lemmy.world 70 points 2 years ago (3 children)

I despise this use of mod power in response to a protest. It's our content to be sabotaged if we want - if Stack Overlords disagree then to hell with them.

I'll add Stack Overflow to my personal ban list, just below Reddit.

load more comments (3 replies)

[–] shotgun_crab@lemmy.world 68 points 2 years ago (1 children)

And the enshittification continues...

load more comments (1 replies)

[–] inset@lemmy.today 66 points 2 years ago (2 children)

I fully understand why they are doing this, but we are just losing a mass of really useful knowledge. What a shame...

load more comments (2 replies)

[–] Hypx@fedia.io 66 points 2 years ago (17 children)

Eventually, we will need a fediverse version of StackOverflow, Quora, etc.

[–] thfi@discuss.tchncs.de 77 points 2 years ago (10 children)

Those would be harvested to train LLMs even without asking first. 😐

[–] sramder@lemmy.world 45 points 2 years ago (2 children)

At this point I’m assuming most if not all of these content deals are essentially retroactive. They already scrapped the content and found it useful enough to try and secure future use, or at least exclude competitors.

load more comments (2 replies)

load more comments (9 replies)

load more comments (16 replies)

[–] archomrade@midwest.social 63 points 2 years ago (9 children)

Data should be socialized and machine learning algorithms should be nationalized for public use.

load more comments (9 replies)

[–] nasduia@lemmy.world 62 points 2 years ago (5 children)

Why does OpenAI want 10 year old answers about using jQuery whenever anyone posts a JavaScript question, followed by aggressive policing of what is and isn't acceptable to re-ask as technology moves on?

load more comments (5 replies)

[–] kubica@kbin.social 56 points 2 years ago (2 children)

I'm going to run out of sites at this pace.

[–] herrcaptain@lemmy.ca 45 points 2 years ago

Right? It seems like the modern internet is made up of like 5 monolithic sites, and unlimited SEO spam.

I know that's not literally true, but it sure feels like it.

load more comments (1 replies)

[–] filister@lemmy.world 55 points 2 years ago (5 children)

While at the same time they forbid AI generated answers on their website, oh the turntables.

load more comments (5 replies)

[–] bitwolf@lemmy.one 53 points 2 years ago (4 children)

Rather than delete, modify the question so its wrong. Then the ai will hallucinate.

load more comments (4 replies)

[–] chemicalwonka@discuss.tchncs.de 52 points 2 years ago (7 children)

Reddit did almost the same and don't forget guys to delete your Reddit account

load more comments (7 replies)

[–] partial_accumen@lemmy.world 49 points 2 years ago (3 children)

A malicious response by users would be to employ an LLM instructed to write plausibly sounding but very wrong answers to historical and current questions, then an army of users upvoting the known wrong answer while downvoting accurate ones. This would poison the data I would think.

load more comments (3 replies)

[–] old_machine_breaking_apart@lemmy.dbzer0.com 45 points 2 years ago (25 children)

Maybe we need a technical questions and answers siteon the fediverse!

load more comments (25 replies)

load more comments