Technology

59605 readers

4225 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

195

ChatGPT provides false information about people, and OpenAI can’t correct it (noyb.eu)

submitted 6 months ago by alb_004@lemm.ee to c/technology@lemmy.world

61 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] givesomefucks@lemmy.world 23 points 6 months ago (39 children)

If scientists made AI, then it wouldn't be an issue for AI to say "I don't know".

But capitalists are making it, and the last thing you want is it to tell an investor "I don't know". So you tell it to make up bullshit instead, and hope the investor believes it.

It's a terrible fucking way to go about things, but this is America...

[–] expr@programming.dev 39 points 6 months ago (2 children)

It's got nothing to do with capitalism. It's fundamentally a matter of people using it for things it's not actually good at, because ultimately it's just statistics. The words generated are based on a probability distribution derived from its (huge) training dataset. It has no understanding or knowledge. It's mimicry.

It's why it's incredibly stupid to try using it for the things people are trying to use it for, like as a source of information. It's a model of language, yet people act like it has actual insight or understanding.

[–] hatedbad@lemmy.sdf.org 1 points 6 months ago

you’re so close, just why exactly do you think people are using it for these things it’s not meant for?

because every company, every CEO, every VP, is pushing every sector of their companies to adopt AI no matter what.

most actual people understand the limitations you list, but it’s the capitalists at the table that are making AI show up where it’s not wanted

[+] givesomefucks@lemmy.world -30 points 6 months ago (3 children)

Imagine searching your computer for a PDF named "W2.2026"...

Would you rather the computer tell you it's not in the database? Or would you prefer a random PDF displayed with the title "W2.2026"?

This isn't a new problem.

You're getting hung up on "know" instead "has relevant information in it's database and can access it".

But besides all that and the other things you got wrong:

It's still about capitalism for the reasons I just said

[–] expr@programming.dev 25 points 6 months ago

You do not understand how these things actually work. I mean, fair enough, most people don't. But it's a bit foolhardy to propose changes to how something works without understanding how it works now.

There is no "database". That's a fundamental misunderstanding of the technology. It is entirely impossible to query a model to determine if something is "present" or not (the question doesn't even make sense in that context).

A model is, to greatly simplify things, a function (like in math) that will compute a response based on the input given. What this computation does is entirely opaque (including to the creators). It's what we we call a "black box". In order to create said function, we start from a completely random mapping of inputs to outputs (we'll call them weights from now on) as well as training data, iteratively feed training data to this function and measure how close its output is to what we expect, adjusting the weights (which are just numbers) based on how close it is. This is a gross simplification of the complexity involved (and doesn't even touch on the structure of the model's network itself), but it should give you a good idea.

It's applied statistics: we're effectively creating a probability distribution over natural language itself, where we predict the next word based on how frequently we've seen words in a particular arrangement. This is old technology (dates back to the 90s) that has hit the mainstream due to increases in computing power (training models is very computationally expensive) and massive increases in the size of dataset used in training.

Source: senior software engineer with a computer science degree and multiple graduate-level courses on natural language processing and deep learning

Btw, I have serious issues with both capitalism itself and machine learning as it is applied by corporations, so don't take what I'm saying to mean that I'm in any way an apologist for them. But it's important to direct our criticisms of the system as precisely as possible.

[–] Zarxrax@lemmy.world 13 points 6 months ago (1 children)

You don't seem to understand. There is no database.

[–] wahming@monyet.cc 7 points 6 months ago

It's not a database. God, how many years is it going to take before people understand just what LLMs are and are not capable of?

load more comments (36 replies)