Technology

59569 readers

3825 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

129

Major shifts at OpenAI spark skepticism about impending AGI timelines (arstechnica.com)

submitted 3 months ago by some_guy@lemmy.sdf.org to c/technology@lemmy.world

84 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] MentalEdge@sopuli.xyz 10 points 3 months ago* (last edited 3 months ago) (1 children)

And to have conversation, behind the scenes, each prompt gets the entire conversation so far tacked on.

The model itself is static, it doesn't work like a brain that changes in response to stimulus, or form memories.

To converse about something, the entirety of an exchange is fed back into the model all over again each time it needs to produce a response. In fact, this can happen several times over for each word in a response.

It's basically an attempt at duct-taping the ability to form memories onto an otherwise static system. It works, but I don't see how that way of doing it could ever land LLMs in the land of real consciousness.

It basically means these models "think" in frames, but each frame gets exponentially heavier to process, as it has to ingest every frame that came before.

[–] mozz@mbin.grits.dev 3 points 3 months ago

OpenAI at least is now attempting to bolt on a “memory” by having the LLM spit out short snippets of what it might need to know later, which it then has access to when completing later prompts. Like everything else post-GPT-4, it seems fine but doesn’t work really all that well at what it is intended to do.