this post was submitted on 07 Aug 2024
129 points (93.3% liked)
Technology
59569 readers
3825 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Maybe it can. If you find a way to port everything to text by hooking in different models, the LLM might be able to reason about everything you throw at it. Who even defines how AGI should be implemented?
LLMs do not reason, they probabilistically determine the next word based on the words you prompt it with. The most perfect implementation of "AI" was the T9 predictive text system for dumb phones cmv.
And to have conversation, behind the scenes, each prompt gets the entire conversation so far tacked on.
The model itself is static, it doesn't work like a brain that changes in response to stimulus, or form memories.
To converse about something, the entirety of an exchange is fed back into the model all over again each time it needs to produce a response. In fact, this can happen several times over for each word in a response.
It's basically an attempt at duct-taping the ability to form memories onto an otherwise static system. It works, but I don't see how that way of doing it could ever land LLMs in the land of real consciousness.
It basically means these models "think" in frames, but each frame gets exponentially heavier to process, as it has to ingest every frame that came before.
OpenAI at least is now attempting to bolt on a “memory” by having the LLM spit out short snippets of what it might need to know later, which it then has access to when completing later prompts. Like everything else post-GPT-4, it seems fine but doesn’t work really all that well at what it is intended to do.