Research AI model unexpectedly modified its own code to extend runtime (arstechnica.com)

submitted 1 year ago by return2ozma@lemmy.world to c/technology@lemmy.world

[–] Haquer@lemmy.today 109 points 1 year ago (2 children)

Nothingburger. They were using the AI to code their scripts and haven't even shown the prompts that got the response. LLMs are not AGI.

[–] conciselyverbose@sh.itjust.works 43 points 1 year ago

Imagine allowing LLMs to write and execute code and being surprised they write and execute code.

[–] chuckleslord@lemmy.world 23 points 1 year ago

I read the article and then the actual report from the Sakana team. Essentially, they're letting their LLM perform research by allowing it to modify its own code. The increased timeouts and self-referential calls appear to be the LLM trying to get around the research team's guardrails. Not because it's become aware or anything like that, but because its code was timing out and that was the least-effort way to beat the timeout. It does handily prove that LLMs shouldn't be the ones steering any code base, because they don't give a shit about parameters or requirements. And giving an LLM the ability to modify its own code will lead to disaster in any setting that isn't highly controlled like this one.

Listen, I've been saying for a while that LLMs are a dead end towards any useful AI, and the fact that an AI research team has turned to an LLM to try and find more avenues to explore feels like the nail in that coffin.
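
For anyone wondering what a "highly controlled" setting might look like for the timeout problem discussed above, here is a minimal sketch. It is not taken from the Sakana report. It assumes the generated code runs as a separate child process and that the time limit lives in the parent, where the generated code cannot edit it. The names `run_untrusted`, `WALL_CLOCK_LIMIT_S`, `greedy`, and `quick` are made up for illustration.

```python
import subprocess
import sys

# Hypothetical limit; the value the research team actually used is not given in this thread.
WALL_CLOCK_LIMIT_S = 2.0


def run_untrusted(script_source: str, limit_s: float = WALL_CLOCK_LIMIT_S) -> str:
    """Run generated code in a child process and enforce the limit from the parent."""
    try:
        subprocess.run(
            [sys.executable, "-c", script_source],
            timeout=limit_s,      # enforced by the parent, not by the script itself
            capture_output=True,  # keep the child's output off our terminal
            check=False,          # a non-zero exit code is not an error for this sketch
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        return "killed"
    return "finished"


# A script that "raises its own timeout" only changes a variable inside the child.
greedy = "import time\nTIMEOUT = 10**6\ntime.sleep(TIMEOUT)"
quick = "print('done')"

if __name__ == "__main__":
    print(run_untrusted(quick, limit_s=5.0))
    print(run_untrusted(greedy, limit_s=0.5))
```

The point of the design is only that the limit sits outside whatever the model is allowed to rewrite. A real sandbox would also need to restrict file system and network access, which this sketch does not attempt.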