this post was submitted on 05 Sep 2024

79 points (74.2% liked)

Technology

73948 readers

3563 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

Generative AI creates playable version of Doom game with no code (www.newscientist.com)

submitted 11 months ago by gedaliyah@lemmy.world to c/technology@lemmy.world

48 comments fedilink hide all child comments

The model, called GameNGen, was made by Dani Valevski at Google Research and his colleagues, who declined to speak to New Scientist. According to their paper on the research, the AI can be played for up to 20 seconds while retaining all the features of the original, such as scores, ammunition levels and map layouts. Players can attack enemies, open doors and interact with the environment as usual.

After this period, the model begins to run out of memory and the illusion falls apart.

all 50 comments

sorted by: hot top controversial new old

[–] CheeseNoodle@lemmy.world 56 points 11 months ago (1 children)

I really hope this doesn't catch on, Games are already horifically inefficient, imagine if we started making them like this and a 4090 becomes the minnimum system requirement for goddamn DOOM.

[–] UnityDevice@startrek.website 9 points 11 months ago (2 children)

Games are already horifically inefficient

That's so far from the truth, it hurts me to read it. Games are one of the most optimised programs you can run on your computer. Just think about it, it's a application rendering an entire imaginary world every dozen milliseconds. Compare it to anything else you run, like say slack or teams, which makes your CPU sweat just to notify you about a new message.

[–] 30p87@feddit.org 12 points 11 months ago (1 children)

Many games, especially AAA games or ones relying on common game engines, are actually horribly inefficient. It's hard to run any Unity/Unreal game in 4k on my 1070. Even if it has shit graphics like Lethal Company. What does run well? Smaller, custom engines, even Metro Exodus runs with 60+ FPS in 4k on my 1070, and still looks very good. Why? Because 4A Game is/was actually interested in creating a good engine and games. That's the whole reason they split from the S.T.A.L.K.E.R team: Because, in their opinion, the engine was too inefficient.

Most games are just a quick cash grab tho, especially ones by large companies like EA. Other large companies with a significantly lower output of games, eg. Valve, do produce programmatically higher quality games tho.

[–] rhombus@sh.itjust.works 8 points 11 months ago (1 children)

It's hard to run any Unity/Unreal game in 4k on my 1070

Both of these engines are capable of making very optimized games, it’s just that most of the developers using them either don’t have the expertise or don’t care to put in the effort.

[–] 30p87@feddit.org 4 points 11 months ago

I know. The inherent problem of games made with those engines is the lack of motivation, knowledge and experience of devs to make (programmatically) good games. Only very few games using those engines are good in that sense, and as exceptions confirm a rule I'd just simplify it to that statement.

[–] PaellaVacuum -4 points 11 months ago* (last edited 11 months ago)

Right buddy, seems like you've never had to play on a 3-gen old non-gaming laptop. That's such a privileged view lmao.

[–] YourNetworkIsHaunted@awful.systems 40 points 11 months ago (2 children)

Note that the image here isn't from the AI project, it's from actual Doom. Their own screenshots have weird glitches including a hit splat that looks like a butt in the image I've seen closest to this one.

And when they say they've "run the game" they do not mean that there was a playable version that was publicly compared to the original. Rather they released short video clips of alleged gameplay and had their evaluators try to identify if they were from the AI recreation or from actual Doom.

Even by the abysmal standards of generative AI projects this is a hell of a grift.

[–] Telorand@reddthat.com 13 points 11 months ago

Even by the abysmal standards of generative AI projects this is a hell of a grift.

But if you invest now, you can make a game-generating AI a reality! /s

[–] LunarLoony@lemmy.sdf.org 2 points 11 months ago (1 children)

I'm pretty sure that screenshot is from the video. The zombieman has no feet

[–] YourNetworkIsHaunted@awful.systems 1 points 11 months ago

Possibly fair. I'm pretty sure I've seen that exact screenshot used in other articles about Doom, but I'm not enough of a Doom nerd to be sure.

There's a decent writeup over at Pivot-to-AI that looks at the paper as a whole in more detail.

[–] the_artic_one@programming.dev 38 points 11 months ago

Thinking quickly, Generative AI constructs a playable version of Doom, using only some string, a squirrel, and a playable version of Doom.

[–] SnotFlickerman@lemmy.blahaj.zone 25 points 11 months ago* (last edited 11 months ago) (8 children)

An AI-generated recreation of the classic computer game Doom can be played normally despite having no computer code or graphics.

After this period, the model begins to run out of memory and the illusion falls apart.

Why are we lying about this? Just because it happens in the AI "black box" doesn't mean it's not producing some kind of code in the background to make this work. They even admit that it "runs out of memory." Huh, last I checked, you'd need to be running code to use memory. The AI itself is made of code! No computer code or graphics, my ass.

The model, called GameNGen, was made by Dani Valevski at Google Research and his colleagues, who declined to speak to New Scientist.

Always a good look. /s

[–] xionzui@sh.itjust.works 33 points 11 months ago (1 children)

I mean, yes, technically you build and run AI models using code. The point is there is no code defining the game logic or graphical rendering. It’s all statistical predictions of what should happen next in a game of doom by a neural network. The entirety of the game itself is learned weights within the model. Nobody coded any part of the actual game. No code was generated to run the game. It’s entirely represented within the model.

[–] huginn@feddit.it 12 points 11 months ago (2 children)

What they've done is flattened and encoded every aspect of the doom game into the model which lets you play a very limited amount just by traversing the latent space.

In a tiny and linear game like Doom that's feasible... And a horrendous use of resources.

[–] SpaceNoodle@lemmy.world 5 points 11 months ago

It doesn't even actually do that. It's a glitchy mess.

[–] Blue_Morpho@lemmy.world 8 points 11 months ago

Imagine you are shown what Doom looks like, are told what the player does, and then you draw the frames of what you think it should look like. While your brain is a computation device, you aren't explicitly running a program. You are guessing what the drawings should look like based on previous games of Doom that you have watched.

[–] cdf12345@lemm.ee 4 points 11 months ago

Maybe they should have specified , the Doom Source Code

[–] fruitycoder@sh.itjust.works 1 points 11 months ago

This would be like playing DnD where you see a painting and describe what you would do next as if you were the painting and they an artists painted the next scene for you.

The artists isn't rolling dice, following the rule book, or any actual game elements they ate just painting based on the last painting and your description of the next.

Its incredibly nove approchl if not obviously a toy problem.

load more comments (4 replies)

[–] BetaDoggo_@lemmy.world 25 points 11 months ago (2 children)

It's cool but it's more or less just a party trick.

[+] todd_bonzalez@lemm.ee 10 points 11 months ago* (last edited 8 months ago) (3 children)

[deleted]

[–] LunarLoony@lemmy.sdf.org 2 points 11 months ago (1 children)

I'm not convinced. The ammo seems to go up and down on a whim, as does the health

[–] JDPoZ@lemmy.world 3 points 11 months ago

Because “AI” isn’t actually “artificial intelligence.” It’s the marketing term that seems to have been adapted by every corporation to describe “LLMs…” which are more like extra fancy power guzzling parrots.

Its why the best cases for them are mimicking things brainlessly, like voice cloning for celebrity impressions… but that doesn’t mean it can act or comprehend emotion, or know how many fingers a hand should have and why they constantly hallucinate contextless bullshit… because just like a parrot doesn’t actually know any meaning of what it is saying when it goes “POLLY WANT A CRACKER…” it just knows the tall thing will give it a treat if it makes this specific squawk with its beak.

[–] fruitycoder@sh.itjust.works 2 points 11 months ago

Honestly I thinkyour self driving example is something this could be really cool for. If the generation can exceed real time (I.e. 20 secs of future image prediction can happen in under 20 secs) then you can preemptively react with the self driving model and cache the results.

If the compute costs can be managed maybe even run multiple models against each other to develop an array likely branch predictions (you know what I turned left)

Its even cooler that player input helps predict the next image.

[–] rolling_resistance@lemmy.world -1 points 11 months ago

Can't wait for your self-driving car to go out of memory mid ride.

[–] echodot@feddit.uk 4 points 11 months ago* (last edited 11 months ago) (1 children)

It's a proof of concept demonstration not a final product. You might as well say the Wright brothers didn't have anything other than their party trick.

So many practical applications for being able to do this beyond just video games in fact video games are probably the least useful application for this technology.

[–] LunarLoony@lemmy.sdf.org 1 points 11 months ago

So many practical applications

Such as?

[–] Drusenija@lemmy.world 24 points 11 months ago (5 children)

Regardless of the technology, isn't this essentially creating a facsimile of a game that already exists? So the tech isn't really about creating a new game, it's about replicating something that already exists in a fairly inefficient manner. That doesn't really help you to create something new, like I'm not going to be able to come up with an idea for a new game, throw it at this AI, and get something playable out of it.

That and the fact it "can be played for up to 20 seconds" before "the model begins to run out of memory" seems like, I don't know, a fairly major roadblock?

[–] UndercoverUlrikHD@programming.dev 15 points 11 months ago (1 children)

It's just a research paper, not a product. It's about discovering and learning new possible methods and applications.

[–] Drusenija@lemmy.world 5 points 11 months ago

That's a fair point actually, I'm looking at it through a product lens, not a research one.

[–] bob_lemon@feddit.org 13 points 11 months ago

Yes, this does nothing for game dev. But I don't think it was supposed to.

The fact that this is a genAI Model generating a reasonable, context aware image a whopping 20 times a second is nonetheless pretty impressive.

[–] Goun@lemmy.ml 5 points 11 months ago (1 children)

Unless you can monetize those 20 seconds like crazy

[–] Drusenija@lemmy.world 5 points 11 months ago

This sounds like the basis for a new Warioware game.

[–] echodot@feddit.uk 3 points 11 months ago* (last edited 11 months ago) (1 children)

So you think a project should be killed immediately upon inception because it's not immediately perfect? That is a really really weird attitude.

[–] Drusenija@lemmy.world 5 points 11 months ago (1 children)

I'm more taking issue with this quote from the article:

"Researchers behind the project say similar AI models could be used to create games from scratch in the future, just as they create text and images today."

This doesn't strike me as something that can create a game from scratch, it's something that can take an existing game and replicate it without having access to the underlying source code, and use an immense amount of processing power to do it.

Since it seems they're using generative AI based technology underneath it, they're effectively building a Doom model. You might be able to spin a Doom clone off from that but I don't see it as something you could practically throw another game type at.

That being said as I said in a different reply, I was viewing it through the lens of something more product based rather than that of a research project. As a field of research, it's an interesting topic. But I'm not sure how you connect it to "create games from scratch" if you don't already have an existing game available to train the model on.

[–] echodot@feddit.uk 3 points 11 months ago

Why do you think it needs an existing game to train the model on? They used Doom precisely because it already exists.

The entire point to the research paper was to see if humans could tell the difference between the generated content and the real game, that way they have a measurable metric of how viable this technology is even if only in theory, that means that they have to make something that's based off a real game.

Obviously the technology isn't commercially viable yet. But the fact that it looks even remotely like Doom shows that there is promise to the technology.

[–] locuester@lemmy.zip 1 points 11 months ago (1 children)

Perhaps you could be missing the trajectory of continuous improvement. How long until The Matrix?

[–] echodot@feddit.uk 3 points 11 months ago

It's an exponential increase as well and humans are very bad at judging exponential increases they look at something like this and they see no promise in it because they can't see that four or five iterations down the line (and in the world of AI that could very easily be 3 months) it will be hundreds of times better.

[–] paraphrand@lemmy.world 22 points 11 months ago* (last edited 11 months ago) (1 children)

“Playable” nah. “Interactive” yes.

[–] gedaliyah@lemmy.world 4 points 11 months ago

That seems to be the case.

[–] harsh3466@lemmy.ml 17 points 11 months ago (2 children)

Correct me if I’m wrong, but doesn’t there have to be a code layer somewhere in there?

It’s like all those “no code” platforms that just obscure away the actual coding via a gui and blocks/elements/whataver.

[–] hasnt_seen_goonies@lemmy.world 9 points 11 months ago (1 children)

In this case, no. This is just interpreting what the next frame should be by the previous one. Like how the sora videos work, but with input.

[–] Virkkunen@fedia.io 14 points 11 months ago (1 children)

So then the code is to make the AI generate images and take input

[–] hasnt_seen_goonies@lemmy.world 10 points 11 months ago

Or the code is the operating system that the application is running on, or the code is the firmware that is operating the GPU that is crunching the numbers to make the neural net, or the code is the friends we made along the way.

[–] dustyData@lemmy.world 16 points 11 months ago (1 children)

This is just a pile of garbage. Jim Sterling's break down is the most complete argument. But this is just a plain ol bag of shit.

[–] MoogleMaestro@lemmy.zip 9 points 11 months ago

Link to the video. I agree, it was a really good video on this topic and how wrong it is philosophically.