Technology

72739 readers

1566 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

208

Microsoft says “Prism” translation layer does for Arm PCs what Rosetta did for Macs (arstechnica.com)

submitted 1 year ago by misk@sopuli.xyz to c/technology@lemmy.world

124 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] pycorax@lemmy.world 12 points 1 year ago (14 children)

There's nothing stopping x86-64 processors from being power efficient. This article is pretty technical but does a really good explanation of why that's the case: https://chipsandcheese.com/2024/03/27/why-x86-doesnt-need-to-die/

It's just that traditionally Intel and AMD earn most of their money from the server and enterprise sectors where high performance is more important than super low power usage. And even with that, AMD's Z1 Extreme also gets within striking distance of the M3 at a similar power draw. It also helps that Apple is generally one node ahead.

[–] QuaternionsRock@lemmy.world 1 points 1 year ago (10 children)

This article fails to mention the single biggest differentiator between x86 and ARM: their memory models. Considering the sheer amount of everyday software that is going multithreaded, this is a huge issue, and the reason why ARM drastically outperforms x86 running software like modern web browsers.

[–] pycorax@lemmy.world 2 points 1 year ago (9 children)

Do you mind elaborating what is it about the difference on their memory models that makes a difference?

[–] sunbeam60@lemmy.one 2 points 1 year ago (2 children)

On the x86 architecture, RAM is used by the CPU and the GPU has a huge penalty when accessing main RAM. It therefore has onboard graphics memory.

On ARM this is unified so GPU and CPU can both access the same memory, at the same penalty. This means a huge class of embarrassingly parallel problems can be solved quicker on this architecture.

[–] pycorax@lemmy.world 1 points 1 year ago (1 children)

Do x86 CPUs with iGPUs not already use unified memory? I'm not exactly sure what you mean but are you referring to the overhead of having to do data copying over from CPU to GPU memory on discrete graphics cards when performing GPU calculations?

[–] sunbeam60@lemmy.one 1 points 1 year ago (1 children)

Yes unified and extremely slow compared to an ARM architecture’s unified memory, as the GPU sort of acts as if it was discrete.

[–] pycorax@lemmy.world 1 points 1 year ago (1 children)

Do you have any sources for this? Can't seem to find anything specific describing the behaviour. It's quite surprising to me since the Xbox and PS5 uses unified memory on x86-64 and would be strange if it is extremely slow for such a use case.

[–] sunbeam60@lemmy.one 1 points 1 year ago (1 children)

It’s been a while since I’ve coded on the Xbox, but at least in the 360, the memory wasn’t really unified as such. You had 10 MB of EDRAM that formed your render target and then there was specialised functions to copy the EDRAM output to DRAM. So it was still separated and while you could create buffers in main memory that you access in the shaders, at some penalty.

It’s not that unified memory can’t be created, but it’s not the architecture of a PC, where peripheral cards communicate over the PCI bus, with great penalties to touch RAM.

[–] pycorax@lemmy.world 1 points 1 year ago

Well for the current generation consoles they're both x86-64 CPUs with only a single set of GDDR6 memory shared across the CPU and GPU so I'm not sure if you have such a penalty anymore

It’s not that unified memory can’t be created, but it’s not the architecture of a PC, where peripheral cards communicate over the PCI bus, with great penalties to touch RAM.

Are there any tests showing the difference in memory access of x86-64 CPUs with iGPUs compared to ARM chips?

[–] QuaternionsRock@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

That’s actually not what I was referring to, although the unified memory architecture is certainly more power efficient for mixed-intensive workloads. The cost of transferring to/from dedicated GPU memory is (unsurprisingly) quite large.

load more comments (6 replies)

load more comments (9 replies)