Technology

84858 readers

4024 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

411

Memory prices tipped to fall as China starts flooding the market with DRAM and NAND chips (www.techspot.com)

submitted 13 hours ago by sanitation@lemmy.radio to c/technology@lemmy.world

65 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] mlg@lemmy.world 17 points 4 hours ago (2 children)

In a way it has actually.

Deepseek was big because not only did they publish the full model for everyone to use, but the MoE structure significantly brought down the hardware requirements in terms of processing power. As long as you have enough VRAM, you can run it on older hardware with no need for the latest Nvidia stuff.

Now they got v4 which many have found to be within a 10% margin of Claude and ChatGPT.

On top of that, China has cheapo VRAM GPUs available or soon to be released, like the MTT S80. Yeah it sucks as a Graphics card because the chip is behind, but you get 16Gb of GDDR6 for much cheaper than anything else.

But its not a conspiracy to fight China. The infinite scaling was just Nvidia solidifying themselves as the monopoly because they want all AI infrastructure to be dependent on them, which is why they still illegally export to China, despite an export ban attempting to reduce their potential competition.

Moore Threads (MTT) already has their own CUDA like system called MUSA, and I'm sure they'll be happy to put in proper hardware support for new stuff like Bf16 and FP8/4. It'll take a few years, but eventually China will catch up to the point where Nvidia gets shanked by cheaper hardware.

[–] Truscape@lemmy.blahaj.zone 2 points 1 hour ago

Wasn't there development of a linux translation layer for CUDA workloads to run on AMD GPUs? I haven't heard about it in a while, but I'd imagine that'd help the situation.

[–] brucethemoose@lemmy.world 4 points 3 hours ago* (last edited 2 hours ago)

MTT is just a pipe dream, last I checked. But Deepseek is actively being served, in mixed FP8/FP4, on racks of Huawei accelerators.

I believe Baidu trained a model on them, too. But most training (like Deepseek’s) is still done on CUDA.

…Also, be careful equating this stuff with any kind of “consumer friendly” hardware you or I could buy. That’s less likely. The Huawei accelerators (and other local Chinese hardware experiments) are geared towards huge servers serving requests in parallel.