[–] brucethemoose@lemmy.world 5 points 2 days ago* (last edited 2 days ago)

I am referencing this: https://z.ai/blog/glm-4.5

The full GLM? Basically a 3090 or 4090 plus a budget EPYC CPU (mostly for its many RAM channels), or maybe 2 GPUs on a Threadripper system.

GLM Air? That one would work on a desktop with 16GB+ of VRAM; just slap in 96GB+ (maybe 64GB?) of fast RAM. Or the recent Framework Desktop, any mini PC/laptop with the 128GB Ryzen AI Max+ 395 config, or a 128GB+ Mac.
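For rough sizing (back-of-envelope math on my part, not from the blog post): GLM-4.5 is 355B total parameters and Air is 106B, so at a Q4-class quant (~4.5 bits/weight) the weights alone land roughly here:

```sh
# Back-of-envelope weight sizes, assuming the published parameter counts
# (GLM-4.5: 355B total; GLM-4.5 Air: 106B total) and ~4.5 bits/weight for
# a Q4-class quant. KV cache and runtime overhead come on top of this.
echo "GLM-4.5  ~$((355 * 45 / 80)) GB"   # ~199 GB -> EPYC/Threadripper RAM territory
echo "GLM Air  ~$((106 * 45 / 80)) GB"   # ~59 GB  -> fits 64-96GB desktop RAM + GPU
```

Which is why Air squeaks onto a beefy desktop while the full model wants a server board.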

You’d download the weights, quantize them yourself if needed, and run them in ik_llama.cpp (which should get GLM support imminently).

https://github.com/ikawrakow/ik_llama.cpp/
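Roughly, the workflow looks like this (a sketch, not exact commands: the Hugging Face repo name is a placeholder, and the -ot expert-offload spelling follows llama.cpp conventions, so check the ik_llama.cpp README for specifics):

```sh
# 1. Grab GGUF weights from Hugging Face (repo name is a placeholder):
huggingface-cli download SOME_ORG/GLM-4.5-Air-GGUF --local-dir ./models

# 2. Optionally re-quantize to a smaller type yourself:
./llama-quantize ./models/glm-4.5-air-f16.gguf \
    ./models/glm-4.5-air-Q4_K_M.gguf Q4_K_M

# 3. Serve it, keeping the shared layers on the GPU and the MoE experts
#    in system RAM (the -ot pattern is an assumption; see the README):
./llama-server -m ./models/glm-4.5-air-Q4_K_M.gguf \
    --n-gpu-layers 99 -c 8192 -ot "exps=CPU"
```

The expert-offload trick in step 3 is the whole point of the hybrid setup: the small always-active layers stay on the GPU for speed while the big, sparsely-used expert tensors sit in system RAM.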

But these are…not lightweight models. If you don’t want a homelab, there are smaller models better suited to typical hardware configs.