I run the 32b one on my 7900 XTX in Alpaca https://jeffser.com/alpaca/
There is no way to fit the full model in any single AMD or Nvidia GPU in existence.
To run the full 671B model (404 GB on disk), you would need more than 404 GB of combined GPU memory and system RAM, and that's only to run it at all; you would most likely want it all in GPU memory to make it run fast.
With 24 GB of GPU memory, the largest model from the R1 series that would fit is the 32b-qwen-distill-q4_K_M (20 GB in size), available from ollama (and possibly elsewhere).
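For what it's worth, ollama's default deepseek-r1:32b tag currently points at that Qwen distill at q4_K_M, so getting it going is roughly this (the tag is an assumption on my part and may change, so check the ollama library page):

```
# Pull the 32B Qwen distill (~20GB q4_K_M quant) and start a chat with it
ollama pull deepseek-r1:32b
ollama run deepseek-r1:32b

# Check how much of it actually landed in VRAM vs. system RAM
ollama ps
```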
They run smaller variations of it on their personal machines. There are models that fit in almost any machine, but IME the first model that is genuinely useful is the 32B, which you can probably run on the XTX. Anything smaller is only good for the more trivial tasks.
Just to confirm: I tried the 7B and it was fast but pretty untrustworthy. OP's 24 GB of VRAM should be enough to run a medium-sized version, though...
I run the 32B version on my 6700 XT with an R9 3700X using ollama. It runs well, but it gets a bit slower on complex problems. I once ran a 70B Llama model, but it took a long time to finish.
Hey, not to sidetrack OP's post or your own, but I'm new to the home LLM space and I was wondering: once you have the model set up, is there a GUI? And how do you input tasks for it to do?
You can use the terminal, or something like AnythingLLM, which has a GUI and lets you import pictures and websites.
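If you go the terminal route, a minimal session looks something like this (the model tag is just an example; use whatever you pulled):

```
# Interactive chat in the terminal
ollama run deepseek-r1:32b

# Or a one-off task: pass the prompt as an argument, get the answer on stdout
ollama run deepseek-r1:32b "Summarize the tradeoffs between GGUF and exl2 quants"
```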
I have the same GPU, but I always run 7B/8B variants as exl2. Do you use GGUF so it can spill into your system RAM?
I also have a 6700 XT, but I can't get ollama running on it; it just defaults to the CPU (a Ryzen 5600). I planned to tackle this problem on a free weekend, and now I have a new reason to solve it.
On some Linux distros, like Arch Linux, you might need to install the ollama-rocm package too; see the sketch below.
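Roughly what usually works on Arch with a 6700 XT; note the gfx version override is the common community workaround for that card, not an official ROCm feature:

```
# Arch Linux: the ROCm-enabled build is a separate package
sudo pacman -S ollama-rocm

# The 6700 XT (gfx1031) isn't on ROCm's official support list; the usual
# workaround is to advertise it as gfx1030. If ollama runs as a systemd
# service, set this variable in the unit's environment instead.
HSA_OVERRIDE_GFX_VERSION=10.3.0 ollama serve
```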
Well, I don't know what you are running, but on Debian and Fedora it automatically installed the drivers and picked the GPU. I once had a problem like this where it had the wrong drivers (but that was with an NVIDIA GPU).
I run it on a 6700xt
I don't know how big the original model is, but I have an RX 6700 XT and I can easily run the Llama 3 8B distill of DeepSeek R1 with 32k context. I just haven't figured out how to get good results yet; it always does the <think></think> thing.
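If it's just the visible reasoning block that bothers you, you can filter it out of the terminal output. A rough sketch, assuming the distills wrap their chain of thought in <think>...</think> tags on their own lines, like the main R1 does:

```
# Drop every line from <think> through </think>, keeping only the final answer
ollama run deepseek-r1:8b "What is 17 * 24?" | sed '/<think>/,/<\/think>/d'
```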