Technology

85619 readers

4165 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

founded 3 years ago

MODERATORS

923

submitted 2 years ago* (last edited 2 years ago) by ekZepp@lemmy.world to c/technology@lemmy.world

290 comments fedilink hide all child comments

It's also important to note that ChatGPT internet search and DuckDuckGo are experiencing similar issues because they use the Bing API.

20240523_210619

you are viewing a single comment's thread
view the rest of the comments

[–] joneskind@lemmy.world 2 points 2 years ago* (last edited 2 years ago) (1 children)

Most of 7b-8b models run just fine in 4bits quant and won’t use more than 4 or 5 GB of VRAM.

The only important metric is the amount of VRAM as the model must be loaded in VRAM for fast inference.

You could use CPU and RAM but it is really painfully slow.

If you got an Apple Silicon Mac it could be even simpler.

[–] veniasilente@lemm.ee 2 points 2 years ago (1 children)

I have an Intel Celeron Mobile laptop with iGPU and, I think, 256MB VRAM. How many bs does that get me for the LLM?

~~Only half-joking. That's my still functional old daily driver now serving as homelab~~

[–] joneskind@lemmy.world 2 points 2 years ago

Well, I got a good news and a bad news.

The bad news is you won't do shit with that my dear friend.

The good news is that you won't need it because the duck is back.