this post was submitted on 15 Aug 2025
636 points (95.7% liked)

Technology

76339 readers
4448 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

The University of Rhode Island's AI lab estimates that GPT-5 averages just over 18 Wh per query, so putting all of ChatGPT's reported 2.5 billion requests a day through the model could see energy usage as high as 45 GWh.

A daily energy use of 45 GWh is enormous. A typical modern nuclear power plant produces between 1 and 1.6 GW of electricity per reactor per hour, so data centers running OpenAI's GPT-5 at 18 Wh per query could require the power equivalent of two to three nuclear power reactors, an amount that could be enough to power a small country.

you are viewing a single comment's thread
view the rest of the comments
[–] Corkyskog@sh.itjust.works 1 points 2 months ago* (last edited 2 months ago) (3 children)

How slow?

Loading up a website with flash and GIF in the 90s dialup slow... Or worse?

[–] Evono@lemmy.dbzer0.com 3 points 2 months ago

Basicly I can run 9b models on my 16gb gpu mostly fine like getting responses of lets say 10 lines in a few seconds.

Bigger models if they don't outright crash take for the same task then like 5x or 10x longer so long it isn't even useful anymore

So very worse.

[–] EncryptKeeper@lemmy.world 2 points 2 months ago* (last edited 2 months ago)

Like make a query and then go make yourself a sandwich while it spits out a word every other second slow.

There are very small models that can run on mid range graphics cards and all, but it’s not something you’d look at and say “Yeah this does most of what chatGPT does”

I have a model running on a gtx 1660 and I use it with Hoarder to parse articles and create a handful a tags for them and it’s not… great at that.

[–] gerryflap@feddit.nl 1 points 2 months ago

It's horrendously slow, unusable imo. With the larger DeepSeek distilled models I tried that didn't fit into VRAM you could easily wait 5 minutes until it was done writing its essay. Compared to just a few seconds when it does. Bit that's with a RTX 3070 Ti, not something the average ChatGPT user has lying around probably.