this post was submitted on 02 Mar 2025
183 points (89.6% liked)

Technology

76362 readers
4155 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] corroded@lemmy.world 13 points 7 months ago (2 children)

They say they did this by "finetuning GPT 4o." How is that even possible? Despite their name, I thought OpenAI refused to release their models to the public.

[–] echodot@feddit.uk 8 points 7 months ago* (last edited 7 months ago) (1 children)

They kind of have to now though. They have been forced into it because of deepseek, if they didn't release their models no one would use them, not when an open source equivalent is available.

[–] corroded@lemmy.world 8 points 7 months ago (1 children)

I feel like the vast majority of people just want to log onto Chat GPT and ask their questions, not host an open source LLM themselves. I suppose other organizations could host Deepseek, though.

Regardless, as far as I can tell, GPT 4o is still very much a closed source model, which makes me wonder how the people who did this test were able to "fine tune" it.

[–] echodot@feddit.uk 2 points 7 months ago

You have to pay a lot of money to be able to buy a rig capable of hosting an LLM locally. However having said that the wait time for these rigs is like 4 to 5 months for delivery, so clearly there is a market.

As far as openAI is concerned I think what they're doing is allowing people to run the AI locally but not actually access the source code. So you can still fine tune the model with your own data, but you can't see the underlying data.

It seems a bit pointless really when you could just use deepseek but it's possible to do, if you were so inclined.