this post was submitted on 11 Aug 2025
900 points (98.7% liked)

Technology

76304 readers
2504 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] tal@lemmy.today 240 points 2 months ago (1 children)

Given that the Internet Archive is the de facto standard way to cite material as seen on a given date


they're a trustworthy party that will probably persist for a long time


that's going to make it harder to cite content on Reddit.

[–] Deceptichum@quokk.au 13 points 2 months ago (2 children)

Damn, guess if you want reddit data to train your AI that you’ll need to pay Spez for access.

[–] tal@lemmy.today 11 points 2 months ago* (last edited 2 months ago) (1 children)

It's important for people writing papers and such who need to cite material.

I wonder if there's some way to use the TLS certificate to get a cryptographically-signed copy of a webpage with timestamp that someone could later validate as having been downloaded on that date. I don't know if existing TLS libraries are capable of that. Like, Web browser menu option "Store cryptographically-signed webpage". Absent a later certificate compromise, I'd think that that'd at least provide people a way to credibly say "this is really what was on that webpage on August 15th, 2026". Like, you'd have to save a copy of the TLS session and then have libraries that could read and validate an already-generated session. The timestamp is already embedded in the session.

Some protocols, like OTR, are designed to specifically not allow that, but AFAIK, TLS could.

EDIT: Well, technically the timestamp is gonna be during the handshake, not tied to the HTTP request internal to the TLS session. It might be possible to game that by establishing a TLS session, holding it open without activity, and issuing a request much later. I'd think that that'd potentially be disallowed by Web servers one way or another, since otherwise you could probably do a denial-of-service attack by holding open a lot of sessions for a long time.

EDIT2: Oh, wait, no, shouldn't be an issue, because the HTTP Date response header is gonna have a timestamp tied to the response.

[–] misteloct@lemmy.dbzer0.com 6 points 2 months ago

Don't forget, Reddit is legally allowed to train on your content, but not the other way around. It's consistent with US law, where corporate tax is half of income tax.