this post was submitted on 05 May 2024
49 points (67.9% liked)

[–] deadsuperhero@lemmy.world 4 points 6 months ago (2 children)

It's an interesting and frustrating problem. I think there are three potential ways forward, but they're all flawed:

  1. Quasi-Centralization: a project like Mastodon or a vetted non-profit entity operates a high-concurrency server whose sole purpose is to cache link metadata and images. Servers initially pull preview data from that server instead of from the linked page itself.

  2. We find a way to do this in some zero-trust peer-to-peer way, where multiple servers compare their copies of the same data. Whatever doesn't match ends up not being used.

  3. Servers cache link metadata and previews locally with a minimal number of requests; any boost or reshare only reflects a proxied local preview of that link. Instead of doing this on a per-view or per-user basis, it's simply per-instance.
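
To make option 2 a bit more concrete, here is a rough sketch of the "compare copies, drop what doesn't match" idea. Everything in it is an assumption for illustration: the `fingerprint` and `agreed_preview` helpers, the shape of a preview as a plain dict, and the `quorum` threshold are all hypothetical, and it ignores the hard parts (how peers are discovered and why you would trust them to answer at all).

```python
import hashlib
import json
from collections import Counter
from typing import Optional


def fingerprint(preview: dict) -> str:
    """Hash a preview's fields in a stable key order so equal previews hash equally."""
    canonical = json.dumps(preview, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def agreed_preview(copies: list[dict], quorum: int = 2) -> Optional[dict]:
    """Return the preview that at least `quorum` peers agree on, else None.

    `copies` holds the preview each peer server reported for the same link.
    """
    if not copies:
        return None
    counts = Counter(fingerprint(c) for c in copies)
    winner, votes = counts.most_common(1)[0]
    if votes < quorum:
        # Not enough matching copies: use nothing rather than trust one peer.
        return None
    return next(c for c in copies if fingerprint(c) == winner)
```

With three peers where two report the same title and image, the matching pair wins and the odd one out is discarded; if no two copies match, nothing is used.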

I honestly think the third option might be the least destructive, even if it's not as efficient as it could be.
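
For what it's worth, a minimal sketch of what I mean by per-instance caching is below. It is not how any particular server implements this; the `InstancePreviewCache` class, the `fetch` callable, and the one-hour TTL are all made up for the example. The point is only that the origin gets hit once per link per instance within the TTL, no matter how many users view or boost it.

```python
import time
from typing import Callable, Dict, Tuple


class InstancePreviewCache:
    """Caches one preview per URL for the whole instance (hypothetical sketch)."""

    def __init__(
        self,
        fetch: Callable[[str], dict],
        ttl_seconds: float = 3600.0,  # assumed TTL, not taken from any real server
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch  # does the real request to the linked page
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, dict]] = {}

    def get(self, url: str) -> dict:
        """Return the cached preview, fetching from the origin only when stale or missing."""
        now = self._clock()
        cached = self._entries.get(url)
        if cached is not None and now - cached[0] < self._ttl:
            return cached[1]
        preview = self._fetch(url)
        self._entries[url] = (now, preview)
        return preview
```

Every view, boost, or reshare on the instance would go through `get()`, so repeat lookups are served locally. Note this only caps the requests coming from one instance; it does nothing about how many separate instances fetch the same link.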

[–] Quacksalber@sh.itjust.works 6 points 6 months ago

As I understand it, 3) already happens. What causes the load is that each connected instance is also loading and caching the preview.

[–] chiisana@lemmy.chiisana.net 4 points 6 months ago

Or 4) Ignore the noise and do nothing; this is a case of a user talking about things they don’t understand at best, or a blog intentionally misleading others to drum up traffic for itself at worst. This is literally not a problem. Serving that kind of traffic can be done on a single server without any CDN, and they’ve got a CDN already.