this post was submitted on 11 Jun 2023
40 points (97.6% liked)

Self Hosted - Self-hosting your services.

11440 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules

Important

Beginning of January 1st 2024 this rule WILL be enforced. Posts that are not tagged will be warned and if not fixed within 24h then removed!

Cross-posting

If you see a rule-breaker please DM the mods!

founded 3 years ago
MODERATORS
 

If I were to create my own instance federated with all the other instances, as of today, how much data would I be storing, since I would make a copy of all the content?

I know this will vary a lot, but I’m looking for a ballpark figure to have an idea. I don’t think it would be a lot, but I can’t find an estimate anywhere.

Reposted from https://lemmy.world/post/55030 as I think this community is probably a better fit

you are viewing a single comment's thread
view the rest of the comments
[–] knighthawk0811@lemmy.one 8 points 1 year ago (2 children)

it's my understanding that you won't make a full copy of everything. you'll only copy the communities that are added and you won't be copying the full history.

I'm unsure how far back it goes but it might be just the one day. i saw talk of changing it to go back as far as someone actually scrolled and pressed next but that was just an idea at this point.

[–] pimeys@lemmy.nauk.io 10 points 1 year ago (2 children)

Also it doesn't copy the images as Mastodon does, only the text objects.

[–] knighthawk0811@lemmy.one 3 points 1 year ago (1 children)

oh right, that's a big one there. that images stay on the origin instance and they linked from there

[–] pimeys@lemmy.nauk.io 3 points 1 year ago (1 children)

I actually kind of like how Akkoma does this. It creates a constant size proxy with nginx, and all the images come from predetermined host instead of all over the net. It's a good mixture of not using tens of gigabytes of space and still spreading the load a bit between the instances.

[–] utzer@social.yl.ms 1 points 1 year ago (1 children)

@pimeys @knighthawk0811 yes, that sounds more privacy friendly, #Friendica also has a proxy function, it is also important that the proxy is just accessible for logged in users... so it get's more complicated. Otherwise bad people will posts media some place and link it in scam pages/emails and such.

[–] utzer@social.yl.ms 1 points 1 year ago

@knighthawk0811 @pimeys I think the proxy does not really store the media files anymore, it just retrieves them and serves them... because of the potential danger by scammers. So it protects the users privacy but does not enable people to exploit the proxy as as host for scam.

[–] ndr@lemmy.world 3 points 1 year ago

For this reason, I assume it should be feasible to self-host an instance that sync with most communities.

For reference, Wikipedia (en) takes up just over 20 GB (https://en.wikipedia.org/wiki/Wikipedia:Size_of_Wikipedia), so I assume much less than that for Lemmy (with the current userbase).

[–] ndr@lemmy.world 3 points 1 year ago

Okay, I see, thanks.

But for example, do we know how much data lemmy.ml stores?