this post was submitted on 05 Nov 2024
401 points (99.8% liked)

Fediverse

28248 readers
776 users here now

A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!

Rules

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy

founded 1 year ago
MODERATORS
 

Disclaimer: I wrote this article and made this website.

There was some talk of this issue in the recent fediverse inefficiencies thread. I'm hopeful that in the future we'll have a decentralized solution for file hosting but for now I deeply believe that users should pay for their own file hosting.

top 50 comments
sorted by: hot top controversial new old
[–] glowing_hans@sopuli.xyz 0 points 8 hours ago* (last edited 8 hours ago)

am I the only one who would advocate for text only storage: no. This comment also gets my point? There should be text only lemmy instances which only save text and do not allow any kind of image posting or storage.

  • text is less offensive to consume and moderate than evil images or video
  • text is way more information dense and can be even compressed more! Truly the green biosphere friendly data format. I would be willing to save text only data of strangers on my hard-drive, but not images or video. Could even be valuable llm analysis training data.

Yes people could post base64 encoded images, but that is a larger technical barrier and can be detected. If image storage is really need, images should be heavily compressed (webp 90% quality loss), provided as links to external sites, and whenever possible svg / vector graphics should be preferred.

[–] NuXCOM_90Percent@lemmy.zip 125 points 2 days ago (5 children)

We totally need sustainable file hosting. Freedom!

Wait... the fuck did you just upload? Oh god. Oh god no. Do I have to call the cops on you? Oh no. Wait, does this count as possession? FUCK!!!!!!

We need someone else to handle the totally sustainable file hosting. Freedom!

[–] Kalkaline@leminal.space 47 points 2 days ago (4 children)

Yep, there needs to be moderation tools that can be quickly deployed to stop the illegal/immoral/evil stuff from spreading and taking over self-hosted servers.

[–] NuXCOM_90Percent@lemmy.zip 41 points 2 days ago (2 children)

And moderation of this kind of content almost always sounds like torture when you hear about what facebook and the like are outsourcing.

Theoretically, this is a good problem for computer vision/machine learning. But there are a LOT of false positives (I think it was Aftermath who did an article on a study of when a nipple becomes female?). And.. what ethical responsibility do you have to report on the fiftieth time that SheIsReallyAnEightThousandYearOldDragon_6969 uploaded CSAM? And how quick do you think people are going to lose faith in you and start wondering if you'll also report on the rampant piracy?

And... there are also false negatives. At which point you find out you have been hosting something truly heinous for the past few months... possibly when local law enforcement tells you.

Like a lot of things: it sounds great. But nobody in their right mind is going to host this for free. And once you start accepting money you start opening yourself up to a LOT of regulations.

[–] rigatti@lemmy.world 9 points 2 days ago (1 children)

A nipple becomes female precisely when it wants to!

load more comments (1 replies)
load more comments (1 replies)
load more comments (3 replies)
load more comments (4 replies)
[–] C126@sh.itjust.works 17 points 2 days ago (3 children)

anyway to use torrent protocol somehow? Like popcorn time did?

[–] tiddy@sh.itjust.works 8 points 1 day ago (1 children)

Ipfs would be similar but more purpose based

[–] C126@sh.itjust.works 1 points 1 day ago

Never heard of that for, it looks exactly built for this problem, better than torrent. Good call.

[–] TriflingToad@sh.itjust.works 3 points 1 day ago (1 children)

if it was easy to set up id definitely host a terabyte of Lemmy. As a pet.

[–] moseschrute@lemmy.world 2 points 22 hours ago

What would you name your pet

[–] nutsack@lemmy.world 4 points 2 days ago* (last edited 2 days ago)

this is actually a super interesting idea of which i have never seen proof of concept. other than maybe freenet or something.

[–] hperrin@lemmy.world 48 points 2 days ago* (last edited 2 days ago) (3 children)

Ok, hear me out.

We find the users with the slowest internet and start sending them all the data. They don’t have to keep anything on disk. Then they send it all back and forth between each other. Any time a user makes a request, we just wait for one of the slow nodes to come across the data and send it out.

We use the slowest wires for all the storage. It’s fool proof.

[–] sosodev@lemmy.world 49 points 2 days ago (2 children)

Somebody actually did make this as a joke years ago haha https://github.com/yarrick/pingfs

[–] dojan@lemmy.world 10 points 2 days ago (1 children)

I was brushing my teeth when reading this comment and inadvertently ended up swallowing all my toothpaste.

[–] TseseJuer@lemmy.world 2 points 1 day ago

don't forget to spit

load more comments (1 replies)
[–] mlg@lemmy.world 19 points 2 days ago (6 children)
[–] Ludrol@szmer.info 10 points 2 days ago (1 children)

as I stated in this comment it's not really feasible as to ~5s delay that was tested some time ago.

[–] Lemmchen@feddit.org 0 points 7 hours ago

That's the wrong comment.

load more comments (5 replies)
[–] M0oP0o@mander.xyz 22 points 2 days ago (1 children)

Too wet for server racks in the forest.

[–] sosodev@lemmy.world 10 points 2 days ago (1 children)
[–] M0oP0o@mander.xyz 5 points 2 days ago

Look I know there called "farms" but like I told the last forest gnome, the dank woods is no place to host data.

[–] poVoq 20 points 2 days ago* (last edited 2 days ago) (1 children)

Have you considered providing something like this: https://jortage.com/ and maybe contribute to their efforts to develop a specific API for that? Source code is here: https://github.com/jortage

[–] sosodev@lemmy.world 21 points 2 days ago (7 children)

Jortage is a really interesting approach. It definitely helps reduce the impact of the file hosting problem but it doesn’t fully address the underlying cost issue. The cost of storing files grows every month indefinitely while donations typically don’t.

I would like to see a file hosting pool come to lemmy though. So I will look into it. :)

[–] poVoq 11 points 2 days ago* (last edited 2 days ago)

Pict-rs that is used by Lemmy to store images already supports S3 type storage, so in theory it should work with Jortage, but I don't think anybody has tested that yet. The people behind Feddit.org might have experimented with it as they expressed interest a while back.

load more comments (6 replies)
[–] tehn00bi@lemmy.world 14 points 2 days ago (2 children)

Is file hosting really a must? I mean Reddit and feddit are basically forums. And not many forums allow file uploads. Also, we should have retention limits. Low value posts are allowed to fade away. High value posts that have some level of interaction stay alive longer.

[–] hendrik@palaver.p3x.de 31 points 2 days ago (1 children)

A lot of pictures and memes get posted here. And every other post shows a thumbnail picture. These images are all files.

[–] tehn00bi@lemmy.world 13 points 2 days ago (2 children)

Not denying that. But maybe we should accept that photos and memes and whatnot aren’t that valuable and limit their size or the volume allowed per user. Just a thought.

[–] hendrik@palaver.p3x.de 10 points 2 days ago* (last edited 2 days ago)

Yeah, I wonder if that would fly with the users. I just scrolled through my timeline and nearly every post has some colorful image to it. (except in Ask Lemmy and No Stupid Questions.) I'm not sure if users would accept this platform if it were mostly textual. And putting restrictions in place would certainly reduce the number of images. Scrolling through Lemmy would feel like Hackernews, not any modern social media platform. I doubt mainstream people appreciate that.

But yeah, that'd be possible. We could just close the meme communities for example. Or exclude them from individual instances to save some space there.

load more comments (1 replies)
load more comments (1 replies)
[–] abff08f4813c@j4vcdedmiokf56h3ho4t62mlku.srv.us 12 points 2 days ago (2 children)

You're not the first to think about this.

See https://aumetra.xyz/posts/the-fedi-ddos-problem - there an embed server is proposed, to be shared by multiple instances (ideally a great many would use just the one), which can host things like image files and previews.

load more comments (2 replies)
[–] moon@lemmy.cafe 6 points 2 days ago* (last edited 2 days ago) (2 children)

Personally I'm in the camp that I want history to be lost. That's part of the appeal to me. In fact my favorite feature in the fedi is Mastodon's option to enable auto-deleting posts of a certain age.

Only content that is explicitly pinned or reaches a certain amount of interactions should be saved imo. Since that's the stuff you'd actually want to preserve rather than the 99% of forgettable content, and it would also drastically cut down on file hosting.

Another thing is that a federation should only act as the exchange between users on ActivityPub. It should only cache relevant information and not be expected to store everything, like I wrote before. The user should be a portable account that is stored on a device. The federation server would sync your account between your devices, but not store it. You send your content to the federation, and then the federation sends it out into the world where they choose to do what they want with it. The federation shouldn't hoard it indefinitely.

Also this makes sense from a privacy perspective. If you care about privacy, why would you also want all your data indefinitely stored? Unless certain things are relevant and explicitly kept, it should be expected to expire and be lost by default. Where did we get this expectation that data should be stored forever? Also you expect it to be stored forever and not be trained on by AI?

This comment for example, after about a week or two most of the visibility and interaction of it will drop to zero. At that point, this comment should expire and no longer exist. I wrote this comment, it reached some people, and served it's purpose and should expire. I'm not going to pretend like this comment is some kind of historic document that should be indefinitely preserved, nor do I expect or want it to be.

[–] Lemmchen@feddit.org 1 points 7 hours ago (1 children)

This comment for example, after about a week or two most of the visibility and interaction of it will drop to zero. At that point, this comment should expire and no longer exist.

That's an incredible naive and egoistic take. Think about all the knowledge that is getting lost by applying this approach. How many times have you searched for some obscure thing and found the answer only on some five years old reddit post? That information would be lost for ever if you had your way.

[–] moon@lemmy.cafe 1 points 5 hours ago

I think the massive privacy benefits outweigh things like that, which should be documented properly anyways

[–] Ludrol@szmer.info 5 points 2 days ago (1 children)

Can you judge a work of art by it's virality? Should you judge by virality?

A lot of times in history artists got recognition they deserved only after their death. When they ware alive they lived in poverty struggling to make ends meet.

There is a lot of internet 1.0 preserved by internet archive that I didn't get to experience. There are flash games that I would love to preserve and show the next generation.

We wouldn't have known how Scotts Cawthon games have looked like before he made FNAF if not for the preservation efforts.

[–] MrMakabar 2 points 1 day ago

Usually those artist did get some recognition during their life, but never got into the main stream. That changed due to the main stream changing and the people who did like the art showing it again. That is actually rather easy to do with something like the Fediverse. It just requires a download option. Especially when everybody is aware, that the content will be deleted, that would be a decent option.

Also a lot of content on social media in general is very short term. Stuff like politcal discussions are fairly useless after a few months in most cases. So that can be deleted without much care and again, if somebody wants to preserve it, they easily can just download it.

[–] TORFdot0@lemmy.world 6 points 2 days ago (4 children)

Is a p2p system for media with the instances just hosting magnet links too slow for fediverse purposes? To me this seems like the most resilient way to handle media in a decentralized system

load more comments (4 replies)
[–] HulkSmashBurgers@reddthat.com 7 points 2 days ago (4 children)

I think IPFS could help the fediverse with storage.

load more comments (4 replies)
[–] jaggedrobotpubes@lemmy.world 7 points 2 days ago

This feels like something the Fediverse is ultimately going to build for itself. I know jack squat about the details, but it's gonna have to be a thing eventually, I think.

[–] tofuwabohu 7 points 2 days ago (1 children)

Interesting approach, good luck! Admittedly I'm not sure if many users want to take their media uploading in their own hands and pay for it but maybe I'm wrong. Where are the images stored? Do you have your own hardware? Backups etc?

Also since you're interested in Fediverse media storage, I recently read about https://jortage.com/ It's a third party storage for your instance with deduplication, pretty interesting idea. Takes away a bit of the federated part though

load more comments (1 replies)
[–] Valmond@lemmy.world 6 points 2 days ago

I think Tenfingers could be an interesting option as hosters do not know what they host, the data can be modified, and it's 100% decentralised.

load more comments
view more: next ›