this post was submitted on 18 Jun 2023
286 points (100.0% liked)

Fediverse

19 readers
2 users here now

This magazine is dedicated to discussions on the federated social networking ecosystem, which includes decentralized and open-source social media platforms. Whether you are a user, developer, or simply interested in the concept of decentralized social media, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as the benefits and challenges of decentralized social media, new and existing federated platforms, and more. From the latest developments and trends to ethical considerations and the future of federated social media, this category covers a wide range of topics related to the Fediverse.

founded 2 years ago
 

I had been having trouble getting meaningful results from the fediverse on Google, and after seeing this post, it seems I'm not the only one. So, I created a site that helps search the fediverse in your search engine of choice (it currently supports Google, Bing, Yahoo, DuckDuckGo, and Dogpile).

Due to query limitations with most search engines, it currently only searches the top 15 lemmy/kbin instances, but I've tested it and it seems to provide access to a good chunk of fediverse content. The exception is Google, which should be far more reliable overall as well as providing the ability to search Mastodon and PeerTube.

If you have contributions or ideas for improvement, feel free to check out the project here or shoot me a message. Hope this helps people! :)

https://fedi-search.com/

Edit: Update in progress including improved search queries and support for Mastodon/PeerTube (Google only, unfortunately)

Edit 2: Update is live, along with a dedicated domain name. If the website doesn't look any different for you, try Ctrl+F5 or clearing site data - it seems some browsers are caching the old page.

top 40 comments
sorted by: hot top controversial new old
[–] tal@kbin.social 19 points 1 year ago* (last edited 1 year ago) (3 children)

In all seriousness, Google needs to get on providing an easier way to specify that a search should hit the Fediverse. site:reddit.com works for Reddit, but there is presently no analogous operator on Google's search for a distributed system that spans many domains.

I mean, it's great that you've made this, don't get me wrong, but they really should do that as well.

[–] 0x1C3B00DA@kbin.social 3 points 1 year ago

but there is presently no analogous operator on Google's search for a distributed system that spans many domains.

Because that's just a basic search. A search engine searches across multiple domains by default. If you're specifically looking for only results from ActivityPub enabled services, that's pretty much an impossibility since there's no way to know (from a web crawl) if a page is served by a server that supports ActivityPub. Another problem is that a lot of fediverse instances purposefully block search engine crawlers because they don't want to appear in search results.

[–] Polymath@lemmy.dbzer0.com 1 points 1 year ago

I like the idea of scrapping Google altogether, and just having "better" search engines here that account for federated decouplings/distributions

Not entirely the same, but I switched over to Presearch a year or so ago, just to get away from Google and the "big tech" corporations

[–] TGRush@forum.fail 8 points 1 year ago (2 children)

Hm, I find it somewhat annoying that right now, this is not really searching the Fediverse, but rather what we've come to call "the Threadiverse", which is all about Reddit-like content aggregators.

In other words, I'd love an option to search different kinds of content, like instead of Threadiverse-stuff searching the most popular mastodon, misskey, or pleroma instances just to name a few.

[–] TenorTheHusky@kbin.social 6 points 1 year ago (1 children)

I'm hoping to expand the project to hopefully be a bit more robust - I'll definitely keep this on my radar

[–] TGRush@forum.fail 3 points 1 year ago

Thank you so much for the consideration! <3

[–] Kichae@kbin.social 5 points 1 year ago (1 children)

Searching Mastodon is a bit of a.... contentious issue. A lot of smaller Mastodon-based sites are full of traumatized vulnerable people who really just want to do their own thing, and they'll rattle cages if they find out someone's indexing their sites or posts. If anyone's making third party search tools, it's best to be careful to respect discoverability and indexing flags.

[–] stochasticity@lemmy.world 1 points 1 year ago (1 children)

I find this to be incredibly fair, but also makes it much harder to dive into the fediverse. Where is the middle ground do you think?

[–] Kichae@kbin.social 3 points 1 year ago (1 children)

Mastodon has flags for opting in to discoverability features (being featured in the profile directory, and having posts be searchable via Mastodon's search bar) and for search engine indexing (for Google, bing, etc.).

Just don't return posts from users that have opted out of those, and things should be mostly ok.

[–] 0x1C3B00DA@kbin.social 1 points 1 year ago

Just don't return posts from users that have opted out of those, and things should be mostly ok.

This is the main problem I see. User settings are part of the mastodon API. If you're building a general-purpose search engine, you use a crawler to index pages and your crawler has no idea those flags even exist.

[–] EmpathicVagrant@lemmy.world 6 points 1 year ago

Won’t let me upvote so I’m commenting to show love instead.

[–] lixus98@kbin.social 6 points 1 year ago

Pretty cool, thank you!

Good work bruv

[–] wave_walnut@kbin.social 5 points 1 year ago

Simple and handy, thanks!

[–] arkcom@kbin.social 5 points 1 year ago

add kbin-link to your browser to make this even better!

[–] Horik@kbin.social 5 points 1 year ago

Nice UI, thx.

[–] dosidosankofa@kbin.social 5 points 1 year ago

dig it, works great

[–] babelspace@kbin.social 5 points 1 year ago* (last edited 1 year ago) (1 children)

Awesome. Though I notice very little shows up from kbin.social; content I know is there is missing when I search for it. That may have more to do with the recency of the site growth or the cloudflare protection that was up a few days ago.

[–] TenorTheHusky@kbin.social 5 points 1 year ago

I would guess that it is the cloudflare protection, since that will have prevented crawlers from indexing the site while it was enabled.

[–] techviator@kbin.social 5 points 1 year ago* (last edited 1 year ago) (1 children)

Very cool! Thanks!
Suggestion: add Brave Search (search.brave.com) as an option as well. It's a smaller search engine but they have their own index and does not track users.

[–] TenorTheHusky@kbin.social 3 points 1 year ago* (last edited 1 year ago)

Will do o7

Edit: It seems Brave doesn't support chaining site specifiers, so my current method won't work with their search

[–] WebCrawlerReborn@kbin.social 4 points 1 year ago

Simple and to the point. Nice work!

[–] VE3VVS@kbin.social 3 points 1 year ago

very nicely done!

[–] Tywele@feddit.de 3 points 1 year ago

Would be great to have ecosia.org as an option for the search engine :)

[–] LetThereBeDwight@lemmy.world 3 points 1 year ago

Seems like you could probably use this strategy and get rid of the limits by turning this into an extension that would tack on the site list to the search directly(though, I'm unsure if there are such limits directly via the search box on Google or whomever).

I'd also, just from a code quality perspective, bust the list out into it's own property (which could later become smarter), and build the query string out at runtime.

[–] JurassicPork@lemmy.one 3 points 1 year ago

Amazing! Great job! And thanks!

[–] brasilikum 2 points 1 year ago

Down for me

[–] --@kbin.social 2 points 1 year ago
[–] Icarus@lemmy.ml 2 points 1 year ago (1 children)

have you considered redirecting to whoogle or searx ? you can use farside.link which automatically redirects to a working instance. it would be helpful for people who want to avoid using Google

[–] TenorTheHusky@kbin.social 3 points 1 year ago* (last edited 1 year ago)

I will absolutely check this out, thanks for the suggestion!

Edit: I can't get searx working at all for me, but I went ahead and implemented whoogle support

[–] Utzmetah@feddit.it 1 points 1 year ago

Thank you! good idea!

[–] redhydride@lemmy.ml 1 points 1 year ago

That's fantastic! Thank you!

[–] Poot@sh.itjust.works 1 points 1 year ago

Thank you!!

[–] dekkzz76@emacs.ch 0 points 1 year ago (1 children)
[–] TenorTheHusky@kbin.social 4 points 1 year ago (1 children)

Each one has its upsides and downsides. Kbin's search is more convenient for certain use cases, but it suffers from the same problem as Reddit's search - it always sorts by new no matter what. In addition, it can only index instances with which kbin has federated. FediSearch should in theory index every instance, regardless of federation status, and do so in such a way that the most useful posts show first (assuming Google does its job).

[–] dekkzz76@emacs.ch 1 points 1 year ago

@TenorTheHusky

google is crap at searching the fediverse

load more comments
view more: next ›