memes

9680 readers

3219 users here now

Community rules

1. Be civil

No trolling, bigotry or other insulting / annoying behaviour

2. No politics

This is non-politics community. For political memes please go to !politicalmemes@lemmy.world

3. No recent reposts

Check for reposts when posting a meme, you can only repost after 1 month

4. No bots

No bots without the express approval of the mods or the admins

5. No Spam/Ads

No advertisements or spam. This is an instance rule and the only way to live.

Sister communities

!tenforward@lemmy.world : Star Trek memes, chat and shitposts
!lemmyshitpost@lemmy.world : Lemmy Shitposts, anything and everything goes.
!linuxmemes@lemmy.world : Linux themed memes
!comicstrips@lemmy.world : for those who love comic stories.

founded 1 year ago

MODERATORS

Tenthrow@lemmy.world

The_Picard_Maneuver@lemmy.world

The_Picard_Maneuver@startrek.website

430

With all the talk about privacy, people seem to be forgetting that censorship is also a major problem for today's internet. (lemmy.world)

submitted 7 months ago by renzev@lemmy.world to c/memes@lemmy.world

102 comments fedilink hide all child comments

Many "alternative" search engines are better for privacy, but they are still vulnerable to censorship, because they rely on g**gle and m*crosoft's indices for their search results. This isn't a deep-hidden secret either, many of them disclose what search index they use on the "about" page, for example:

There are still search engines that (claim to) maintain their own index. Most surprisingly, br*ve:

https://brave.com/search-independence/

you are viewing a single comment's thread
view the rest of the comments

[–] Mojeek@lemmy.ml 3 points 7 months ago* (last edited 7 months ago)

if you look at the repo they give thanks to:

"The commoncrawl organization for crawling the web and making the dataset readily available. Even though we have our own crawler now, commoncrawl has been a huge help in the early stages of development."

There is nothing I can find which says how much of the index is CC and how much is their own; if there's a decent amount of CC, this is originally for researchers etc. it's not the best resource in the world for a search index: https://commoncrawl.org/

That being said, as an independent search engine, it's always good to see people take on the massive task of actually building an index, not becoming a proxy.