this post was submitted on 05 Jul 2023
864 points (98.5% liked)

World News

32088 readers
1181 users here now

News from around the world!

Rules:

founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Veltoss@lemmy.world 82 points 1 year ago (11 children)

How does Pinterest get around this then? They pollute image searches like crazy, and require you to login to see anything. At least they did, I blocked them from searches so maybe it's different now.

[–] gressen@lemmy.world 14 points 1 year ago (6 children)

Easy - detect if you're getting accessed by a search crawler or a human. Serve a full page or just a login request.

[–] RGB3x3@lemmy.world 11 points 1 year ago (5 children)

So how can a user pretend to be a web crawler?

[–] dangrousperson@vlemmy.net 7 points 1 year ago

Ever heard of https://12ft.io/ ? It allows you to bypass alot of pay walls by basically pretending to be a search engine trying to index a website. For SEO reasons a lot of pay walled sites allow search engines to access the whole article to index. 12ft.io leverages this to show you whole articles behind paywalls. This is something you could also achieve by spoofing the User-Agent. It would probably work for things like Pinterest without an account as well, but that's something I have never tried (since I have no interest in the cancer that is Pinterest).

load more comments (4 replies)
load more comments (4 replies)
load more comments (8 replies)