this post was submitted on 01 Aug 2024

Technology



This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

[–] theangriestbird@beehaw.org 59 points 3 months ago (1 children)

The beef between Microsoft and Reddit came to light after I published a story revealing that Reddit is currently blocking every crawler from every search engine except Google, which earlier this year agreed to pay Reddit $60 million a year to scrap the site for its generative AI products.

I know the author meant "scrape", but sometimes it really does feel like AI is just scrapping the old internet for parts.

[–] cybermass@lemmy.ca 15 points 3 months ago (1 children)

Yeah, aren't like over half of reddit comments/posts by bots these days?

[–] originalucifer@moist.catsweat.com 13 points 3 months ago (1 children)

yep, and the longer that happens, the less value the dataset has. it's becoming aged.

[–] RiikkaTheIcePrincess@pawb.social 13 points 3 months ago* (last edited 3 months ago) (1 children)

[Joke] See, Reddit's doing a nice thing here! They're making sure nobody ends up toxifying their own dataset by using Reddit's garbage heap of bot posts!

[–] originalucifer@moist.catsweat.com 5 points 3 months ago (2 children)

google needs an 'ignore reddit' checkbox. i'm sick of having to manually add -reddit

[–] Cube6392@beehaw.org 13 points 3 months ago (1 children)

Hey good news. Turns out you can use bing and not get back Reddit results

yeah but then i get back bing results. no one needs that

There's a browser extension for that. It also works on Pinterest and other useless sites. https://iorate.github.io/ublacklist/docs