this post was submitted on 11 Jun 2023
556 points (100.0% liked)

Lemmy.World Announcements

29026 readers
4 users here now

This Community is intended for posts about the Lemmy.world server by the admins.

Follow us for server news 🐘

Outages 🔥

https://status.lemmy.world

For support with issues at Lemmy.world, go to the Lemmy.world Support community.

Support e-mail

Any support requests are best sent to info@lemmy.world e-mail.

Report contact

Donations 💗

If you would like to make a donation to support the cost of running this platform, please do so at the following donation URLs.

If you can, please use / switch to Ko-Fi, it has the lowest fees for us

Ko-Fi (Donate)

Bunq (Donate)

Open Collective backers and sponsors

Patreon

Join the team

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] ewe@lemmy.world 5 points 1 year ago (1 children)

The problem with this line of thinking is "why is 3rd party API data any less valuable than 1st party API data for AI training?" While this may be true, I don't see this particular move being motivated by AI. They still have all the API calls and interactions even if they aren't being made by Reddit's own apps.

[–] Doggylife@lemmy.world 4 points 1 year ago (2 children)

I may be wrong here, but what I mean is they have ways to stop LLM companies from web scraping all of Reddit. The only other way the likes of chatGPT can get all the info is through to API which is currently free. So I think Reddit might be doing is saying this information isn't free so pay X amount for access to our data.

Obviously 3rd party apps like Apollo won't pay that, but Google and OpenAi probably will.

I'm not too sure what you mean by api data being worth less or more, it's all the same data.

[–] jelos98@lemmy.world 4 points 1 year ago* (last edited 1 year ago)

Almost certainly true. Historically, you'd just grab the dumps from pushshift.

https://www.reddit.com/r/modnews/comments/134tjpe/reddit_data_api_update_changes_to_pushshift_access/

"TL;DR: Pushshift is in violation of our Data API Terms and has been unresponsive despite multiple outreach attempts on multiple platforms, and has not addressed their violations. Because of this, we are turning off Pushshift’s access to Reddit’s Data API, starting today"

[–] ewe@lemmy.world 3 points 1 year ago

This makes sense. I get the argument now. Thanks!