this post was submitted on 30 Apr 2024
1438 points (98.9% liked)

Reddit

17641 readers
349 users here now

News and Discussions about Reddit

Welcome to !reddit. This is a community for all news and discussions about Reddit.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules


Rule 1- No brigading.

**You may not encourage brigading any communities or subreddits in any way. **

YSKs are about self-improvement on how to do things.



Rule 2- No illegal or NSFW or gore content.

**No illegal or NSFW or gore content. **



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Posts and comments which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts.

Provided it is about the community itself, you may post non-Reddit posts using the [META] tag on your post title.



Rule 7- You can't harass or disturb other members.

If you vocally harass or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



:::spoiler Rule 10- Majority of bots aren't allowed to participate here.

founded 1 year ago
MODERATORS
 

For the threads with the older one on the left: https://lemmy.world/post/14859950

(Thank you @Nelots@lemm.ee )

you are viewing a single comment's thread
view the rest of the comments
[–] bjorney@lemmy.ca -2 points 6 months ago (2 children)

If you have access to the entire Reddit comment corpus it's trivial to see which users are only reposting carbon copies of content that appears elsewhere on the site

[–] criitz@reddthat.com 11 points 6 months ago (2 children)

It's probably not as easy as you imagine for reddit to identify and cleanse all bot content.

[–] livus@kbin.social 2 points 6 months ago

Of course it's not. Nor do they want to.

I think the person you're talking to thinks all bots are like the easy ones in this screenshot.

[–] bjorney@lemmy.ca -3 points 6 months ago* (last edited 6 months ago) (1 children)

Look at the picture above - this is trivially easy. We are talking about identifying repost bots, not seeing if users pass/fail the Turing test

If 99% of a user's posts can be found elsewhere, word for word, with the same parent comment, you are looking at a repost bot

[–] criitz@reddthat.com 5 points 6 months ago

That's easy in an isolated case like this, but the reality of the entire reddit comment base is much more complex.

[–] livus@kbin.social 4 points 6 months ago* (last edited 6 months ago) (1 children)

The low level bots in OPs screenshot, sure, because it's identical. Not the rest.

I used to hunt bots on reddit for a hobby and give the results to Bot Defense.

Some of them use rewrites of comments with key words or phrases changed to other words or phrases from a thesaurus to avoid detection. Some of them combine elements from 2 comments to avoid detection. Some of them post generic comments like 💯. Doubtless there are some using AI rewrites of comments now.

My thought process is if generic bots have been allowed to go so rampant they fill entire threads that's an indication of how bad the more sophisticated bot problem has become.

And I think @phdepressed is right, no one at reddit is going to hunt these sophisticated bots because they inflate numbers. Part of killing the API use was to kill bot detection after all.

[–] bjorney@lemmy.ca 0 points 6 months ago* (last edited 6 months ago)

Reddit has way more data than you would have been exposed to via the API though - they can look at things like user ARN (is it coming from a datacenter), whether they were using a VPN, they track things like scroll position, cursor movements, read time before posting a comment, how long it takes to type that comment, etc.

no one at reddit is going to hunt these sophisticated bots because they inflate numbers

You are conflating "don't care about bots" with "don't care about showing bot generated content to users". If the latter increases activity and engagement there is no reason to put a stop to it, however, when it comes to building predictive models, A/B testing, and other internal decisions they have a vested financial interest in making sure they are focusing on organic users - how humans interact with humans and/or bots is meaningful data, how bots interact with other bots is not