Great. Now all the AI bots will just be saying "This" and "came here to say that."
"ChatGPT, solve this problem for me."
"As an AI language model, username checks out."
"poop knife xDD hehe"
There's so much actually great content posted across reddit over the years, it blows my mind that people decided that was something that needed to be mentioned all the time.
You just did that yourself lol
Thanks for the gold kind stranger
Well... We all knew that was coming. If you still have an account and haven't done so yet, now's a good time to purge it!
Unless you live in the EU or California, odds are that just deletes the public data. I’m sure Reddit retains it and would sell it.
Reddit account data has been training AI for over a decade. If you ever used it, you're already in a training set.
Better yet, use an overwrite script to help turn their training models to jelly
That will remove your account from public view, but will it remove it from the data they use for AI training?
If not, you’re just enhancing the value of their proprietary data.
I'd be very surprised if comments weren't versioned in some way, so even if you delete or rewrite that data, it's probably still there and a part of training data.
So they're cashing in by selling other people's conversations.
Yeah.
FYI: reddit orphans content. In other words your posts/comments are undeletable.
I found instances of such late last year by way of search results. I clicked a username to see more posts by that account. The only content on their profile page was a final deletion message about the API changes.
Their post history was discoverable by using " site:reddit.com" on Google. All of their posts/comments still show up under their username instead of the normal [deleted]. Clicking the username takes you to their empty profile page.
So what we know from this now is that reddit has been saving original submissions, whereas before their claim was that only the last edits are stored. That is why the deletion scripts became a thing: people took it on good faith that we could delete our posts. At some point they stopped doing that. Or perhaps it was all a lie the whole time. Who knows.
Imagine AI mods trained by reddit mod models.
I would love a GPT model that just replies to every prompt with "Y'all can't behave."
Y'all need Jesus
You know how artists can poison their images for AI... We need a way to poison content on Reddit
I would say most of the content is already poison.
i think that's just called posting on reddit
If you're talking about Glaze or Nightshade, those techniques are not proven to be particularly effective. Lots of people want them to work but that doesn't make it so.
Shit posts would do it. That'll turn the AIs into morons who spurt out "rizz" and "skibidi" instead of anything useful
There's nothing stopping anyone from doing the same with Lemmy posts though, is there?
Alt title: Reddit looking to steal value from their millions of free users
Fuck /u/spez
It's a power grab. They'll justify control over the training data with intellectual property which keeps it out of the hands of everyday people but they stole the "intellectual property" from us in the first place. Then they'll control the "means of generation".
I'm Jack's complete lack of surprise.
This is why I deleted my posts. Also, 60 million is a laughably lowball figure.
I wonder how reddit users feel about this. I wouldn't know, I DNS blocked the site months ago.
This is going to produce the saltiest AI the world has ever seen.
"Hey reddit ai, give me an idea how to balance my budget and pay my student debt, mate." "Here ya go, I got a noose for you. Also I'm not your mate, dude."
"Hey reddit ai, draw for me a house with a genz family." "Here ya go." "Hey reddit ai, why did you show me a pic of a highway ramp with homeless people?"
Just think, in 1000 years your body will be long dead but you'll be forced to live on as a poster! Death is not an option. 😌
what could go wrong with training your ai based on the posts of the most racist and misogynistic people on the internet?
My god they'll create a super redditor
It’s not 4chan… but someone did train one of those once.
Damn Facebook is owned by Reddit now?
That varies by subreddit, which might actually help in training LLMs to recognize the difference.