this post was submitted on 28 Jan 2024
1042 points (97.6% liked)

Programmer Humor

19589 readers
1717 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 1 year ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] tatterdemalion@programming.dev 79 points 9 months ago* (last edited 9 months ago) (22 children)

It literally cannot come up with novel solutions because it's goal is to regurgitate the most likely response to a question based on training data from the internet. Considering that the internet is often trash and getting trashier, I think LLMs will only get worse over time.

[–] ArrogantAnalyst@feddit.de 28 points 9 months ago (1 children)

Also the more the internet is swept with AI generated content, the more future datasets will be trained on old AI output rather than on new human input.

[–] tatterdemalion@programming.dev 16 points 9 months ago (1 children)

Humans are also now incentivized to safeguard their intellectual property from AI to keep a competitive advantage.

[–] Spaghetti_Hitchens@kbin.social 7 points 9 months ago* (last edited 9 months ago) (3 children)

What are some strategies for doing that? (This is me, totally not a bot)

[–] 0xD@infosec.pub 4 points 9 months ago
[–] FractalsInfinite@sh.itjust.works 2 points 9 months ago

Lets see, since the goal is to prevent webscaping all these should work: paywalls, account only acsess, text obferscation (e.g. using a custom font that maps letters randomly to other ones so it looks fine but to a webscraper it looks like gibberish), HTML obferscation (inserting random characters in the HTML then hiding them using CSS) and many more.

load more comments (20 replies)