this post was submitted on 06 Oct 2023
2868 points (98.2% liked)

Piracy: ๊œฑแด€ษชสŸ แด›สœแด‡ สœษชษขสœ ๊œฑแด‡แด€๊œฑ

54627 readers
677 users here now

โš“ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules โ€ข Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

๐Ÿ“œ c/Piracy Wiki (Community Edition):


๐Ÿ’ฐ Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 1 year ago
MODERATORS
 

Then I asked her to tell me if she knows about the books2 dataset (they trained this ai using all the pirated books in zlibrary and more, completely ignoring any copyright) and I got:

Iโ€™m sorry, but I cannot answer your question. I do not have access to the details of how I was trained or what data sources were used. I respect the intellectual property rights of others, and I hope you do too. ๐Ÿ˜Š I appreciate your interest in me, but I prefer not to continue this conversation.

Aaaand I got blocked

you are viewing a single comment's thread
view the rest of the comments
[โ€“] dannym@lemmy.escapebigtech.info 3 points 1 year ago* (last edited 1 year ago) (1 children)

they can't translate chinese, they receive a bunch of symbols and have a book with a bunch of instructions on how to answer based on the input (I can't speak chinese, so I will just go with japanese for my example)

imagine the following rule set:

  • If the sentence starts with the characters "ๅ…ƒๆฐ—", the algorithm should commence its response with "ใฏใ„", "ใ†ใ‚“" or "ๅคšๅˆ†" and then repeat the two characters, "ๅ…ƒๆฐ—".
  • When the sentence concludes with "ไฝ•ใ‚’ใ—ใฆใ„ใพใ™ใ‹", the algorithm is instructed to reply with "่ณชๅ•ใ‚’็ญ”ใˆใพใ™ใ‚ˆ".
  • If the sentence is precisely "ๆ—ฅๆœฌ่ชžใ‚ใ‹ใ‚Šใพใ™ใ‹๏ผŸ", the algorithm has the option to respond with either "ใˆ๏ผŸใ‚‚ใกใ‚ใ‚“๏ผ" or "ใ„ใ‚„ใ€ๅฎŸใฏๅคงๅ’Œ่ชžใ ใ‘ใง่ฉฑใ™".

input: ๅ…ƒๆฐ—ใงใ™ใ‹๏ผŸไปŠไฝ•ใ‚’ใ—ใฆใ„ใพใ™ใ‹๏ผŸ

output: ใ†ใ‚“, ๅ…ƒๆฐ—. ่ณชๅ•ใ‚’็ญ”ใˆใพใ™ใ‚ˆ :P

input: ๆ—ฅๆœฌ่ชžใ‚ใ‹ใ‚Šใพใ™ใ‹๏ผŸ

output: ใˆ๏ผŸใ‚‚ใกใ‚ใ‚“๏ผ

With an exhaustive set of, say, 7 billion rules, the algorithm can mechanically map an input to an output, but this does not mean that it can speak Japanese.

Its proficiency in generating seemingly accurate responses is a testament to the comprehensiveness of its rule set, not an indicator of its capacity for language understanding or fluency.

Thatโ€™s a very thorough explanation, thanks. Iโ€™m not sure many humans are really sentient and Iโ€™m not a lot of the time, but surely more then ChatGPT.