this post was submitted on 29 Aug 2024
54 points (71.1% liked)

Technology

59381 readers
4079 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] lvxferre@mander.xyz 12 points 2 months ago (2 children)

Yeah. But most people “write” online like they speak…

That's a common misconception.

While your written and spoken varieties do interact a fair bit, no, people don't "write like they speak". Not even online.

And that is not simply an "ackshyually". A lot of AAVE features simply don't transpose into writing - like prosody, non-rhoticity, /ɪ/-breaking, /äɪ/-monophtongisation... at most you can consciously approximate them into writing, but they won't be there.

If people followed rules about language, yeah, vernacular would just be spoken speech. But that’s not how it works. The rules are made to reflect what people are doing.

That is not about people following/not following "rules", it's about nomenclature - it's exactly the reason why "AAE" and "AAVE" are necessary as separated terms.

[–] treefrog@lemm.ee -2 points 2 months ago (1 children)

More and more people are using speech to text. And it does show how differently people speak than write (apparently I never say my be in because, for example).

But it also means that llms aren't only being fed text, but also speech converted into text.

[–] lvxferre@mander.xyz 4 points 2 months ago

For me it's like "holy fuck... do I eat so fucking many vowels???" It reaches a point that I eventually gave up using text-to-speech with Portuguese in my cell phone, I go straight for Italian because at least then it gets me right.

But it also means that llms aren’t only being fed text, but also speech converted into text.

That might be part of the issue causing the bias shown in the article.