this post was submitted on 04 Sep 2023
39 points (100.0% liked)
Technology
37724 readers
615 users here now
A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.
Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.
Subcommunities on Beehaw:
This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
What we (people in general, that use the internet, regardless of government/country) need, in large part, is literacy. Not "gen AI literacy" or "media literacy", but simply "literacy".
I'm saying this because of a lot of the output of those text generators: says a lot without conveying much, it connects completely unrelated concepts because they happen to use similar words, it makes self-contradictory claims, things like this. And often its statements are completely unrelated to the context at hand. People with good literacy detect those things right off the bat, but people who struggle with basic reading comprehension don't.
The thing that strikes me about LLMs is that they have been created to chat. To converse. They’re partly influenced by Turing tests where the objective is to convince someone you’re human by keeping up a conversation. They weren’t designed to create meaningful content or factual content.
People still seem to want to use chat GPT to create something, and fix the accuracy as a second step. I say go back to the drawing board and create a tool that analyses statements and tries to create information based on trusted linked open data sources.
Discuss :)
LLMs are not created to chat, they're literally what the name says - language models. They are very complex statistical models of the joint causal probability of all possible words given the previous words in the context window. There's a common misconception that they're "made for chat" by the wider public because ChatGPT was the first "killer application", but they are much more general than that. What's so profound about LLMs to AI and NLP engineers is that they're general purpose. That is, given the right framework they can be used to complete any task expressible in natural language. It's hard to convey to people just how powerful that is, and I haven't seen software engineers really figure this out yet either. As an example I keep going back to, I made a library to create "semantic functions" in Python which look like this:
That is the entire function, expressed in the docstring. 10 months ago, this would’ve been literally impossible. I could approximate it with thousands of lines of code using SpaCy and other NLP libraries to do NER, maybe a dictionary of known names with fuzzy matching, some heuristics to rule out city names or more advanced sentence structure parsing for false positives, but the result would be guaranteed to be worse for significantly more effort. Here, I just tell the AI to do it and it… does. Just like that. But you can’t hype up an algorithm that does boring stuff like NLP, so people focus on the danger of AI (which is real, but laymen and news focus on the wrong things), how it’s going to take everyone’s jobs (it will, but that’s a problem with our system which equates having a job to being allowed to live), how it’s super-intelligent, etc. It’s all the business logic and doing things that are hard to program but easy to describe that will really show off its power.
Thanks for your reply, I appreciate the correction and the info.
I hope my reply didn't come off as too caustic - I thought your reply with an open request for discussion was refreshing regardless of the common misconception. You're not bad for being wrong, and I do enjoy sperging about these things. I didn't intend to demean you, just in case it came across like that (if not just ignore this - I guess I'm overthinking it 🤔).