this post was submitted on 04 Aug 2023
71 points (92.8% liked)

Technology

34920 readers
176 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] eager_eagle@lemmy.world 25 points 1 year ago* (last edited 1 year ago) (6 children)

I like the experience using Copilot and GPT much better than browsing SO, but this is what worries me in the long-term though:

This issue goes beyond the survival of Stack Overflow. All AI models need a steady flow of quality human data to train on. Without that, they'll be left to rely on machine-generated content, and researchers have found that this leads to worse performance. There's an ominous name for this: model collapse.

Without this incredible knowledge sharing and curated feedback, in an environment that constantly changes with new libraries, languages, and best practices, these LLMs are doomed. I think solving this might be Stack Overflow's way out.

[–] dbilitated@aussie.zone 3 points 1 year ago (2 children)

if we use embedding and the language documentation, I wonder how much it can work out going forward?

[–] bionicjoey@lemmy.ca 2 points 1 year ago

Nothing because language models don't understand the text they read.

[–] eager_eagle@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

From what we see today based on these LLMs that are given a larger context (e.g. internal documentation or knowledge bases), we can say that it'd be as good as a decent developer that reads said documentation and it's able to apply that knowledge to a specific use case.

But Stack Overflow answers often target things that don't come up in the docs, that are outdated, or somewhat case-dependent and/or opinionated. Answers that might even lead to changes in documentation. This kind of insight will be hampered over time without a way of continuously sharing such knowledge.

load more comments (3 replies)