this post was submitted on 09 Jul 2023
499 points (97.0% liked)
Technology
59080 readers
4344 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I think this is exposing a fundamental conceptual flaw in LLMs as they're designed today. They can't seem to simultaneously respect intellectual property / licensing and be useful.
Their current best use case - that is to say, a use case where copyright isn't an issue - is dedicated instances trained on internal organization data. For example, Copilot Enterprise, which can be configured to use only the enterprise's data, without any public inputs. If you're only using your own data to train it, then copyright doesn't come into play.
That's been implemented where I work, and the best thing about it is that you get suggestions already tailored to your company's coding style. And its suggestions improve the more you use it.
But AI for public consumption? Nope. Too problematic. In fact, public AI has been explicitly banned in our environment.