Opensource

1298 readers

134 users here now

A community for discussion about open source software! Ask questions, share knowledge, share news, or post interesting stuff related to it!

Credits

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

⠀

founded 1 year ago

MODERATORS

pylapp@programming.dev

The Open Source Initiative Announces the Release of the Industry’s First Open Source AI Definition (opensource.org)

submitted 1 day ago by neme@lemm.ee to c/opensource@programming.dev

4 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] trevor@lemmy.blahaj.zone 3 points 4 hours ago (1 children)

Thanks for sharing the actual license text.

To me, this stinks of companies knowing that if they're actually required to reproduce the data, they'll get hit with copyright infringement or other IP-related litigation. Whereas if they can just be trusted to very honestly list their sources, they can omit the sources they weren't authorized to steal and reproduce content from, they can get away with it.

I think that, in practice, this means that the industry standard will be to lie and omit the incriminating data sources, and when someone tries to reproduce the model they won't actually be able to, but they also won't be able to easily prove one way or another if data was withheld.

Really, what should (but won't) happen, is that we should fix our broken IP laws and companies should be held to account for when they engage in behavior that would be prosecuted as piracy or Computer Fraud and Abuse if you or I did it.

AI is pretty much the epitome of companies getting to act with impunity in the eyes of the law and exerting that power over everyone else, and it's annoying to see it get a blessing from an "open source" organization.

[–] starshipwinepineapple@programming.dev 2 points 3 hours ago

Right, the other thing i considered is that you could just create a company and "buy" the data from them for a ridiculous amount of money and then you have less requirement to detail the data. Similarly you could deem the data unsharable and fudge the provenance.

Like locks, it will only keep honest people honest.