>... former GitHub CEO Nat Friedman claimed during the Copilot technical preview that “training [machine-learning] systems on public data is fair use”.
>
> Well—is it? The answer isn’t a matter of opinion; it’s a matter of law. Naturally, Microsoft, OpenAI, and other researchers have been promoting the fair-use argument. Nat Friedman further asserted that there is “jurisprudence” on fair use that is “broadly relied upon by the machine[-]learning community”. But Software Freedom Conservancy disagreed, and pressed Microsoft for evidence to support its position. According to SFC director Bradley Kuhn—
>
> "[W]e inquired privately with Friedman and other Microsoft and GitHub representatives in June 2021, asking for solid legal references for GitHub’s public legal positions … They provided none."
>
> Why couldn’t Microsoft produce any legal authority for its position? Because SFC is correct: there isn’t any. Though some courts have considered related issues, there is no US case squarely resolving the fair-use ramifications of AI training.
>
> Furthermore, cases that turn on fair use balance multiple factors. Even if a court ultimately rules that certain kinds of AI training are fair use—which seems possible—it may also rule out others. As of today, we have no idea where Copilot or Codex sits on that spectrum. Neither does Microsoft nor OpenAI.
James Salsman
unread,
Nov 11, 2022, 6:29:21 PM11/11/22
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to licenses-m...@creativecommons.org
The GitHub Copilot class action suit got filed; see the same link from
October below. Microsoft's competitors seem to share a similar willful
dismissal of licensing restrictions, e.g., "It currently is based on
open-source large language models trained on public data." --
https://docs.replit.com/ghostwriter/faq
I see that CC is hiring fundraising staff, so if there's anything I or
anyone else can do to help as a volunteer, please let us know. I
recently started following
https://eval.ai/web/challenges/challenge-page/1866/overview and it
occured to me that CC could use the same system to crowdsource banner
ad text for their site with the same evaluation and leaderboard
system. Would anyone like to collaborate on support for that?
> >... former GitHdub CEO Nat Friedman claimed during the Copilot technical preview that “training [machine-learning] systems on public data is fair use”.
James Salsman
unread,
Nov 11, 2022, 9:14:09 PM11/11/22
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message