> I would find it a big stretch to say Github's intent here is to illegally dist...

brookst · on Nov 3, 2022

> Almost everything on GitHub is subject to copyright,

Agreed. Like I said, it's about intent. Can anyone say with a straight face that copilot is an elaborate scheme to profit by duplicating copyrighted work?

I don't think the defense is that it wasn't trained on copyrighted data. It obviously was.

I think the defense is that anything, including a person, that learns from a large corpus of copyrighted data will sometimes produce verbatim snippets that reflect their training data.

So when it comes to copyright infringement, are we moving the goalposts to where merely learning from copyrighted material is already infringement? I'm not sure I want to go there.