Hacker News new | past | comments | ask | show | jobs | submit login

> I would find it a big stretch to say Github's intent here is to illegally distribute copyrighted code.

Almost everything on GitHub is subject to copyright, except for some very old works (maybe something written by Ada Lovelace?), and US government works not eligible for copyright.

Now, many of the works there are also licensed under permissive licenses, but that is only a defense to copyright infringement if the terms of those licenses are being adequately fulfilled.




> Almost everything on GitHub is subject to copyright,

Agreed. Like I said, it's about intent. Can anyone say with a straight face that copilot is an elaborate scheme to profit by duplicating copyrighted work?

I don't think the defense is that it wasn't trained on copyrighted data. It obviously was.

I think the defense is that anything, including a person, that learns from a large corpus of copyrighted data will sometimes produce verbatim snippets that reflect their training data.

So when it comes to copyright infringement, are we moving the goalposts to where merely learning from copyrighted material is already infringement? I'm not sure I want to go there.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: