Meta torrented over 81.7TB of pirated books to train AI, authors say

https://arstechnica.com/tech-policy/2025/02/meta-torrented-over-81-7tb-of-pirated-books-to-train-ai-authors-say/

by esporx

7 comments
  1. – … but which books on libgen did you pirate?

    – yes

  2. So how many copyright infringement letters did their ISP send them?? /s

  3. So big corporations say pirating is okay to do.

  4. How many books about pirates are there?

    ![gif](giphy|Ma9YUiOM7bqZW)

  5. Companies only ever care about copywrite laws when its to their benefit. I see little reason I should have a different perspective.

  6. I think they’ve all done it. I’ve been surprised time and time again on on how well chatgpt can generate quizzes on niche books. There is no way it would have been able to without having sucked it all up for its training data.

Comments are closed.