{"id":207295,"date":"2025-06-23T09:15:21","date_gmt":"2025-06-23T09:15:21","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/207295\/"},"modified":"2025-06-23T09:15:21","modified_gmt":"2025-06-23T09:15:21","slug":"study-meta-ai-model-can-reproduce-almost-half-of-harry-potter-book-2","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/207295\/","title":{"rendered":"Study: Meta AI model can reproduce almost half of Harry Potter book"},"content":{"rendered":"<p>The Google Books precedent probably can\u2019t protect Meta against this second legal theory because Google never made its books database available for users to download\u2014Google almost certainly would have lost the case if it had done that.<\/p>\n<p>In principle, Meta could still convince a judge that copying 42 percent of Harry Potter was allowed under the flexible, judge-made doctrine of fair use. But it would be an uphill battle.<\/p>\n<p>\u201cThe fair use analysis you&#8217;ve gotta do is not just \u2018is the training set fair use,\u2019 but \u2018is the incorporation in the model fair use?\u2019&#8221; Lemley said. &#8220;That complicates the defendants&#8217; story.\u201d<\/p>\n<p>Grimmelmann also said there\u2019s a danger that this research could put open-weight models in greater legal jeopardy than closed-weight ones. The Cornell and Stanford researchers could only do their work because the authors had access to the underlying model\u2014and hence to the token probability values that allowed efficient calculation of probabilities for sequences of tokens.<\/p>\n<p>Most leading labs, including OpenAI, Anthropic, and Google, have increasingly restricted access to these so-called logits, making it more difficult to study these models.<\/p>\n<p>Moreover, if a company keeps model weights on its own servers, it can use filters to try to prevent infringing output from reaching the outside world. So even if the underlying OpenAI, Anthropic, and Google models have memorized copyrighted works in the same way as Llama 3.1 70B, it might be difficult for anyone outside the company to prove it.<\/p>\n<p>Moreover, this kind of filtering makes it easier for companies with closed-weight models to invoke the Google Books precedent. In short, copyright law might create a strong disincentive for companies to release open-weight models.<\/p>\n<p>\u201cIt&#8217;s kind of perverse,\u201d Mark Lemley told me. \u201cI don&#8217;t like that outcome.\u201d<\/p>\n<p>On the other hand, judges might conclude that it would be bad to effectively punish companies for publishing open-weight models.<\/p>\n<p>\u201cThere&#8217;s a degree to which being open and sharing weights is a kind of public service,\u201d Grimmelmann told me. \u201cI could honestly see judges being less skeptical of Meta and others who provide open-weight models.\u201d<\/p>\n<p>Timothy B. Lee was on staff at Ars Technica from 2017 to 2021. Today, he writes <a href=\"https:\/\/www.understandingai.org\/\" data-ml-dynamic=\"true\" data-ml-dynamic-type=\"sl\" data-orig-url=\"https:\/\/www.understandingai.org\/\" data-ml-id=\"0\" data-ml=\"true\" data-skimlinks-tracking=\"xid:fr1749765501754iea\" data-xid=\"fr1749765501754iea\" target=\"_blank\" rel=\"noopener\">Understanding AI,<\/a>\u00a0a newsletter that explores how AI works and how it&#8217;s changing our world. You can subscribe\u00a0<a href=\"https:\/\/www.understandingai.org\/\" target=\"_blank\" rel=\"noopener\">here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"The Google Books precedent probably can\u2019t protect Meta against this second legal theory because Google never made its&hellip;\n","protected":false},"author":2,"featured_media":199704,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3163],"tags":[323,1942,53,16,15],"class_list":{"0":"post-207295","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-technology","11":"tag-uk","12":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/114731923778192152","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/207295","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=207295"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/207295\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/199704"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=207295"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=207295"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=207295"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}