This picture shows screens displaying the logos of OpenAI and ChatGPT. (Photo by LIONEL … [+] BONAVENTURE/AFP via Getty Images)

AFP via Getty Images

OpenAI is at the center of a copyright debacle that could shape the future of content creation and publishing discourse. The latest case, filed by global book publishers in India, alleges that OpenAI’s language models are unlawfully trained on proprietary material.

The outcome of this case could determine whether AI systems are going to be subject to stricter regulations — or gain freer rein over existing data pools.

Moreover, India’s role as a global publishing powerhouse makes this case particularly interesting. With vast troves of content and a burgeoning AI ecosystem, the country is uniquely positioned to influence global copyright standards. The case could serve as a blueprint for how countries reconcile intellectual property frameworks with rapid AI advancements.

Proponents of stricter AI regulations argue that large language models like ChatGPT benefit disproportionately from copyrighted materials, essentially “scraping” years of creative labor without compensation. On the other hand, OpenAI and its AI allies contend that such data is necessary for innovation, and that fair-use policies offer sufficient protection.

Who owns the copyright? “The AI’s creator might be compared to the camera maker, while the AI user who prompts the creation of a specific work might be compared to the photographer who uses that camera to capture a specific image,” notes Christopher T. Zirpoli, legislative attorney at the U.S. Congressional Research Service.

Zirpoli suggests that while AI users might be considered the authors of specific outputs, the creative process behind coding and training AI gives its creators a stronger claim to authorship than a traditional toolmaker.

“In this view, the AI user would be considered the author and, therefore, the initial copyright owner,” he states. “The creative choices involved in coding and training the AI, on the other hand, might give an AI’s creator a stronger claim to some form of authorship than the manufacturer of a camera.”

Some firms that provide AI software attempt to allocate the respective ownership rights of the company and its users via contract, such as the company’s terms of service, states Zirpoli. OpenAI’s Terms of Use assign copyright of AI-generated content to the user:

“As between you and OpenAI, and to the extent permitted by applicable law, you (a) retain your ownership rights in Input and (b) own the Output. We hereby assign to you all our right, title, and interest, if any, in and to Output.”

What does this imply?

“OpenAI seems to cleverly bypass most copyright questions through contract, making only oblique references to ownership in an IP sense,” noted Andres Guadamuz, a scholar in intellectual property law at the University of Sussex.

“The commercialisation of AI works is here to stay.”

Experts suggest that a licensing model, where AI developers pay royalties to use copyrighted materials, could strike a balance — but that solution has its own concomitant risks: how should royalties be distributed? Will smaller creators be compensated fairly, or will larger publishers dominate the payouts?

The Indian lawsuit is not an isolated event. Similar cases are unfolding in the U.S., EU, and other jurisdictions, signaling a global reckoning for AI ethics and governance. These cases highlight the need for well-defined regulations and guidelines to help ensure that tech innovation happens in line with IP rights — clearer legal frameworks could go a long way in making sure innovation doesn’t come at the expense of equity, and that a more ethical sound approach to AI development becomes the norm.

“This free tool produces book summaries, extracts — why would people buy books then?” Pranav Gupta, general secretary of the Federation of Indian Publishers, told Reuters, referring to AI chatbots using extracts from unlicensed online copies. “This will impact our sales; all members are concerned about this.”