Authors sue OpenAI, allege ChatGPT was trained on their books

Last updated: July 5, 2023 5:07 pm

By Celebrity Gig Add a Comment

OpenAI logo on the website displayed on a phone screen and ChatGPT on AppStore displayed on a phone screen are seen in this illustration photo taken in Krakow, Poland on June 8, 2023.

Jakub Porzycki | Nurphoto | Getty Images

Two authors filed a lawsuit against OpenAI last week alleging that their copyrighted books were used to train the company’s artificial intelligence chatbot, ChatGPT, without their consent.

Paul Tremblay, the author of “The Cabin at the End of the World,” and Mona Awad, the author of “Bunny” and “13 Ways of Looking at a Fat Girl,” claim that ChatGPT generates “very accurate summaries” of their works, according to the complaint. They allege the summaries are “only possible” if ChatGPT was trained on their books, which would be a violation of copyright law.

OpenAI did not immediately respond to CNBC’s request for comment. Lawyers for Tremblay and Awad did not immediately respond.

ChatGPT automatically generates text based on written prompts in a fashion that’s much more advanced and creative than the chatbots of Silicon Valley’s past. The technology was developed by San Francisco-based OpenAI, a research company led by Sam Altman and backed by Microsoft.

The chatbot is trained on an enormous amount of text data. OpenAI doesn’t reveal what precise data was used for training ChatGPT, but the company says it generally crawled the web, including the use of archived books and Wikipedia.

The lawsuit, which was filed with a San Francisco federal court, alleges that “much” of the material in OpenAI’s training data is based on copyrighted materials, including books by Tremblay and Awad. But proving exactly how and where ChatGPT gleaned this information, as well as whether the authors have suffered financial damages, could be a challenge.

The complaint references exhibits of the summaries that ChatGPT generated, and it notes that the chatbot gets some things wrong. Awad and Tremblay claim that the rest of the summaries are accurate, however, which means “ChatGPT retains knowledge of particular works in the training dataset.”

“At no point did ChatGPT reproduce any of the copyright management information Plaintiffs included with their published works,” the complaint states.

Top Stories

Court indicts nurse, faults father, Wunmi for negligence

“Who is your man?” – Jumoke Odetola slams intrusive fans in fiery post

Over 20 Graduates storm TikToker Peller’s ₦500K Cameraman interview, sparks online buzz

Stay Connected

Authors sue OpenAI, allege ChatGPT was trained on their books

Leave a Reply Cancel reply

Content Safety

Trustworthy

Celebrity Gig Magazine

Related Stories

The US has a single rare earths mine. Chinese export limits are energizing a push for more

Airlines Drop Prices to Fill Seats, Some Flights Only $29

EU’s crackdown on Apple, Meta and others is to avoid forced breakups, top official says

Quordle today – hints and answers for Monday, June 10 (game #868)

‘Jeopardy!’ Champion Buzzy Cohen Shares Backstage Secrets

Balloon system can produce localized solar electricity for the ground below

Disney Is Investing $60 Billion in Its Cruises, Theme Parks

Not supplying Dangote Refinery crude shameful – FG

About Us

Quick Links