Sarah Silverman and others file court case claiming CEO approved use of dataset despite warnings
Citing internal Meta communications, the filing claims that the social network company’s chief executive backed the use of the LibGen dataset, a vast online archive of books, despite warnings within the company’s AI executive team that it is a dataset “we know to be pirated”.
The internal message says that using a database containing pirated material could weaken the Facebook and Instagram owner’s negotiations with regulators, according to the filing. “Media coverage suggesting we have used a dataset we know to be pirated, such as LibGen, may undermine our negotiating position with regulators.”
The authors sued Meta in 2023, arguing that the social media company misused their books to train Llama, the large language model that powers its chatbots.
The Library Genesis, or LibGen, dataset is a “shadow library” that originated in Russia and claims to contain millions of novels, nonfiction books and science magazine articles. Last year a New York federal court ordered LibGen’s anonymous operators to pay a group of publishers $30m (£24m) in damages for copyright infringement.
The filing cites a memo, referring to Mark Zuckerberg’s initials, noting that “after escalation to MZ”, Meta’s AI team “has been approved to use LibGen”.
Quoting internal communications, the filing also says Meta engineers discussed accessing and reviewing LibGen data but hesitated on starting that process because “torrenting”, a term for peer-to-peer sharing of files, from “a [Meta-owned] corporate laptop doesn’t feel right”.
Sign up to Business Today
Get set for the working day – we’ll point you to all the business news and analysis you need every morning
The writers argued this week that the evidence bolstered their infringement claims and justified reviving their CMI case and adding a new computer fraud allegation.
Chhabria said during a hearing on Thursday that he would allow the writers to file an amended complaint but expressed scepticism about the merits of the fraud and CMI claims.
Meta has been contacted for comment.
Reuters contributed to this article
Source: www.theguardian.com