- Sam Altman says AI needs small amounts of high quality data
- New York Times is suing OpenAI for copyright infringement
Artificial intelligence doesn’t need vast quantities of training data from publishers like
“There is this belief held by some people that you need all my training data and my training data is so valuable,” Altman said Tuesday at Bloomberg House at the World Economic Forum’s annual meeting in Davos. “Actually, that is generally not the case. We do not want to train on the New York Times data, for example.”
The ChatGPT maker is in the midst of a major push to secure access to news content after the Times last month
OpenAI is in talks with publishers including CNN, Fox Corp. and Time to license news content, Bloomberg
Read More:
“What we want to do with publishers — if they want — is when one of our users says, what happened at Davos today, we’ll be able to say, here’s an article from Bloomberg, here’s an article from the New York Times,” Altman said. “Some people want to partner with us, some people don’t.”
Prior to the lawsuit, OpenAI and the Times had been in contact since April regarding licensing and failed to reach a deal. The AI company is also facing class action suits from writers including comedian
Read More:
Other publishers have been more amenable to cooperation. OpenAI signed an agreement with the
“A lot of our research is how do we learn more from smaller amounts of very high quality data,” Altman said.
To contact the reporters on this story:
To contact the editors responsible for this story:
Graham Starr
© 2024 Bloomberg L.P. All rights reserved. Used with permission.
Learn more about Bloomberg Law or Log In to keep reading:
See Breaking News in Context
Bloomberg Law provides trusted coverage of current events enhanced with legal analysis.
Already a subscriber?
Log in to keep reading or access research tools and resources.