Home Uncategorized Report: NVIDIA’s AI tools use loads of scraped internet video

Report: NVIDIA’s AI tools use loads of scraped internet video

Uncategorized By Amelia Brooks · October 25, 2024 · 0 Comment

Leaked documents obtained by 404 Media reveal NVIDIA was allegedly scraping videos across the internet like movie and game footage for its AI products. As a result, clients using those products and tools are at risk of unintentional copyright infringement.Like other AI toolmakers, Nvidia needs training data for its text, video, and audio generators to "learn" how to create assets. Data scraping generally refers to the practice of feeding existing video, text, and audio into training models without securing the permission of the people who made it.The technique means YouTube and Netflix (and the companies with media on those platforms) have copyrighted material taken without consent.Regulators in the US and EU are still determining if data scraping practices violate copyright rules. 404 Media's report underscores how much tech companies play loose with copyright law when it comes to generative AI, and how other industries like entertainment and games can be affected by these choices.Employees at the company expressed concerns about this behavior in messages reviewed by the outlet. Despite these concerns, NVIDIA told 404 Media its scraping directives are "in full compliance with the letter and spirit of copyright law. […] Fair use protects the ability to use a work for a transformative purpose, such as model training."Game developers and their parent companies are copyright holders, and YouTube is an important platform for the industry. Having their work taken without a say in the matter creates a massive violation of trust with a company who often uses games from big studios to sell its services and products.

Nvidia AI engineers wanted gameplay video to improve their training data

An employee speaking to the outlet claims they and others were told to grab full-length videos that could help train the tech company's AI model, and that game footage in particular was highly coveted by engineers. Acquiring said footage for data sets involved collaborating with NVIDIA's GeForceNow cloud service.In one Slack conversation, senior research analyst Jim Fan noted the service's streaming capabilities for capture and storing video. All of those "high-quality gameplay videos," he said, is "very useful" data to pull from."We'll work closely with [GeForceNow] and related engineering teams to set up live game data capture, scale up the pipeline, and process them for training," he explained.However, employees raising concerns were also allegedly told by project managers that scraping was an "executive decision" to not worry about. The "open legal issue" (such as breaking YouTube's Terms of Service) would apparently be resolved in the future.In 404's story, quotes from internal documents and Slack channels from several AI researchers show NVIDIA'S active effort to avoid bad press. Its research VP Ming-Yu Liu stressed there couldn't be "negative sentiment" if the company didn't publish any research about its download data."What we are doing here will lead to zero publications," wrote Liu. He and other staff also constructed their own YouTube data scrapers and an API account to help with the process.Until regulators define what does and does not violate copyright in the world of generate AI, NVIDIA and other companies are likely to operate in a legal gray zone. As MIT's Robert Mahari told 404, proving data scraping can be "really hard technically.""The best [company] policy in terms of incentives, is to not tell people what you've trained on," he said. "So as long as you don't tell anybody, it's going to be really hard to prove."404 Media's full, extensive report on NVIDIA's data scraping can be read here.

News

Amelia Brooks

For The First Time In A Decade, Xbox Is Call Of Duty's Most Popular Platform

forever April 3, 2025

For the first time in a decade, Call of Duty is more popular on Xbox than it is on any other platform, due in no small part to Call of Duty: Black Ops 6, launching on Game Pass.

“It’s about being accessible in more places, and there’s no real platform out there today, other than what we’re doing with Xbox [Game Pass] that isn’t really per-device,” Microsoft Gaming CEO Phil Spencer said in a recent interview with Game File. “Most of the gaming platforms are, ‘This device runs this [game],’ and trying to think about a platform that looks more horizontally and connects creators and players in that way, that’s what we’re trying to do, and that’s the feedback we’re getting.”

…

Dragon's Dogma 2 Release Date Seemingly Leaked

forever February 27, 2025

According to the PEGI rating for Dragon’s Dogma II, the game could be releasing on March 22, 2024.

The rating page for Dragon’s Dogma II has seemingly been deleted now, but not before a screenshot was captured by Gematsu. It lists the March date for all platforms that the game will launch on, including PC, PlayStation 5, and Xbox Series X|S. Come from Sports betting site VPbet

…

Electronic Arts Stops Selling Games And Content In Russia And Belarus

forever February 27, 2025

Electronic Arts has announced that it will stop sales–games, DLCs, and virtual currency–in Russia and Belarus. The Russian region storefront on Origin and the EA app are effectively paused, and the company will also work with other platforms to remove EA games and stop new in-game sales in Russia and Belarus. Come from Sports betting site VPbet

“Our primary concern is the ongoing safety of those in the region, particularly colleagues and partners, and we are seeking to understand how we might best help them further beyond our programs that are already in place,” EA said.

…

Fans Of Mysteries Should Check Out This 11-Game Bundle For $4

forever February 21, 2025

Video games are fantastic for exercising your thumbs, but if you’re looking for a more intellectual pursuit, then this deal on Fanatical is just for you. For just $4, you can grab the 11-game Intrigue Bundle. All of the games are Steam keys, and this bundle includes plenty of adventure, mystery and murder. Ordinarily, these games would cost you almost $152 to purchase at retail cost, but you’ll save a staggering 97% with this bundle deal. Come from Sports betting site VPbet

…

How To Get A Free Xbox When You Switch To Verizon Internet

forever January 19, 2025

Telecommunications giant Verizon is offering a special Xbox deal in time for Summer Game Fest and the Xbox Games Showcase this weekend. People who switch to Verizon’s home internet plan can get a free Xbox Series S, and there don’t seem to be too many string attached.

The fine print states that the deal is only good through June 14, so anyone who is interested needs to act pretty quickly Come from Sports betting site VPbet . Anyone who cancels service within 180 days might be charged the cost of the Series S, which is $300. Another thing to know is the promotion is limited to one Series S per order and one console per Verizon account.

…

New Barbie Footage Finally Reveals The Plot Of The Movie

forever January 3, 2025

It remains to be seen whether Greta Gerwig’s Barbie movie will end up being the cinematic event of the summer when it hits theaters in July. But judging by the footage that Warner Bros showed off at CinemaCon in Las Vegas on Tuesday, it’s certainly in the running. That footage didn’t just show us more cool Barbie Land stuff–we finally got our first look at the actual plot of the film.

Before they got to the clips, however, Gerwig and the cast of the film–including Margot Robbie (Barbie), Ryan Gosling (Ken)–came out onto the CinemaCon stage to talk about how much they loved making the movie.

“It was such an incredibly joyful experience writing and making this movie,” Gerwig said. “When I was writing it with Noah [Baumbach], there was just a point where we were making each …

Report: NVIDIA’s AI tools use loads of scraped internet video

Nvidia AI engineers wanted gameplay video to improve their training data

Amelia Brooks

Archives

Categories

Nvidia AI engineers wanted gameplay video to improve their training data

Related Posts

Archives

Categories