Nvidia, Apple, and others allegedly trained AI using 173,000 YouTube videos — professional creators frustrated by latest AI training scandal: Report

lemme in@lemm.ee · 4 months ago

ShadowRam@fedia.io · 4 months ago

without consent,

Youtubers still got paid for the AI views.

What extra compensation do they think they are due?

Dude learns how to do plumbing from plumbing channels, goes out and starts a plumbing business. I don’t see why the author of said videos is entitled to prevent that, or why the plumber needs the author’s permission to do so.

I’m sure I’ll eat the downvotes from this group who’ll shout FUCK AI no matter what the context is, but in this particular case, I don’t see how this is a problem.

Entity watched your video, and then went and did something that made money. It didn’t copy your video, that’s not how AI works. So copyright doesn’t have a leg to stand on.

You created a video to garner views and make you money. This thing saw your video, you made your money.

subignition@fedia.io · 4 months ago

Dude learns how to do plumbing from plumbing channels, makes his own shittier video series on how to do plumbing made out of clips he didn’t have the rights to from the plumbing channel

Fixed that for you

ShadowRam@fedia.io · edit-2 4 months ago

made out of clips he didn’t have the rights

See, and this is where you’re showing your ignorance of how AI currently functions.

Yes, it’s possible the AI could go and make shittier videos with its new knowledge. As could the novice plumber in the example I gave.

But the AI isn’t copying clips of any videos.

It’s not a repository of the videos/pictures or words it was exposed to, that it just recalls.

LLMs do not model the world - Sean Carroll

subignition@fedia.io · edit-2 4 months ago

It generates new content that is based on patterns it has acquired from training data. The fact that you can’t readily trace/attribute output to specific parts of training data does not make it permissible for a human to cause the LLM to train on that data without permission of the rights holder, or in violation of the content provider’s ToS.

I fear you are getting stuck nitpicking my analogy which was a bit simplified.

ShadowRam@fedia.io · edit-2 4 months ago

does not make it permissible for a human to cause the LLM to train on that data without permission of the rights holder

Says who? These videos are out there for people (or things) to see.

If someone was playing some videos to train their dog to respond to a noise, what business is that of the rights holder?

Show me where in the ToS from over a year ago it says you’re not allowed to train an AI on the video.

Rights holders can’t control what people are using the video for. They can control when and how it’s delivered, but not who’s actually watching it.

subignition@fedia.io · 4 months ago

Says who? These videos are out there for people (or things) to see.

What an awful troll you are. You conveniently didn’t quote the remainder of the sentence so you could try to nitpick a part of my response out of context.

Read the “Permissions and Restrictions” section of the YouTube terms of service.

ShadowRam@fedia.io · 4 months ago

Even now, it says nothing about letting AI watch them.

Goun@lemmy.ml · 4 months ago

Youtubers still got paid for the AI views.

First, are you sure about that? I’m pretty sure they don’t get anything even when their videos are watched by real people using another frontend, like FreeTube, let alone an automated scraper.

Second, they’re violating YouTube’s terms, if I read correctly.

ShadowRam@fedia.io · 4 months ago

If these companies used YouTube videos in a way that circumvented the creators’ revenue stream in any way, then yeah, absolutely that’s a problem. But that’s a completely different issue, not related to who/what is consuming the video.

GBU_28@lemm.ee · 4 months ago

Yeah it’s not like mass web scraping is a new thing.