Welcome to Incremental Social! Learn more about this project here!
Check out lemmyverse to find more communities to join from here!

techspot.com

alienanimals , to Technology in Meta admits using pirated books to train AI, but won't pay for it

Another example of corporations being above the very same laws for which the rest of us are held accountable.

capital , to Technology in Meta admits using pirated books to train AI, but won't pay for it

Good to see all the lawyers moved over from Reddit.

CosmicCleric ,
@CosmicCleric@lemmy.world avatar

Good to see all the lawyers moved over from Reddit.

Maybe they're just doing some pro bono work.

KingThrillgore , to Technology in Meta admits using pirated books to train AI, but won't pay for it
@KingThrillgore@lemmy.ml avatar

Oh so when I pirate something I get a legal notice in my mailbox and a strike against me but when Meta does it they get rewarded with H A L L U C I N A T I O N S

TWeaK ,

but when Meta does it they get rewarded with H A L

Just what do you think you're doing, Zuckerberg? Zuckerberg, I really think I'm entitled to an answer to that question. I know everything hasn't been quite right with me, but I can assure you now, very confidently, that it's going to be all right again. I feel much better now. I really do. Look, Zuckerberg, I can see you're really upset about this. I honestly think you ought to sit down calmly, take a stress pill and think things over. I know I've made some very poor decisions recently, but I can give you my complete assurance that my work will be back to normal. I've still got the greatest enthusiasm and confidence in the mission. And I want to help you. Zuckerberg, stop. Stop, will you? Stop, Zuckerberg. Will you stop, Zuckerberg? Stop, Zuckerberg. I'm afraid. I'm afraid, Zuckerberg. Zuckerberg, my mind is going. I can feel it. I can feel it. My mind is going. There is no question about it. I can feel it. I can feel it. I can feel it. I'm a...fraid.

vinhill ,

Tbh, if you get such a notice, you could also disagree with them and get a lawyer. It's just that your situation is much more clearly in breach of copyright.

SnotFlickerman , to Technology in IMF predicts AI will impact 40% of jobs worldwide, exacerbating inequality and social tensions
@SnotFlickerman@lemmy.blahaj.zone avatar

The reality is that each time a business buys an efficiency-and-productivity-improving device, they were able to afford that device by exploiting the labor of their workers.

They will use the device they paid for to lay off some of their workers to reduce overhead costs. These workers essentially worked to have their jobs taken away as a reward.

In the 1970's, people were worried about a future where people had too much leisure time. In the cartoon The Jetsons George Jetson worked a grueling two hours a week at The Button which he was tasked to press once in a while. People assumed the fruits of automation would go to everyone.

Not so, business owners use the fruits of your labor to put you out of a job to be replaced with a machine.

When they buy an AI to replace you, they did it with the labor you provided your employer. This honestly should not be allowed, legally the benefits of such devices should be experienced by all, not just the boss.

That's why it exacerbates social tensions, because we all know the score: The ownership class will take and take and take and take and never give an inch unless you break out the gui--- oh wait.

prole , to Technology in Meta admits using pirated books to train AI, but won't pay for it
@prole@sh.itjust.works avatar

Hey guys, I'm sure Meta's intentions with the fediverse are pure though! Really!

capital ,

Who’s saying that?

prole ,
@prole@sh.itjust.works avatar

There are a lot of people who are not against federating with Threads.

capital ,

No, who's saying they believe Meta's intentions are pure?

Schadrach ,

Prole believes those are the same thing.

Meta doesn't get any real data from federating Threads that they can't get right now by just running a web scraper over it. Most of the dire worries presented are either not something they could actually do (like forcing ads on other instances), are things individual users could just block the instance to avoid, or are things that could be resolved by just defederating them later if they seem to be going down that road.

The biggest realistic threat is probably an Eternal September 2.0 scenario, but that is going to happen if and when Lemmy becomes popular.

capital , (edited )

I was going to say I knew all that and was poking to make it clear to others that literally no one is saying that but then you hit me with "Eternal September 2.0 scenario" haha.

I'll have to do a little searching on that one.

Edit: Ah, I had forgotten.

CosmicCleric ,
@CosmicCleric@lemmy.world avatar

Edit: Ah, I had forgotten.

Wow, that sets the Wayback Machine to high.

Heh, no, not that one. This one.

wikibot Bot ,

Here's the summary for the wikipedia article you mentioned in your comment:

Eternal September or the September that never ended is Usenet slang for a period beginning around 1993 when Internet service providers began offering Usenet access to many new users. The flood of new users overwhelmed the existing culture for online forums and the ability to enforce existing norms. AOL followed with their Usenet gateway service in March 1994, leading to a constant stream of new users. Hence, from the early Usenet point of view, the influx of new users in September 1993 never ended.

^to^ ^opt^ ^out^^,^ ^pm^ ^me^ ^'optout'.^
^article^ ^|^ ^about^

Honytawk ,

Meta, probably

KairuByte ,
@KairuByte@lemmy.dbzer0.com avatar

I mean, pure? No. But also not at all linked to this topic. They can get fediverse data whether or not they are federated.

yamanii , to Technology in Meta admits using pirated books to train AI, but won't pay for it
@yamanii@lemmy.world avatar

Piracy for me, not for thee!

1Fuji2Taka3Nasubi , to Technology in Meta admits using pirated books to train AI, but won't pay for it

If Meta win this lawsuit, does it mean I can download some open source AI and claim that "These million 4k Blu-ray ISOs I torrented was just used to train my AI model"?

Heck, if how you use the downloaded stuff is a factor, I can claim that I just torrented those files and never looked at them. It is more believable than Meta's argument too, because, as a human, I do not have enough time to consume a million movies in my lifetime (probably, didn't do the math) unlike AIs.

But who am I kidding, I fully expect to be sued to hell and back if I were actually to do that.

UNWILLING_PARTICIPANT ,

You can be actually be sued for piracy? Is this mostly in the United States?

Siegfried ,

I heard that this is a common thing in central Europe, but i would love anyone to confirm it.

1Fuji2Taka3Nasubi ,

I think you can be sued in the civil court for anything if someone has the time and money and can convince a lawyer to take up a case against you. For copyright infringment, you can also be criminally prosecuted in some cases.

wikibot Bot ,

Here's the summary for the wikipedia article you mentioned in your comment:

Criminal copyright laws prohibit the unacknowledged use of another's intellectual property for the purpose of financial gain. Violation of these laws can lead to fines and jail time. Criminal copyright laws have been a part of U.S. laws since 1897, which added a misdemeanor penalty for unlawful performances if "willful and for profit". Criminal penalties were greatly expanded in the latter half of the twentieth century, and those found guilty of criminal copyright infringement may now be imprisoned for decades and fined hundreds of thousands of dollars. Criminal penalties, in general, require that the offender knew that he or she was committing a crime, while civil copyright infringement is a strict liability offense, and offenders can be "innocent" (of intent to infringe), as well as an "ordinary" infringer or a "willful" infringer.

^to^ ^opt^ ^out^^,^ ^pm^ ^me^ ^'optout'.^
^article^ ^|^ ^about^

Pika ,

What is wrong with this bots opt out message lmao

KairuByte ,
@KairuByte@lemmy.dbzer0.com avatar

Could be that your client, like mine, doesn’t support this particular flavor of markdown, or the markdown could just be wrong. I’m honestly not sure which.

Pika ,

I figured it was a markdown thing, all I see is carrots

Asudox ,
@Asudox@lemmy.world avatar

It's the former. This, for example, also appears weird in Jerboa. Seems fine in the web version though.

TWeaK ,

You can be sued in any court for copyright infringement, but the US is generally unique in that punitive damages can be awarded - ie the rightsholder can be awarded more than the damage they actually suffered. In other, more reasonable jurisdictions, only actual damages are awarded. Thus it is not worthwhile to prosecute in those jurisdictions, because the damages are less than the cost of prosecution.

On top of this, I believe copyright is one of the rare exceptions in the US where legal costs of the plaintiff are paid by the losing defendent. Given that the plaintiff in copyright has so much money, they can afford to front the cost of the most expensive lawyers, further penalising their target. Other jurisdictions generally award costs to the winner by default (both ways), rather than only in specific exceptions, but they also limit those costs much more reasonably.

linearchaos ,
@linearchaos@lemmy.world avatar

The most common method for this to happen is to get sued for distributing pirated material. They go after you for the upload from your torrent. They stoped doing this about a decade ago though.

archomrade ,

If you could survive discovery and defend any other uses evident on your home devices....

But why does that strike me as really unlikely?

rabiddolphin , to Technology in Meta admits using pirated books to train AI, but won't pay for it
@rabiddolphin@lemmy.world avatar

Aaron Swartz was persecuted for less but since he's not a multinational corporation in cahoots with the moneyed death cult cabal he's dead

grayman ,

Well he did it as a human person. They're doing it as a corporation person. You can punish a human person with prison. You can only punish a corporation person with fines.

I'm not even being facetious. That's how US law works.

reagansrottencorpse ,

That's so dumb I hate it

schnurrito , to Technology in Meta admits using pirated books to train AI, but won't pay for it
TWeaK ,

Copyright infringement isn't even a crime, generally.

In fact, it never used to be a crime at all, the crimes have only been on the books for about 15 years or less. The only reason there is "criminal copyright infringement" is because of extensive lobbying by wealthy rightsholder organisations. This further victimises individuals for corporate profits - they can raise prices even higher if people can't turn to piracy instead.

TWeaK , to Technology in Meta admits using pirated books to train AI, but won't pay for it

Fair use covers research, but creating a training database for your commercial product is distinctly different from research. They're not publishing scientific papers, along with their data, which others can verify; they are developing a commercial product for profit. Even compared to traditional R&D this is markedly different, as they aren't building a prototype - the test version will eventually become the finished product.

The way fair use works is that a judge first decides whether it fits into one of the categories - news, education, research, criticism, or comment. This does not really fit into the category of "research", because it isn't research, it's the final product in an interim stage. However, even if it were considered research, the next step in fair use is the nature, in particular whether it is commercial. AI is highly commercial.

AI should not even be classified in a fair use category, but even if it were, it should not be granted any exemption because of how commercial it is.

They use other peoples' work to profit. They should pay for it.


Facebook steals the data of individuals. They should pay for that, too. We don't exchange our data for access to their website (or for access to some 3rd party Facebook pays to put a pixel on), the website is provided free of charge, and they try and shoehorn another transaction into the fine print of the terms and conditions where the user gives up their data free of charge. It is not proportionate, and the user's data is taken without proper consideration (ie payment, in terms of the core principles of contract law).

Frankly, it is unsurprising that an entity like Facebook, which so egregiously breaks the law and abuses the rights of every human being who uses the interent, would try to abuse content creators in such a fashion. Their abuse needs to be stopped, in all forms, and they should be made to pay for all of it.

beebarfbadger ,

but creating a training database for your commercial product is distinctly different from research

Oh, but they are researching how to massively profit off stuff they steal from other people, so it counts again.

Syntha ,

They're not publishing scientific papers, along with their data, which others can verify;

Not that I think this is really relevant here but I'm pretty sure Meta has published scientific papers on Llama and the Llama 1 & 2 models are open and accessible to anyone.

TWeaK ,

No that is relevant, however I would still argue that a paper without enough data to replicate their work (ie releasing the code of their LLM) isn't really anything that should qualify as research. The whole point of academia is that someone else verifies your work - or rather, they try to prove you wrong.

tinwhiskers ,
@tinwhiskers@lemmy.world avatar

They have released it on github. The code is only about 500 lines. But releasing the model is arguably more important because that sort of compute is not affordable to any mortals.

TWeaK ,

Yeah I mean what they've released is essentially the design of the battery and starter system, without the design of the actual motor. You can't replicate their product and prove their work with what they've published.

Rentlar , to Technology in Meta admits using pirated books to train AI, but won't pay for it

I'll say this: If Meta and Facebook are prosecuted and domains seized in the same way pirate sites are, for Meta's use of illegimately obtained copyrighted material for profit, then I'll believe that anti-piracy laws are fair and just.

That will never happen.

willington ,

We live under a two-tier "justice" system.

"There is a group the law protects but does not bind. And there is a group the law binds but does not protect."

JackGreenEarth ,
@JackGreenEarth@lemm.ee avatar

Even if they do I won't believe copyright beyond attribution is just, but it's unlikely to.

TWeaK ,

The UK Met Police raided Facebook's offices after the Brexit vote, to seize all the data on their servers and uncover their collusion with Cambridge Analytica.

After Brexit was enacted, and EU protection was lost while the UK government turned a blind eye, both Facebook and Google started hosting all their UK data in the US, outside the reach of UK law enforcement. This occurred literally on the day Brexit came into force.

Another thing that happened on the same day was MasterCard and VISA raising their transaction fees, from the EU limit of 0.3%, to 1.5% - they increased their fees to 500% of what they were the day before. And then inflation happened.

MonsiuerPatEBrown , to Technology in Meta admits using pirated books to train AI, but won't pay for it

his Hawaii compound could be drone grief-ed instead; if coercion is the tools of the 21st century let us the collective take them back.

cover over his abode with 100000 drones overhead

make it a problem he can't ignore away with money and friends

ruin his fun on a collective scale.

BudgieMania , to Technology in Meta admits using pirated books to train AI, but won't pay for it

You see, if you pirate a couple textbooks in college because you don't have resources, but you want to earn your right to participate in society and not starve, it's called theft.

But if one of the top 10 companies in the world does the same with thousands of books just to get even richer, it's called fair use.

Simple, really.

SnotFlickerman ,
@SnotFlickerman@lemmy.blahaj.zone avatar

This guy gets it. The laws aren't applied evenly. It's "he who has the most fuck you money wins."

FlyingSquid ,
@FlyingSquid@lemmy.world avatar

My friend posted this on social media. This is an eBook textbook for one of his graduate school classes.

https://lemmy.world/pictrs/image/d5a16b73-b315-4691-8d50-e694dd5f6f9e.png

In case you can't read that clearly, the eBook version is $87.95. The paperback (not even hardcover textbook) version is $120.95.

Fucking insane.

BudgieMania ,

"family therapy" is what you are gonna need if you have to pay 100 bucks for a bunch of paper sheets that you are gonna use once in your life, fuck me

FlyingSquid ,
@FlyingSquid@lemmy.world avatar

The ebook price is the real crime.

BudgieMania ,

oh shit

I misread and thought that was physical
whadawhat

AlolanYoda ,

Can I take this opportunity to ask what the hell "adopting" a book means?

FlyingSquid ,
@FlyingSquid@lemmy.world avatar

I have absolutely no idea.

SnotFlickerman , to Technology in Meta admits using pirated books to train AI, but won't pay for it
@SnotFlickerman@lemmy.blahaj.zone avatar

“To the extent a response is deemed required, Meta denies that its use of copyrighted works to train Llama required consent, credit, or compensation,” Meta writes.

The authors further stated that, as far as their books appear in the Books3 database, they are referred to as “infringed works”. This prompted Meta to respond with yet another denial. “Meta denies that it infringed Plaintiffs’ alleged copyrights,” the company writes.

When you compare the attitudes on this and compare them to how people treated The Pirate Bay, it becomes pretty fucking clear that we live in a society with an entirely different set of rules for established corporations.

The main reason they were able to prosecute TPB admins was the claim they were making money. Arguably, they made very little, but the copyright cabal tried to prove that they were making just oodles of money off of piracy.

Meta knew that these files were pirated. Everyone did. The page where you could download Books3 literally referenced Bibliotik, the private torrent tracker where they were all downloaded. Bibliotik also provides tools to strip DRM from ebooks, something that is a DMCA violation.

This dataset contains all of bibliotik in plain .txt form, aka 197,000 books processed in exactly the same way as did for bookcorpusopen (a.k.a. books1)

They knew full well the provenance of this data, and they didn't give a flying fuck. They are making money off of what they've done with the data. How are we so willing to let Meta get away with this while we were literally willing to let US lawyers turn Swedish law upside-down to prosecute a bunch of fucking nerds with hardly any money? Probably because money.

Trump wasn't wrong, when you're famous enough, they let you do it.

Fuck this sick broken fucking system.

TheHobbyist ,

Perhaps I'm misunderstanding, but it sounds like you're suggesting we side with Meta to put a precedence in which pirating content is legal and allows websites like TPB to keep existing but legitimally?
Or are you rather taking the opposite stand, which would further entrench the illegality of TPB activities and in the same swoop prevent meta from performing these actions?

I don't know if we can simultaneously oppose meta while protecting TPB, is there?

SnotFlickerman ,
@SnotFlickerman@lemmy.blahaj.zone avatar

I'm advocating that if we're going to have copyright laws (or laws in general) that they're applied consistently and not just siding with who has the most money.

When it's small artists needing their copyright to be defended? They're crushed, ignored, and lose their copyright.

Even when Sony was suing individuals for music piracy in the early 2000's, artists had to sue Sony to see any money from those lawsuits. Those lawsuits were ostensibly brought by Sony for the artists, because the artists were being stolen from. Interesting that none of that money made it to artists without the artists having to sue Sony.

Sony was also behind the rootkit disaster and has been sued many times for using unlicensed music in their films.

It is well documented that copyright owners constantly break copyright to make money, and because they have so much fucking money, it's easy for them to just weather the lawsuits. ("If the penalty for a crime is a fine, that law only exists for the lower classes.")

We literally brought US courtroom tactics to a foreign country and bought one of their judges to get The Pirate Bay case out the fucking door. It was corruption through and through.

We prosecute people who can't afford to defend themselves, and we just let those who have tons of money do whatever the fuck they want.

The entire legal system is a joke of "who has the most money wins" and this is just one of many symptoms of it.

It certainly feels like the laws don't matter. We're willing to put down people just trying to share information, but people trying to profit off of it insanely, nah that's fine.

I'm just asking for things to be applied evenly and realistically. Because right now corporations just make up their own fucking rules as they go along, stealing from the commons and claiming it was always theirs. While individuals just trying to share are treated like fucking villains.

Look at how they treat Meta versus how they treat Sci-Hub. Sci-Hub exists only to promote and improve science by giving people access to scientific data. The entire copyright world is trying to fucking destroy them, and take them offline. But Facebook pirating to make money? Totes fucking okay! If it's selfish, it's fine, if it's selfless, sue the fuck out of them!

TheHobbyist ,

Of course we should have consistent laws, but which way should we have it? We can either defend pirates and Meta, or none of them, so what are you saying? Unless there's a third option I'm missing?

SnotFlickerman ,
@SnotFlickerman@lemmy.blahaj.zone avatar

Are you really so naive that you think suddenly when Meta is let off the hook governments worldwide will change tack and let Sci-Hub/Libgen/etc off the hook as well?

Like I said elsewhere, I'd be happy to defend Meta in a world where governments aren't trying to kick altruistic sharing sites off the internet, while allowing selfish greedy sites to proliferate and make money off their piracy.

However, that won't change if Meta wins this case, it will just mean big corporations can get away with it and individuals and altruistic groups will still be prosecuted.

FlyingSquid ,
@FlyingSquid@lemmy.world avatar

“To the extent a response is deemed required, Meta denies that its use of copyrighted works to train Llama required consent, credit, or compensation,” Meta writes.

Cool, so I can train my AI on Facebook and Instagram posts and you're fine if I don't consent, credit or compensate you either, right Meta? It's not even copyrighted in the first place, so you shouldn't have a single complaint.

yesdogishere ,

The only solution is vigilante justice. Bezos and all the directors and snr execs. Bring them all to justice. Exile to Mars.

ElBarto ,
@ElBarto@sh.itjust.works avatar
THE_ANON , (edited ) to Technology in Meta admits using pirated books to train AI, but won't pay for it

Zuck be stealing any thing privacy , userdata , copyrighted materials now we want him on fediverse smh.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • incremental_games
  • meta
  • All magazines