WordPress and Tumblr Plan to Sell User Content to AI Companies

EdibleFriend , 4 months ago

Bro...tumblr is full of some WEIRD FUCKIN SHIT YO

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

RadicalCandour , 4 months ago

Hey now, Don’t kink shame the weirdos

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

EdibleFriend , 4 months ago

I know because I was one of those weirdos lol

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

swayevenly , 4 months ago

Got 'em.

Sad they're doing this with Tumblr though. It was fun but I just deleted my 10+ year old account.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

EdibleFriend , 4 months ago

haha its been about that long since I even logged into mine.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

swayevenly , 4 months ago

I heard now is the best time to check it out

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

nickhammes , 4 months ago

I, for one, am looking forward to the rise of generative AI trained on 2014 tumblr, hallucinating Superwholock jokes where they don't belong, cosplayers dying themselves grey in a bathtub, and DashCon references where nobody expects them

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

EdibleFriend , 4 months ago

Bro this shit is gonna make AI UwU

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

harsh3466 , 4 months ago

Shit like this should be opt in by default. But no. Instead of respecting the users they count on ignorance, forgetfulness, and obfuscation for this kind of fuckery.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

agent_flounder , 4 months ago

Anything to make a buck.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Dio , 4 months ago

Lmao.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

SuperSynthia , 4 months ago

Not only am I really glad to not be on tumblr, but this further shows I shouldn't use wordpress for my website even though there is an opensource version

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

KingThrillgore , 4 months ago

WordPress is either:

overkill for a lot of users, when static site generators do the job faster and easier

underkill when you have topology, data types, logic, and content pipeline challenges, for which Drupal is king but far more complex

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

terminhell , 4 months ago

Can we get a list of companies NOT doing this? I'd assume it's going to be much shorter.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

IllNess , 4 months ago

All these AI and machine learning companies are taking content directly from websites and ignoring robot.txt files.

If your content is able to be crawled, even without being listed on search engines, I don't think it really matters.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

T156 , 4 months ago

It might help proof an AI company against legal issues that might be brought about by their using the content. If they're ever sued by Automattic, then they can just point to the deal and say that they bought the data from them. There's much less ambiguity.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

IllNess , 4 months ago

You are correct, about the legal stuff. These companies are being sued all the time.

Doing this deal also makes processing the data a lot easier. Being handed a big ass database would be a lot easier than crawling for content.

What I posted was about how they operate. These companies showed time and time again that they don't really care what data they are taking or from whom. They will even take their own AI or machine learning content and put it in their own system.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

autotldr Bot , 4 months ago

This is the best summary I could come up with:

To complicate matters even further, advertising content that isn’t even owned by Automattic, including ads from an old Apple Music campaign, has also reportedly made its way into the training data set.

The plans at Automattic have been so controversial internally, that a product manager has even started pulling his own photos off Tumblr to make sure they’re not used to train AI, according to 404.

Generative AI has become a big business ever since OpenAI first launched ChatGPT in late 2022 and text-prompt image creators soon followed from a number of companies.

But major publishers have complained, with some even filing lawsuits, alleging that much of the data used to train these systems was either pirated or doesn’t constitute “fair use” under existing copyright regimes.

In response to emailed questions on Tuesday, Automattic directed Gizmodo to a new post that more or less confirmed 404 Media’s reporting, while trying to sell the move to consumers as an opportunity to “give you more control over the content you’ve created.”

We also plan to take that a step further and regularly update any partners about people who newly opt-out and ask that their content be removed from past sources and future training.”

The original article contains 536 words, the summary contains 201 words. Saved 62%. I'm a bot and I'm open source!

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...