AI trainers do a lot of work filtering and reformatting the training data. Often that's the most expensive part. There's a lot of synthetic data used these days too, reprocessed by other AIs.
i stopped using reddit and deleted my accout and posts when they introduced those fucking nft-avatars and it seems that they've been going downhill ever since that.
When you delete your account and posts now, unless you edit them first, all deleting them does is hide their visibility in the database. The post is still there.
The only engagement you actually get is on super-niche subreddits. Other than that, the "engagement" you get on reddit is largely indistinguishable from bot traffic.
The difference is that Lemmy admins across the fediverse aren’t making the user experience worse so they can sell the data to corporations for LLM training
Secondly, I think it’s more important what they did to achieve this goal, locking down the API behind a paywall was their way of creating value in their data. They knew then that it would be too expensive for independent developers to pay for but didn’t care. They knew the money would be coming AI data brokers.