Google has struck a deal with Reddit that will allow the search engine maker to train its AI models on Reddit’s vast catalog of user-generated content, the two companies announced. Under the arrangement, Google will get access to Reddit’s Data API, which will help the company “better understand” content from the site.
The deal also provides Google with a valuable source of content it can use to train its AI models. “Google will now have efficient and structured access to fresher information, as well as enhanced signals that will help us better understand Reddit content and display, train on, and otherwise use it in the most accurate and relevant ways,” the company said in a statement.



is it Reddit’s content though?
It’s content that Reddit users generated which apparently is theirs to sell.
From the TOS/EULA, the content belongs to each user, they just license it to Reddit to use as it pleases.
So it’s user generated content that is a product for Reddit to sell, like most big tech companies do, as I said.
The difference is: Reddit doesn’t own the content, they can’t stop anyone else from selling it, or giving it for free; only the users could (the actual owners).
There are Reddit content dumps out there, which Reddit can’t stop anyone from using… so not sure what they are selling, but if it’s just that, then they’re scamming people.
If you are posting on walled-garden big tech site like Reddit, Instagram, Twitter / X, the site and therefore the company certainly owns your content and all the metadata attributed to it. You’re the product. This is why most of us are here on the Fediverse where things are different. Maybe if it’s your personal photo you took than you can make a copyright claim to some degree and download your data tediously but once it’s on their network it’s generally theirs to do as they please, whether that be sell to Google or any other advertiser or use on in-house advertising. Often without proper informed consent and not always legally. It’s definitely a scam, I agree. Hopefully this exposes it more and brings more people to places on the Fediverse where there’s no owner/seller/buyer of your data or anything else you contributed.
Ownership comes with both rights and responsibilities.
Platforms want as many of the rights as possible, without the responsibilities… which is why they have a contract (TOS) where they explicitly renounce to ownership, leaving it for the user, and only license the rights.
If platforms took full ownership, like in a “work for hire” agreement, they would be responsible for any illegal content a user could upload, since it wouldn’t be the user’s content anymore. Obviously they don’t want that.
A side effect of wanting as much content as possible without owning it, is that… well, they don’t own it. 😎
Incorrect. You get ownership of anything that’s yours, then upload stuff under whatever TOS your instance has… what’s that? it has no TOS? Then they’re in for a rough awakening some day. 🤷
Whether there are sellers/buyers… is something we’ll learn in time. For now, user generated content on the Fediverse gets shared with little regard or protection of anyone’s rights, so anyone can make a compilation, bundle it up, slap a price tag on it, and try to sell it.