A long list of tech companies are rushing to give themselves the right to use people’s data to train AI::More companies are quietly giving themselves permission to use consumer data to train generative AI models and tools.
Tethics
Backpfeifengesicht x1000
Give or take how much time do you guys think until Meta or any other big corp starts scraping and selling data from the Fediverse?
I thought that was the whole point of Threads.net. I still don’t understand why lemmyWorld hasn’t blocked them.
Are you implying that’s an issue? We freely publish these comments for everyone to use equally.
The very least they could do is allowing those people to use their AI.
This is the best summary I could come up with:
Over the last couple of months, companies as varied as Twitter, or X, Microsoft, Instacart, Meta, and Zoom have rushed to update their terms of service and/or privacy policies to allow the collection of information and content from people and customers as data to train generative artificial intelligence models.
Tweets, web searches and apparently even grocery shopping are now an opportunity for companies to build more predictive tools like Bard and ChatGPT, which is owned by OpenAI and receives considerable backing from Microsoft.
Users were only prompted to review updated Terms in September, in an email from the company announcing its partnership with OpenAI as “a new third-party sub-processor.”
However, Instacart also added language that left it a window to do just that with its own customers’ data, saying its license now allows it to “…otherwise enhance our machine learning algorithms, for the purposes of operating, providing, and improving the services.”
“We’re incorporating generative-AI experiences into our products to assist with customers’ grocery shopping questions and help them make food-related decisions,” the spokesperson said.
At the end of August, it created a simple form where users could “request” to opt out of their data being used to train AI models.
The original article contains 1,151 words, the summary contains 200 words. Saved 83%. I’m a bot and I’m open source!
I don’t see the issue with this, don’t give your data to companies if you don’t them to use it. No one is forcing us to use these services, if you don’t want twitter to train their AI off your tweets, then don’t tweet.
The problem is if you signed up for an account for a social media service years ago and they suddenly decide (without telling you or getting your consent) to start training off your data, there is nothing you can do if you don’t know it’s happening.
If the admins of the largest Lemmy instance didn’t tell us that they were gonna use our posts and comments as AI training data and everybody was none the wiser, how would we react? We wouldn’t until someone finds out and spills the beans.
I’m actually fine with it, in the case of Lemmy this is all public data, whether or not Lemmy admins are training AI on it, there is nothing to stop me from training my own AI models with this data.
I think the larger issue is I don’t consider it “your data” once you put it on one of these sites. As soon as you take your own thought and put it on facebook/instagram/reddit/whatever, it’s now theirs, it lives in their databases, and frankly for a social media company it’s probably their most valuable asset.
No one is forcing anyone to use social media, if you want your thoughts and actions to be your own I would recommend not putting them on the internet.