machine learning

2 posts
Your Bluesky Posts Are Probably In A Bunch of Datasets Now
Bluesky

After a machine learning librarian released and then deleted a dataset of one million Bluesky posts, several other, even bigger datasets have appeared in its place—including one of almost 300 million non-anonymized posts.
AI Video Generator Runway Trained on Thousands of YouTube Videos Without Permission
AI

A leaked document obtained by 404 Media shows that a company-wide effort collected thousands of YouTube videos and pirated content for training data.