On 26 November, Daniel van Strien, a machine learning librarian at Hugging Face, uploaded a dataset of 1m public posts and ...
Bluesky's Firehose is known for being an open API, but that openness is also its flaw, as anyone can scrape its data for the likes of AI ...
Daniel van Strien, a machine learning librarian at Hugging Face, took a million Bluesky posts and turned them into a dataset ...
Learn what makes Bluesky different by taking an in-depth look at its technological framework, privacy commitments, and ...
Bluesky, the social media platform often seen as a rival to Twitter, is at the center of a controversy after one million of ...
Bluesky user posts and user information were scraped by an AI researcher, built into a dataset, and published on open ...
A Hugging Face librarian released and later removed a dataset of 1 million Bluesky posts, sparking concerns over data transparency and consent. Daniel van Strien extracted the posts using the ...
A machine learning librarian scraped public posts and created a searchable AI training dataset without users' consent.
Bluesky is facing its first major controversy over data scraping after a dataset containing one million public posts appeared ...
As reported by 404 Media on Nov. 26, one million public Bluesky posts, complete with identifying user information, were crawled and then uploaded to AI company Hugging Face. The dataset was ...