News
Upcoming Domain Change on 09 April
In mid-April, Mozilla Data Collective's primary domain will change to mozilladatacollective.com
News
In mid-April, Mozilla Data Collective's primary domain will change to mozilladatacollective.com
News
379 new datasets with a Mozilla Common Voice update, improvements to the Python SDK (make sure you update to the latest!) and a preview of an upcoming feature. đź‘€
data
Panjebar Semangat, a weekly Javanese-language magazine established before Indonesian independence, is collaborating with Mozilla Data Collective to advance community-governed language dataset frameworks.
News
Mozilla Data Collective is building towards a multicultural, multilingual, and multimodal future that works for all of us. And over the past few months, we’ve listened as people have flagged what kinds of datasets they need, but are struggling to find. So we’re pleased to announce that MDC
News
This week: 19 new datasets and a few small changes while we're heads down in some exciting new features that will be coming soon...
News
The institutions that safeguard humanity's cultural memory, galleries, libraries, archives, and museums (collectively known as the GLAM sector) are confronting a paradox that defines the current moment in AI development. Years of careful digitization of their archives have transformed physical collections into vast, machine-readable repositories of human knowledge.
News
This week: dataset filtering, enhanced uploader request flow, API improvements, and 20 new datasets!
News
This week: new features for uploaders, updates to dataset search, and new datasets on MDC!
News
Mozilla Data Collective has an amazing opportunity for you to get a free ticket to the 2026 Mozilla Festival in beautiful Barcelona, Spain this November, 2026. We are looking for feedback to inform our 2026 roadmap. Help shape the future of ethical data-sharing by filling out the form below
News
The internet belongs to everyone—but right now, it doesn't work for everyone. From the islands of Borneo to the mountains of Pakistan, hundreds of millions of people speak languages that AI simply can't understand. That's a problem we can solve together. At Mozilla
News
The internet belongs to everyone—but right now, it doesn't work for everyone. Millions of people speak languages that AI simply can't understand, and that's a problem we can solve together. At Mozilla Data Collective, we're proud to host a growing collection
News
Tl;dr: Update to newest version of the MDC Python library (0.2.0 or newer) to continue downloading datasets