MDC Release Notes - 13.02.26
This week: new features for uploaders, updates to dataset search, and new datasets on MDC!
A practical guide for communities creating datasets together, no legal expertise required. Based on our data governance workshop at Mozilla Festival Zambia 2024. Your community has created something valuable: a dataset. Maybe it's voice recordings in your language. Maybe it's traditional knowledge, local photographs, or cultural
Mozilla Data Collective has an amazing opportunity for you to get a free ticket to the 2026 Mozilla Festival in Barcelona, Spain, this November. We are looking for feedback to inform our 2026 roadmap. Help shape the future of ethical data-sharing by filling out the form below
If you need to remove a dataset from Mozilla Data Collective after it has been published, you can make your dataset private through the following steps:
1. Sign in to your account. Make sure you are using the account that originally published the dataset.
2. Go to your Profile >
You might be sitting on something precious
The modern world runs on data. One unfortunate result is that many of us unknowingly produce data for third-party companies, who use our content and actions as data points to make AI models that they then sell back
The internet belongs to everyone—but right now, it doesn't work for everyone. From the islands of Borneo to the mountains of Pakistan, hundreds of millions of people speak languages that AI simply can't understand. That's a problem we can solve together. At Mozilla
The internet belongs to everyone—but right now, it doesn't work for everyone. Millions of people speak languages that AI simply can't understand, and that's a problem we can solve together. At Mozilla Data Collective, we're proud to host a growing collection
Tl;dr: Update to the newest version of the MDC Python library (0.2.0 or newer) to continue downloading datasets.
We review every request to become a data provider on Mozilla Data Collective.
Reviewing Uploader Requests
If you have not already been in contact with our team about uploading data to Mozilla Data Collective, a member of our team will reach out to you via email to discuss your dataset
Key highlights from the Common Voice v24 Scripted Speech and v2 Spontaneous Speech release.
We are a collective of linguists, technologists, activists, researchers and creatives. Whether you’re interested in stewarding data, conducting research, developing new AI and ML technologies, or just want to be part of our community working to make AI all it promises to be - not all it threatens to
No, you do not need to be a part of an organization to upload a dataset to Mozilla Data Collective. We recognize that there are many use cases where an individual might want to share datasets that they have created, either on their own or on behalf of a group.