Guide
Using the MDC Python SDK Library to Download Datasets
In this guide, you will learn how to use the MDC Python SDK Library to download datasets from the Mozilla Data Collective website.
News
This week: dataset filtering, enhanced uploader request flow, API improvements, and 20 new datasets!
News
This week: new features for uploaders, updates to dataset search, and new datasets on MDC!
News
Mozilla Data Collective has an amazing opportunity for you to get a free ticket to the 2026 Mozilla Festival in beautiful Barcelona, Spain, in November 2026. We are looking for feedback to inform our 2026 roadmap. Help shape the future of ethical data-sharing by filling out the form below
FAQ
If you need to remove a dataset from Mozilla Data Collective after it has been published, you can make your dataset private through the following steps: 1. Sign in to your account, making sure you are using the account that originally published the dataset 2. Go to your Profile >
News
TL;DR: Update to the newest version of the MDC Python library (0.2.0 or newer) to continue downloading datasets.
FAQ
We review every request to become a data provider on Mozilla Data Collective.
Reviewing Uploader Requests
If you have not already been in contact with our team about uploading data to Mozilla Data Collective, a member of our team will reach out to you via email to discuss your dataset
FAQ
We are a collective of linguists, technologists, activists, researchers and creatives. Whether you’re interested in stewarding data, conducting research, developing new AI and ML technologies, or just want to be part of our community working to make AI all it promises to be - not all it threatens to
FAQ
No, you do not need to be a part of an organization to upload a dataset to Mozilla Data Collective. We recognize that there are many use cases where an individual might want to share datasets that they have created, either on their own or on behalf of a group.
FAQ
When you upload a dataset to Mozilla Data Collective, you have the option to make your dataset exclusive to MDC. Under the default terms of use for data providers on the platform, datasets are exclusive to MDC. Choosing to host your dataset exclusively with MDC means that you do
FAQ
Our priority is technology that is more multilingual, multicultural, and multi-modal. We prioritise helping communities unlock content that is not on the web already, and prefer audio, image, and video formats, though we will also accept text documents that advance the above goals. Our expectation is that each dataset is
FAQ
About downloading and using datasets
When downloading a dataset, am I getting permission to use it from Mozilla Data Collective or the Data Provider?
When you download a dataset from Mozilla Data Collective, you are entering into an agreement with the Data Provider who published the dataset. Data Providers set