Recommended Repositories
Recommended Data Repositories for AllStats Authors
Why Share Your Data?
Sharing research data promotes transparency, enables verification of results, and allows reuse in future studies. AllStats requires data sharing for published articles unless prohibited by ethical, legal, or privacy concerns. Below are vetted, field-agnostic and domain-specific repositories recommended by the journal.
General-Purpose Repositories
Suitable for any statistical dataset; most offer free storage and assign persistent identifiers (DOIs).
|
Repository |
Best For |
Cost |
DOI |
Size Limit |
Link |
|
Zenodo |
Any research data, code, supplements |
Free |
Yes |
50 GB/dataset |
|
|
Figshare |
Datasets, figures, media, posters |
Free |
Yes |
5 GB/file, 20 GB private |
|
|
Dryad |
Curated data from published research |
$120 submission fee* |
Yes |
300 GB/dataset |
|
|
Open Science Framework (OSF) |
Projects with multiple files/stages |
Free |
Yes |
5 GB/file, 50 GB/project |
|
|
Mendeley Data |
Broad discipline coverage |
Free |
Yes |
10 GB/dataset |
* Dryad’s fee may be waived for authors from low-income countries or through institutional memberships.
Domain-Specific Repositories
Choose these if your data fits a specialized field; they often provide enhanced metadata standards and community recognition.
Biomedical & Life Sciences
- NCBI Gene Expression Omnibus (GEO) – Gene expression, microarray, RNA-seq
https://www.ncbi.nlm.nih.gov/geo/ - European Nucleotide Archive (ENA) – Sequencing data
https://www.ebi.ac.uk/ena - ClinicalTrials.gov – Clinical study results
https://clinicaltrials.gov - ImmPort – Immunology data
https://www.immport.org
Social, Behavioral & Economic Sciences
- ICPSR (Inter-university Consortium for Political and Social Research) – Social science data
https://www.icpsr.umich.edu - UK Data Service – Social, economic, population data
https://ukdataservice.ac.uk - Harvard Dataverse – Social science dataverse collections
https://dataverse.harvard.edu
Physical Sciences & Engineering
- PANGAEA – Earth & environmental science
https://www.pangaea.de - NASA Earthdata – Earth science data
https://earthdata.nasa.gov - Materials Cloud – Materials science, computational datasets
https://www.materialscloud.org
Government & Public Data
- data.gov – U.S. government open data
https://data.gov - European Data Portal – EU open data
https://data.europa.eu - World Bank DataBank – Global development data
https://databank.worldbank.org
Code & Software Repositories
For sharing statistical analysis scripts, software, and computational workflows.
|
Platform |
Best For |
DOI Integration |
Link |
|
GitHub |
Code, version control, collaboration |
Via Zenodo integration |
|
|
GitLab |
Code, CI/CD, private repositories |
Via Zenodo integration |
|
|
Code Ocean |
Executable code capsules, reproducible environments |
Yes |
|
|
Swate |
Workflow annotation tool for computational studies |
Linked to data repositories |
Choosing a Repository: Quick Guide
- Check funder/institution requirements – Some mandate specific repositories.
- Consider your data type – Use domain-specific repositories if available.
- Ensure persistent identifiers – Choose repositories that issue DOIs or permanent URLs.
- Verify licensing options – Select licenses that allow reuse (CC0, CC BY recommended).
- Plan for long-term access – Preferred repositories commit to preservation.
Data Preparation Checklist
Before uploading, ensure your dataset includes:
- README file explaining data structure, variables, and units
- Clean, de-identified data (no personal identifiers unless consented)
- Metadata (title, authors, description, keywords, funding source)
- Codebook for variables (names, descriptions, allowed values)
- License (CC0 or CC BY recommended for maximum reuse)
- Citation information (how others should cite your data)
How to Cite Shared Data in Your Manuscript
In your Data Availability Statement, include:
- Repository name
- Persistent identifier (DOI, accession number)
- Direct URL to dataset
- Citation in references list (if DOI provided)
Example:
The data supporting this study are available in the Zenodo repository at https://doi.org/10.5281/zenodo.1234567.
Need Help?
- General repository selection: https://re3data.org – A global registry of research data repositories.
- Data management planning: https://dmptool.org/ – DMPTool for creating data management plans.
- AllStats support: Contact [datahelp@allstatsjournal.com] for guidance on data sharing for your submission.
FAQs
Q: My data contains sensitive personal information. What should I do?
A: Use controlled-access repositories (e.g., dbGaP for genomic data) or share de-identified, aggregated data. Contact the ethics editor if unsure.
Q: Is there a fee for data sharing?
A: Most general repositories are free. Some (like Dryad) charge a small fee, often covered by institutions or funders.
Q: Can I use my institutional repository?
A: Yes, if it provides persistent identifiers and long-term preservation. Please provide the DOI or permanent link.
Q: What if my field doesn’t have a dedicated repository?
A: Use a general repository like Zenodo or Figshare; they accept data from any discipline.
This list was last updated on 18 December 2025.
