Preserve & Share

Make your research outputs findable, accessible, and citable

Data Sharing Code Publishing Open Access Persistent Identifiers

Checkpoint: Manuscript submission with DOIs for preregistration, data, and code

Data Sharing

Sharing your data allows others to verify your findings, build on your work, and increase the impact of your research. But sharing does not always mean making everything publicly available. What you can share depends on the consent you obtained, the sensitivity of your data, and your ethics approval.

If you wrote a Data Management Plan in Phase 1, this is where you implement it.

In Phase 2 you created documentation for yourself and collaborators. Now you prepare that documentation for people who have no prior knowledge of your project. You also decide how openly your data can be shared, select an appropriate repository, and apply a license that tells others how they may reuse your work.

This step is straightforward if you followed Phase 2. You already have a README, a codebook, organized files, and clear metadata.

Not all data can or should be shared openly. Your sharing options depend on what consent participants gave and what your ethics approval permits.

Open data can be downloaded by anyone without restrictions. This maximizes reuse but is only appropriate when data contains no personal information or has been fully anonymized.

Restricted access means data is available on request, often requiring a Data Use Agreement. The requester typically must describe their purpose and agree to conditions. This balances reuse with protection.

Metadata only describes what data exists without sharing the data itself. Others can discover your work and contact you to discuss access. This is appropriate when data cannot be shared due to legal or ethical constraints.

When deciding, ask: What did participants consent to? What does your ethics approval allow? Can the data be anonymized without losing value? If you work with sensitive data, consult your institution’s data protection officer.

In Phase 2 you created a README and codebook for yourself and collaborators. Before sharing publicly, review them from the perspective of someone who knows nothing about your project.

Expand your README to include the research context, how to cite the dataset, and any access conditions. See the Documentation tab in Phase 2 for what a complete README contains.

Review your codebook to ensure every variable is fully described. What was obvious to you during analysis may need explanation for others.

Follow community standards. Many fields have established formats for sharing data (e.g., BIDS for neuroimaging). Using these makes your data immediately usable with existing tools. See the Standards tab in Phase 2 for how to find conventions in your domain.

Verify anonymization. Check that no personal information remains in the data files, metadata, or filenames.

FAIR Data Management (Tutorial)

Documentation, organization, and metadata

Comprehensive guide to managing research data following FAIR principles. Covers READMEs, codebooks, file formats, and standards.

Choose a repository that suits your data type, your field’s expectations, and your access requirements.

Discipline-specific repositories are often the best choice. They use metadata standards your community expects, making your data findable by researchers in your field. Search re3data to find repositories for your domain.

General-purpose repositories like Zenodo, Figshare, or OSF accept any data type. They provide DOIs and long-term preservation, but may lack the specialized metadata fields of discipline-specific options.

Institutional repositories may be required by your funder or institution. Check your grant terms and institutional policies.

Whichever you choose, ensure the repository provides a DOI (Digital Object Identifier) so your data can be cited and tracked. See the Identity & Credit panel for more on persistent identifiers.

re3data

Registry of research data repositories

Search by discipline to find repositories that accept your type of data and meet your requirements.

Zenodo

General-purpose repository for any research output

Free, open repository hosted by CERN. Accepts any file type, assigns DOIs, and integrates with GitHub.

A license tells others what they can do with your data. Without one, others cannot legally reuse your work, even if it is publicly available.

CC0 (Public Domain) places no restrictions. Anyone can use, modify, and redistribute without attribution. Recommended for factual data where maximum reuse is the goal.

CC BY (Attribution) requires users to credit you. A good default when you want recognition while enabling broad reuse.

CC BY-NC (Non-Commercial) adds a restriction against commercial use. Consider whether this limitation actually serves your goals.

For restricted-access data, a Data Use Agreement specifies conditions beyond a standard license: approved purposes, security requirements, publication terms, and data destruction timelines.

Choose a License (Tutorial)

Guidance on selecting the right license

Covers licenses for data, code, and other research outputs with decision flowcharts.

Code Publishing

Open Access

Identity & Credit