Sharing Data

Does project funding require your data to be shared or publicly accessible?

In order to promote open access to research data, many funding agencies now require that research data produced as part of a funded project be made publicly available.

Researchers can comply with these requirements by depositing their data into one of the many available data repositories. For tips on creating a data sharing plan, see the NIH examples of data sharing plans.

When and where do you intend to publish or distribute your data?

You can share your data informally by emailing it to requestors, or by posting it to a website or to a service hosted by Google, Amazon, or Microsoft. However, these methods of sharing make it difficult for people to find your data. Depositing your data in an archive will facilitate its discovery and preservation.

Publish Your Data in a Repository


Note: Not all of the repositories listed can ensure long-term preservation of your data; contact each one for more details. This list contains suggestions and is not necessarily complete. For a more complete list of data repositories, see these sites:

Data Created at UC (Any Discipline)

Merritt — a new cost-effective repository service from the University of California Curation Center (UC3) that lets the UC community manage, archive, and share its valuable digital content. Use Merritt to provide long-term preservation of digital assets, share your research with others or meet the data sharing and preservation requirements of a grant-funded project. For more information contact UC3.

eScholarship — an open access publishing platform that offers UC departments, centers, and research units direct control over the creation and dissemination of the full range of their scholarship, including working papers, peer-reviewed journals, monographic series, paper/seminar series, postprints, and conference proceedings. Contact the CDL Publishing Group for more information.

Science and Engineering

  • Archaeology
  • Astronomy
  • Atmospheric Science
  • Life Sciences
    • Dryad — Dryad is an international repository of data underlying peer-reviewed articles in the basic and applied biosciences.
    • Protein Data Bank — an information portal to biological macromolecular structures.
    • UniProt — Submit a new protein sequence to UniProtKB using SPIN, a web-based tool for submitting directly sequenced protein sequences to the Universal Protein Resource (for new nucleotide submissions, use EMBL's WEBIN instead).
  • Chemistry
    • PubChem — provides information on the biological activities of small molecules. It includes substance information, compound structures, and bioactivity data in three primary databases, PCSubstance, PCCompound, and PCBioAssay, respectively.
  • Computer Science
  • Earth Science
    • GEON — Portal for sharing, publishing, and integrating data.
  • Oceanography
  • Snow and Ice
    • National Snow and Ice Data Center — NSIDC archives cryospheric data. NSIDC acknowledges all data providers in documentation, metadata (Directory Interchange Format (DIF)), and references, and is also willing to hold or restrict data distribution until providers publish.
  • Space Science
    • National Space Science Data Center — NSSDC accepts data from active archives in the space sciences funded through the Science Mission Directorate, from missions in that same directorate, and from individual scientists (mission or instrument principal investigators).

Social Sciences

Arts and Humanities

  • Cultural Policy and the Arts National Data Archive — the world's first interactive digital archive of policy-relevant data on the arts and cultural policy in the United States. It is a collaborative effort of Princeton University's Firestone Library and the Princeton Center for Arts and Cultural Policy Studies, with support from the Pew Charitable Trusts.

How do I cite data?

See the DataCite Metadata Schema Repository for recommendations on what information to include and how to format it.

A quick summary: when formatting a data citation for human readers, a complete citation should include this information in the following form:

[Creator] ([PublicationYear]): [Title]. [Publisher]. [PidInURLform]

where [Publisher] is the data archive that holds the data and [PidInURLform] represents a persistent identifier in actionable form (i.e., embedded in a URL). Here are some examples:

  • Denhard, Michael (2009): dphase_mpeps: MicroPEPS LAF-Ensemble run by DWD for the MAP D-PHASE project. World Data Center for Climate.
    http://dx.doi.org/10.1594/WDCC/dphase_mpeps
  • Manoug, J L (1882): Useful data on the rise of the Nile. Alexandria : Printing-Office V Penasson.
    http://n2t.net/ark:/13960/t44q88124
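
The citation form above can be assembled mechanically. The following is a minimal Python sketch, not part of the DataCite recommendations: the function names pid_as_url and format_data_citation are hypothetical, and the resolver addresses are simply the ones that appear in the two examples above.

```python
def pid_as_url(resolver, identifier):
    """Embed a persistent identifier in a URL (hypothetical helper).

    `resolver` is the base address of a resolver service, for example
    "http://dx.doi.org" for a DOI or "http://n2t.net" for an ARK, as in
    the examples above.
    """
    return f"{resolver.rstrip('/')}/{identifier.lstrip('/')}"


def format_data_citation(creator, year, title, publisher, pid_url):
    """Return '[Creator] ([PublicationYear]): [Title]. [Publisher]. [PidInURLform]'.

    Trailing periods on the title and publisher are dropped first so that
    the separators are not doubled.
    """
    title = title.strip().rstrip(".")
    publisher = publisher.strip().rstrip(".")
    return f"{creator} ({year}): {title}. {publisher}. {pid_url}"


# Rebuild the first example citation from its parts.
citation = format_data_citation(
    creator="Denhard, Michael",
    year=2009,
    title="dphase_mpeps: MicroPEPS LAF-Ensemble run by DWD for the MAP D-PHASE project",
    publisher="World Data Center for Climate",
    pid_url=pid_as_url("http://dx.doi.org", "10.1594/WDCC/dphase_mpeps"),
)
print(citation)
```

The sketch writes the citation on a single line; the examples above place the identifier on its own line, which is a display choice rather than part of the form.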

Creator names in non-Roman scripts should be transliterated using the ALA-LC Romanization Tables.

Privacy and Intellectual Property

When publishing data, it is vital to consider the rights and responsibilities you have with regard to issues of confidentiality and intellectual property.

Confidentiality

It is vital to maintain the confidentiality of research subjects, both for ethical reasons and to ensure continued participation in research.

Intellectual Property Issues

Sharing data that you produced/collected yourself:

  • Data are not copyrightable (yet a particular expression of data can be, such as a chart or table in a book).
  • Data can be licensed; some data providers apply licenses that limit how the data can be used, such as to protect the privacy of participants in a study or guide downstream uses of the data (e.g., forbidding for-profit use).
  • If you want to promote sharing and unlimited use of your data, you can make your data available under a Creative Commons CC0 Declaration to make this explicit.

Sharing data that you have collected from other sources:

  • You may or may not have the rights to do so, depending upon whether those data were accessed under a license with terms of use.
  • Most databases to which the UC Libraries subscribe are licensed and prohibit redistribution of data outside of UC. For more information on terms of use for databases licensed by the Libraries, contact UC3.

If you are a UC researcher and are uncertain of your rights to disseminate data, you can consult your campus Office of General Counsel. Note: Laws about data vary outside the U.S.

For a general discussion about publishing your data, applicable to many disciplines, see the ICPSR Guide to Social Science Data Preparation and Archiving (pdf).


Credit to MIT Libraries for permission to use and adapt their pages and to members of the UC3 community.
Please send us any comments about these guidelines.

Creative Commons License

Last updated: March 14, 2014
Document owner: Perry Willett