All virtual services are available and some libraries are open for in-person use, while others remain closed through January 23, 2022. Learn more.


Harvard Dataverse

Harvard's open online repository for sharing, preserving, citing, exploring and analyzing research data.

Harvard Dataverse is an online data repository where you can share, preserve, cite, explore, and analyze research data. It is open to all researchers, both inside and out of the Harvard community.
The Harvard Dataverse repository runs on the open-source web application Dataverse, developed at the Institute for Quantitative Social Science. Dataverse facilitates making data available to others, and allows you to replicate others' work more easily.
Researchers, journals, data authors, publishers, data distributors, and affiliated institutions all receive academic credit and web visibility.
A Dataverse repository is the software installation, which then hosts multiple virtual archives called dataverses. Each dataverse contains datasets, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data). As an organizing method, dataverses may also contain other dataverses.

For Researchers

  • A personal dataverse is easy to set up,
  • allows you to display your data on your personal website,
  • can be branded uniquely as your research program,
  • makes your data more discoverable to the research community,
  • and satisfies data management plans.

For Scholars

Harvard Dataverse provides access to a rich array of datasets to support your research. Harvard Dataverse offers advanced searching and text mining in over 2.000 dataverses, 75,000 datasets, and 350,000+ files, representing institutions, groups, and individuals at Harvard and beyond.