FAQs

Depositing & Updating

Yes. openICPSR allows depositors to update files and metadata at any time. An updated project will get a new version and a new DOI, so that citations can indicate clearly which version of the data collection was used in analysis. Furthermore, users still can download earlier versions of the file, unless the depositor chooses to unpublish them.

If you feel you should have chosen a different option during the publish step (questions on disclosure and such), simply "unpublish" the study and "publish" it again. You will be presented with the same questions and can edit them as you see fit.

Prior to publishing, depositors have the option to delete any file. After publishing, depositors have the option to "unpublish" any file. openICPSR will continue to list the data/metadata, but the file no longer will be downloadable. Furthermore, ICPSR will keep an archived copy of the file. Depositors should note that users will be directed to contact the depositor if they are seeking an "unpublished" file. If a depositor needs a published file to be completely expunged from our system, s/he will need to contact ICPSR.

openICPSR is a research data-sharing service that allows depositors to rapidly self-publish research data, enabling the public to access the data without charge -- or in the case of restricted-use data, for nominal charge.

ICPSR has a rich 50-year history of archiving and preserving data collections. For standard deposits, professional staff review, curate and distribute data collections in multiple formats (e.g., SAS, SPSS, Stata, R, delimited). Preservation copies are made to better ensure long-term access. The openICPSR service is an alternative to the standard ICPSR deposit process, providing immediate, self-controlled distribution of data and metadata, albeit without the rich curation and preservation features; data in openICPSR are distributed and preserved as-is, exactly as they arrive.

openICPSR deposits should include all data and documentation necessary to independently read and interpret the data collection. We do not restrict file types. However, because openICPSR is a social and behavioral science data repository, each project should include research data or syntax that can be used with research data available through ICPSR or elsewhere. Documentation files or publications without accompanying data or code should not be deposited in openICPSR as standalone projects. ICPSR encourages individuals wishing to share abstracts or research papers without associated raw data to use another service, such as the Social Science Research Network.

Please do not create or publish test data on the OpenICPSR website. We have a testing process in place and welcome feedback on bugs, but we request that you not fill our database with fake metadata and random test files.

First, make sure you are using a supported Web browser (Google Chrome, Apple Safari, Mozilla Firefox). This website is not optimized for Internet Explorer. Second, make sure Javascript is allowed within your browser. This website requires Javascript to be allowed. Third, try turning off browser add-ons (e.g., Privacy Badger), which may block the website from loading. Finally, if none of these suggestions resolve the issue, please contact us.

Discovery & Access

OpenICPSR deposits are searchable on the ICPSR website within 24 hours of publication. ICPSR has an immediate distribution network of over 760 institutions looking for research data that has powerful search tools and a data catalog indexed by major search engines.

Yes, after an openICPSR project is published, it is automatically assigned a project citation, which includes a persistent digital object identifier (DOI). Users may also request a citation for individual folders and files by logging into openICPSR and clicking the "Generate Citation for this File" button on the individual folder or file page.

openICPSR will provide depositors with aggregate data on how many users have downloaded the data. We do not disclose users' download information to third parties.

OpenICPSR staff does not conduct bibliographic searches on data deposited in the Self-Deposit Package; however, we provide for all packages a data citation that end users can easily copy and paste into their publications. Each citation includes a DOI.

Purchasers of the Professional Curation Package or the Full Topic Archive (Data Collection) service are provided full-service bibliography search.

This service is not available as part of the Self-Deposit Package. If you wish ICPSR to create versions of your data for all major statistical applications, please purchase the Professional Curation Package.

Licenses

The federal Office of Science and Technology Policy (OSTP) memo "Increasing Access to the Results of Federally Funded Scientific Research" (February 13, 2013) indicates that federally funded research must be available for commercial use.

OpenICPSR is designed to meet federal open-access requirements. As such, we must use the Creative Commons license that meets those needs.

Preservation

OpenICPSR provides bit-level preservation and public access for the long term. Depositors can provide further funding to extend preservation and public access. Thereafter, unfunded deposits transfer to the ICPSR membership.

OpenICPSR files are archived in ICPSR's standard archival storage, which replicates holdings through multiple and varied methods and locations. All files uploaded to OpenICPSR are encrypted using server-side encryption.

Pricing

  1. Self-Deposit Package: Enables research scientists to deposit data on demand and provide immediate public access. Depositors prepare all files and metadata (meta tags). Once data are published, depositors get a DOI and a data citation; openICPSR conducts a metadata review to maximize exposure in ICPSR's catalog. Self-deposit is currently free for all users to share their data up to a 2GB limit.
  2. Professional Curation Package: Enables a research scientist to tap all aspects of ICPSR's curation services including full metadata generation and a bibliography search, stat package conversion, and user support. The costs for this service are primarily based on the number of variables and complexity of the data. Please contact ICPSR Acquisitions at deposit@icpsr.umich.edu for additional information.
  3. openICPSR for Institutions and Journals - Fully-hosted, branded for your organization, professionally-run, and trusted data repository that demonstrates the research impact of your organization. Appropriate for universities, journals, professional associations, research centers, and departments.

Yes. openICPSR fees should be written into grant/contract proposals that require public data access. Use of openICPSR for data sharing should be addressed in the data management plan and data deposit/data curation fees are an acceptable budget line within the grant application.

Sensitive Data

As part the terms and conditions presented when publishing a project, depositors must answer two questions pertaining to disclosure risk. The answers to those questions determine whether the data collection will be published as restricted data or made available for public download.

Users are required to apply for access to restricted data. Additionally, data users will be charged an administration fee to access restricted-use data.

Restricted-use data will be available to international users through the Virtual Data Enclave (VDE). Basically, international users use the same system as users within the United States, and have the same application requirements.

Disclosure analysis is not available for the Self-Deposit Package. Please note that you are attesting there are no direct or indirect identifiers as part of the deposit process. (Two examples are name and address.) It is the responsibility of the depositor to ensure there are no disclosure violations in his/her data.

If you wish ICPSR to conduct disclosure analysis of your data, you will need to purchase the Professional Curation package.

Advantages

  • Reduces Disclosure Risk. If identification of a respondent is possible based on information in the data, a responsible depositor should restrict the data to protect the identity of the respondent.

  • Protects Sensitive Information. Some depositors may want to restrict data to protect sensitive information that respondents have provided. For example, HIV status or other protected health status information elevate the importance in protecting respondent identity. Protecting sensitive information should be considered in conjunction with the types of potential identifiers that may be present in the data. Sensitive information may require a depositor to be more restrictive than they otherwise would have been.

  • Embargoes Data. By restricting data, the depositor delays access temporarily, ensuring that initial results may be published without being "scooped." Note that openICPSR is developing a delayed dissemination function to address this concern. Depositors will be able to specify a date after which the data will be publicly available. Currently, selecting to restrict files restricts all access until the depositor changes the access settings at a future date.

Disadvantages

  • Limits Access. If the research grant required that the data be publicly available, restricting access to data may fail to meet that obligation unless it is justified by the presence of potentially identifying variables in the data.

  • Negatively Affects Data Use Metrics. Restricted data does not typically generate the number of citations that publicly available data does, and those citations are viewed favorably by funding agencies (and peer reviewers of grant proposals) for evaluating future grant proposals of the depositor.

  • Reduces Research Impact. Data that are not readily available may have less overall impact on research in that field. It also may not contribute to the researcher's reputation as much as data that are readily available and widely used by other researchers.

  • Limits Transparency. Publicly available data can easily be replicated and demonstrates that the researcher is confident in his/her results and that his/her methods are credible. Restricting data reduce the visibility of the underlying methods and raise questions about the research.