Name File Type Size Last Modified
census1900_for_19001910test.csv text/csv 2 MB 01/28/2021 05:12:AM
census1900_for_19001910train.csv text/csv 4 MB 01/28/2021 05:12:AM
census1910_for_19001910test.csv text/csv 7.1 MB 01/28/2021 05:12:AM
census1910_for_19001910train.csv text/csv 23.6 MB 01/28/2021 05:12:AM
census1910_for_19101920test.csv text/csv 1.5 MB 01/28/2021 05:12:AM
census1910_for_19101920train.csv text/csv 2.1 MB 01/28/2021 05:12:AM
census1920_for_19101920test.csv text/csv 8 MB 01/28/2021 05:12:AM
census1920_for_19101920train.csv text/csv 18.5 MB 01/28/2021 05:12:AM

Project Description

Summary:  View help for Summary These files include data and code used to create tables and figures for "Combining Family History and Machine Learning to Link Historical Records." This was published as an NBER Working Paper (#26227) and is in the review process at an academic journal. We are seeking permission to share our full dataset that we obtained from FamilySearch. We have included the truth set that we use to train our model along with the machine learning code we use to link records using this truth set. Our truth set includes histid pairs between the different census records, which can be combined with the restricted versions of the full-count census records that are distributed through the University of Minnesota.



Related Publications

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.