Name File Type Size Last Modified
  census 01/28/2021 10:12:AM
  training 01/28/2021 10:13:AM
Figure2_Precision_Recall.ipynb text/plain 18 KB 01/28/2021 05:12:AM
Table4_Comparing_Classifiers.ipynb text/plain 10.4 KB 01/28/2021 05:12:AM
Table5_Feature_Importances.ipynb text/plain 5.2 KB 01/28/2021 05:12:AM

Project Description

Summary:  View help for Summary These files include data and code used to create tables and figures for "Combining Family History and Machine Learning to Link Historical Records." This was published as an NBER Working Paper (#26227) and is in the review process at an academic journal. We are seeking permission to share our full dataset that we obtained from FamilySearch. We have included the truth set that we use to train our model along with the machine learning code we use to link records using this truth set. Our truth set includes histid pairs between the different census records, which can be combined with the restricted versions of the full-count census records that are distributed through the University of Minnesota.



Related Publications

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.