A newer version of this project is available. See below for other available versions.
Jacob Kaplan's Concatenated Files: Uniform Crime Reporting (UCR) Program Data: Arson 1979-2019
Principal Investigator(s): Jacob Kaplan, University of Pennsylvania
Version: V8
| Name | File Type | Size | Last Modified |
|---|---|---|---|
| | application/zip | 104 MB | 10/20/2020 07:01 PM |
| | application/zip | 105.7 MB | 10/20/2020 07:01 PM |
| | application/zip | 65.9 MB | 10/20/2020 06:59 PM |
| | application/zip | 13.9 MB | 10/20/2020 06:58 PM |
| | application/zip | 14.7 MB | 10/20/2020 07:03 PM |
| | application/zip | 14 MB | 10/20/2020 07:01 PM |
Project Description
Version 8 release notes:
- Adds 2019 data.
- Note that the number of months missing variable changes sharply starting in 2018. This is probably due to changes in UCR reporting of the column_2_type variable, which is used to generate the months missing count (the code I used does not change). So pre-2018 and 2018+ years may not be comparable for this variable.
- Adds a last_month_reported column which says which month was reported last. This is actually how the FBI defines number_of_months_reported, so it is a more accurate representation of that. Removes the number_of_months_reported variable as the name is misleading. You should use the last_month_reported or the number_of_months_missing (see below) variable instead.
- Adds a number_of_months_missing variable in the annual data, which is the number of months in which the agency's card_2_type variable either says "missing" (i.e. the agency did not report that month) or is NA. Please note that this variable is not perfect: sometimes an agency does not report data but this variable does not say it is missing. Therefore, this variable will not be perfectly accurate.
- Adds 2018 data.
- Adds data in the following formats: SPSS and Excel.
- Changes project name to avoid confusing this data with the versions produced by NACJD.
- Adds 1979-2000, 2006, and 2017 data
- Adds agencies that reported 0 months.
- Adds monthly data.
- All data now from FBI, not NACJD. The R code I used to read in the files and clean the data, along with the setup files made to read them in, is available at https://github.com/jacobkap/crime_data
- Changes some column names so all columns are <=32 characters to be usable in Stata.
- Adds data for 2016.
- Orders rows by year (descending) and ORI.
- Removes data from Chattahoochee Hills (ORI = "GA06059") from 2016 data. In 2016, that agency reported about 28 times as many vehicle arsons as their population (total mobile arsons = 77762, population = 2754).
- Fixes bug where Philadelphia Police Department had incorrect FIPS county code.
- Oneida, New York (ORI = NY03200) had multiple years that reported single arsons costing over $700 million. I deleted this agency from all years of data.
- In January 1989 Union, North Carolina (ORI = NC09000) reported 30,000 arsons in uninhabited single occupancy buildings and none in any other month.
- In December 1991 Gadsden, Florida (ORI = FL02000) reported that a single arson at a community/public building caused $99,999,999 in damages (the maximum possible).
- In April 2017 St. Paul, Minnesota (ORI = MN06209) reported 73,400 arsons in uninhabited storage buildings and 10,000 arsons in uninhabited community/public buildings, and one or fewer in every other month.
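
The months-missing and last-month-reported definitions in the notes above, and the kind of arsons-to-population check behind the Chattahoochee Hills removal, can be illustrated with a minimal Python sketch. This is not the code used to build the data (that is the R code in the linked repository). The function names, the "reported"/"missing" status labels, and the use of None for NA are assumptions made for illustration; the actual coding of card_2_type is not given in these notes, and the real last_month_reported column may store a month name rather than a number.

```python
from typing import Optional, Sequence

# Hypothetical label for a month the agency did not report.
# The real values of card_2_type are not documented in these notes.
MISSING = "missing"


def number_of_months_missing(card_2_type: Sequence[Optional[str]]) -> int:
    """Count months whose status is 'missing' or NA (None here)."""
    return sum(1 for status in card_2_type if status is None or status == MISSING)


def last_month_reported(card_2_type: Sequence[Optional[str]]) -> int:
    """Return the 1-based number of the last month with a report, or 0 if none."""
    last = 0
    for month, status in enumerate(card_2_type, start=1):
        if status is not None and status != MISSING:
            last = month
    return last


def arsons_per_resident(total_arsons: int, population: int) -> float:
    """Ratio of reported arsons to population, a rough plausibility check."""
    return total_arsons / population


# Illustrative agency-year: reports in January, February, and May only.
statuses = ["reported", "reported", "missing", None, "reported"] + [MISSING] * 7

print(number_of_months_missing(statuses))  # 9
print(last_month_reported(statuses))       # 5

# Figures quoted in the notes for Chattahoochee Hills in 2016.
print(round(arsons_per_resident(77762, 2754), 1))  # 28.2
```

In this made-up example the last month reported is May (5) even though only three months were actually reported, which shows why the two variables can disagree and why the old number_of_months_reported name was misleading.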
Published Versions
This material is distributed exactly as received from the data depositor. As of April 2026, depositors are required to submit study materials in accessible formats. ICPSR has not reviewed, checked, or processed this material. For additional information about the study, please contact the investigator(s) directly. If you have questions about the accessibility of materials distributed by ICPSR or require further assistance, please visit ICPSR's Accessibility Center.