Name File Type Size Last Modified
Maltz---Targonski-2002-A-Note-on-the-Use-of-County-Level-UCR-Data.pdf application/pdf 3.9 MB 01/21/2019 12:18:PM

Project Citation: 

Kaplan, Jacob. Jacob Kaplan’s Concatenated Files: Uniform Crime Reporting (UCR) Program Data: County-Level Detailed Arrest and Offense Data. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2021-01-16. https://doi.org/10.3886/E108164V5

Project Description

Summary:  View help for Summary
Version 5 release notes:
  • Changes release notes description, does not change data.
Version 4 release notes:
  • I am retiring this dataset - please do not use it. 
  • The reason that I made this dataset is that I had seen a lot of recent articles using the NACJD version of the data and had several requests that I make a concatenated version myself. This data is heavily flawed as noted in the excellent Maltz & Targonski's (2002) paper (see PDF available to download here and important paragraph from that article below) and I was worried that people were using the data without considering these flaws. So the data available here had the warning below this section (originally at the top of these notes so it was the most prominent thing) and had the Maltz & Targonski PDF included in the zip file so people were aware of it. 
  • There are two reasons that I am retiring it.
    • First, I see papers and other non-peer reviewed reports still published using this data without addressing the main flaws noted by Maltz and Targonski. I don't want to have my work contribute to research that I think is fundamentally flawed.
    • Second, this data is actually more flawed that I originally understood. The imputation process to replace missing data is based off of a bad design, and Maltz and Targonski talk about this in detail so I won't discuss it too much. The additional problem is that the variable that determines whether an agency has missing data is fatally flawed. That variable is the "number_of_months_reported" variable which is actually just the last month reported. So if you only report in December it'll have 12 months reported instead of 1. So even a good imputation process will be based on such a flawed measure of missingness that it will be wrong. How big of an issue is this? At the moment I haven't looked into it in enough detail to be sure but it's enough of a problem that I no longer want to release this kind of data (within the UCR data there are variables that you can use to try to determine the actual number of months reported but that stopped being useful due to a change in the data in 2018 by the FBI. And even that measure is not always accurate for years before 2018.).

Scope of Project

Subject Terms:  View help for Subject Terms crime; violent crime statistics; crime; victimless crimes; national crime statistics (USA); ucr; Uniform Crime Reports; arrest; arrest; arrest rates
Geographic Coverage:  View help for Geographic Coverage Counties in the United States
Time Period(s):  View help for Time Period(s) 1960 – 2017 (1960-2017 for crime data, 1974-2016 for arrest data)
Data Type(s):  View help for Data Type(s) administrative records data

Methodology

Unit(s) of Observation:  View help for Unit(s) of Observation County
Geographic Unit:  View help for Geographic Unit County

Related Publications

This study is un-published. See below for other available versions.

Published Versions

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.