Name File Type Size Last Modified
County SDOH Data Documentation.docx application/vnd.openxmlformats-officedocument.wordprocessingml.document 35.5 KB 12/01/2025 10:32:AM
County SDOH Data Documentation.pdf application/pdf 202.4 KB 12/01/2025 10:39:AM
SDOHFull.RData application/x-rlang-transport 25.9 MB 11/26/2025 12:59:PM
SDOHMetadata.RData application/x-rlang-transport 25.5 KB 11/26/2025 01:08:PM
SDOHMetadata.xlsx application/vnd.openxmlformats-officedocument.spreadsheetml.sheet 96.4 KB 11/26/2025 01:08:PM
SDOHRaw.RData application/x-rlang-transport 5.6 MB 11/26/2025 01:17:PM
SDOHRed.RData application/x-rlang-transport 20.6 MB 11/26/2025 01:08:PM

Project Citation: 

Crown, William, Adams, Rachel, and Larson, Mary Jo. County Social Determinants of Health Data Pre-Processed to Facilitate Machine Learning/Multivariate Analysis. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2025-12-01. https://doi.org/10.3886/E227481V2

Project Description

Summary:  View help for Summary These datasets contain data from the AHRQ Social Determinants of Health (SDOH) Database (https://www.ahrq.gov/sdoh/data-analytics/sdoh-data.html), processed to facilitate machine learning/multivariate analyses focusing on the healthcare context of counties. The datasets derive from the AHRQ 2019 and 2018 county-level SDOH files. Three sets of files are provided. The first "Raw" set has the source SDOH data with a few core pre-processing steps applied. The second, “Full” set has variables characterizing the health and healthcare context of counties (rather than outcomes), with further processing steps applied to facilitate multivariate and machine learning analytics (e.g. handling of missing data, normalizing, standardizing). The third set, labeled “Reduced”, incorporates those same data processing steps but in addition has had a further data reduction step applied in which groups of highly intercorrelated variables were removed and replaced with corresponding principal component scores, one for each group. These files would be useful for investigators interested in characterizing and comparing the broad SDOH context of US counties.

Scope of Project

Subject Terms:  View help for Subject Terms Social determinants of health
Geographic Coverage:  View help for Geographic Coverage US Counties
Time Period(s):  View help for Time Period(s) 1/1/2018 – 12/31/2019
Universe:  View help for Universe Us counties, excluding those in overseas territories
Data Type(s):  View help for Data Type(s) aggregate data

Methodology

Data Source:  View help for Data Source The data files are derived from the AHRQ Social Determinants of Health (SDOH) database (https://www.ahrq.gov/sdoh/data-analytics/sdoh-data.html).

Related Publications

Published Versions

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as received from the data depositor. As of April 2026, depositors are required to submit study materials in accessible formats. ICPSR has not reviewed, checked, or processed this material. For additional information about the study, please contact the investigator(s) directly. If you have questions about the accessibility of materials distributed by ICPSR or require further assistance, please visit ICPSR's Accessibility Center.