Name File Type Size Last Modified
  aea_pnp_replication 05/01/2023 01:33:PM
README.txt text/plain 3.4 KB 05/01/2023 11:09:AM

Project Citation: 

BRYNJOLFSSON, ERIK, BUFFINGTON, CATHERINE, GOLDSCHLAG, NATHAN, LI, J. FRANK, MIRANDA, JAVIER, and SEAMANS , ROBERT. Data and Code for: Robot Hubs: The Skewed Distribution of Robots in U.S. Manufacturing. Nashville, TN: American Economic Association [publisher], 2023. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2023-05-04. https://doi.org/10.3886/E190543V1

Project Description

Summary:  View help for Summary
New technologies drive productivity growth (Romer 1990). Preliminary evidence using national-level data from 17 countries between 1993 and 2007 suggests that robots, like prior generations of general-purpose technologies, are also driving productivity growth (Graetz & Michaels 2018). Moreover, according to data compiled by the International Federation of Robotics (IFR), since 2010 the number of robot shipments has nearly quadrupled from about 100,000 to almost 400,000 per year, so the impact of robots on the economy is likely even greater. However, while robotics may be contributing to GDP growth at a national level, scholars are still working to understand how robots affect employment and other outcomes at other levels of analysis, leading to calls for establishment-level measures of robots and other new technologies (Brynjolfsson & Mitchell 2017; Raj and Seamans 2019).

To address this need, the U.S. Census Bureau, working with external researchers, developed a series of questions on the adoption and use of robots. These questions have subsequently been included in the Annual Survey of Manufactures and other Census surveys (Buffington, Miranda & Seamans 2018; Brynjolfsson et al. 2020; Acemoglu et al. 2022). In this paper we present results on the distribution of robots in U.S. manufacturing, using the new establishment-level microdata collected by the U.S. Census Bureau. We use the data to present several facts about the location and use of robots.

We find that the distribution of robots is highly skewed across locations, even accounting for the different mix of industry and manufacturing employment. Some locations - which we call “Robot Hubs” - have many more robots than one would expect after accounting for industry and manufacturing employment. We characterize these Robot Hubs along several industry, demographic, and institutional dimensions, and find that the presence of robot integrators and union membership are distinguishing features of Robot Hubs.

Scope of Project

Subject Terms:  View help for Subject Terms Robots; manufacturing industry; technology; geography
JEL Classification:  View help for JEL Classification
      O30 Innovation; Research and Development; Technological Change; Intellectual Property Rights: General
Geographic Coverage:  View help for Geographic Coverage United States
Time Period(s):  View help for Time Period(s) 1/1/2018 – 12/31/2018 (2018 Annual Survey of Manufacturers)
Universe:  View help for Universe The Annual Survey of Manufacturers (ASM) is sent to a sample of approximately 50,000 establishments in the manufacturing sector every year.3 The frame, constructed using the Census Bureau’s Business Register and Economic Censuses (conducted every five years), is segmented into mail and non-mail components. The non-mail component, roughly two-thirds of all manufacturing establishments (approximately 187,000), contains plants that, due to their size, are not eligible to receive a mailed form. Data for these establishments are based on administrative
records (e.g., employment) or are entirely imputed (e.g., using capital expenditures). The mail-eligible sample, roughly one-third of all manufacturing establishments (approximately 102,000), assigns all plants a stratified random probability of receiving a form; large plants are sampled with certainty, and the remainder are assigned probabilities proportionate to size and are sampled
within industries and product classes. Of the mail-eligible sample, roughly half will be surveyed. Sample weights are applied to surveyed plants to recover the full mail-eligible sample. Our analyses focus on the subset of the mail-eligible sample with reported values, weighted with sample weights throughout. Our analysis shows that robot users are relatively large and disproportionately likely to fall into the eligible sample that we focus on. That said, in future years, it will be increasingly important to monitor the behavior of non-mail units, particularly if robots become more accessible to smaller establishments.

Data Type(s):  View help for Data Type(s) survey data

Methodology

Response Rate:  View help for Response Rate Two types of response rates are computed for the ASM: unit response rate (URR) and total quantity response rate (TQRR).  The URR is the percentage of reporting units, based on unweighted counts, that were eligible (E) or of unknown eligibility (U) and were respondents (R) in the statistical period.  Cases are assumed to be active and in scope in the absence of evidence otherwise.  This includes cases that are Undeliverable as Addressed.  To be considered a respondent to the ASM, a reporting unit must provide both of the key items: value of shipments and total payroll.  The formula for calculating the URR is as follows: URR = [R/(E+U)] * 100.  The URR for the 2021 ASM was 58%. The TQRR is defined as the percentage of the estimated (weighted) item total that is obtained from directly reported data or from sources determined to be equivalent quality to reported data.  The 2021 TQRR was 68.06% for value of shipments, and 73.95% for total payroll.
Sampling:  View help for Sampling The 2017 Economic Census- Manufacturing contained 291,576 active manufacturing establishments.  For sample efficiency and cost considerations, the population was partitioned into two groups: (1) establishments eligible to be mailed a questionnaire; and (2) establishments not eligible to be mailed a questionnaire.  The following is a description of the 2019 ASM sample design:  
  1. Establishments that are eligible to be sent a report form:

    This is defined as the mail stratum.  It is comprised of larger single-location manufacturing companies and all manufacturing establishments of multi-location companies.  The mail stratum is comprised of 102,468 establishments.  On an annual basis, the mail stratum is supplemented with large, newly active single-location companies identified from a list provided by the IRS and new manufacturing locations of multi-location companies identified from the Census Bureau’s COS.

The 2019 ASM sample design is similar to the 2014-2018 sample design. The only significant change is the products universe file was created from the North American Product Classification System (NAPCS) codes.

Establishments in the 2017 Economic Census - Manufacturing that satisfied any of the following criteria are included in the sample with certainty: (1) the total 2017 employment for the establishment is greater than or equal to 1,000; (2) the establishment is identified as one of the ten largest establishments within the industry (based on employment); (3) the establishment is classified within an industry with less than 20 establishments; (4) the establishment is classified in the computer or flat-glass or sugar industry; (5) the establishment is located within a state where there are less than 20 additional establishments in the same North American Industry Classification System (NAICS) group (NAICS group is defined as the set of NAICS industries that have the same first four digits); or (6) the establishment is one of the largest establishments in terms of cost of fuels used, cost of electricity used, end-of-year inventories, end-of-year assets, or LIFO inventories.  Collectively, there are 16,621 establishments that are selected with certainty.  These establishments accounted for approximately 70 percent of the total value of shipments in the 2017 Economic Census - Manufacturing.

Establishments in the remaining portion of the mail stratum are sampled with probabilities ranging from .05 to 1.00.  Each of the 360 industries and 2,184 product classes are considered to be a separate population.  Using variable reliability constraints, each establishment within a given population is assigned an initial probability of selection that reflects its relative importance within the population.  Establishments producing products in multiple product classes receive multiple initial probabilities.  The final probability of selection for a specific establishment is defined as the largest of their initial probabilities.

This method of assigning probabilities is motivated by the Census Bureau's primary desire to produce reliable estimates of both product class and industry shipments.  The high correlation between shipments and employment, value-added, and other general statistics assures that these variables will also be well represented.  For sample selection purposes, each establishment is assigned to an industry stratum.  Within each of the 360 industry strata, an independent sample is selected using the final probability of selection associated with the establishments classified within the stratum.  A fixed-sample size methodology is used to assure that the desired sample size is realized. The total sample size for 2019 is 49,414.
2. Establishments not eligible to be sent a report form: This is defined as the nonmail stratum.  The nonmail stratum consists of small- and medium-sized, single-establishment companies from the Economic Census - Manufacturing.  The  initial nonmail stratum of the 2019 sample contained 186,670 single-establishment companies from the 2017 Economic Census - Manufacturing.

The nonmail stratum is supplemented annually using the list of newly active single-location companies provided by the IRS.  Data for establishments included in the nonmail stratum are estimated using information obtained from the administrative records of the IRS and Social Security Administration (SSA); and are included in the published ASM estimates.  This administrative information, which includes payroll, total employment, industry classification, and physical location, is obtained under conditions which safeguard the confidentiality of both tax and census records.

Data Source:  View help for Data Source Annual Survey of Manufactures (ASM)
Weights:  View help for Weights Estimation

Most of the ASM estimates derived for the mail stratum are computed using a difference estimator.  The difference estimator takes advantage of the fact that, for manufacturing establishments, there is a strong correlation between the current-year data values and the previous Census values.  Because of this correlation, difference estimates are generally more reliable than comparable estimates developed from the current sample data alone.  The ASM difference estimates are computed at the establishment level by adding the weighted difference (between the current data and the Census data) to the Census data.  That is,

Difference Estimate = Census value + weight(Current value - Census value)

Or equivalently

Difference Estimate = weight(Current value) + (1-weight)Census value

Estimates for the capital expenditures variables are not generated using the difference estimator because the year-to-year correlations are considerably weaker.  The standard linear estimator is used for these variables.

For the nonmail stratum, estimates for payroll are directly tabulated from the administrative-record data provided by the IRS and the SSA.  Estimates of the other data variables are developed from industry averages.  Although the establishments in the nonmail stratum are far more numerous than those in the mail stratum, they account for less than 6 percent of the value of shipments estimate at the total manufacturing level.

Corresponding estimates for the mail and nonmail components are combined to produce the estimates included in this publication.

Unit(s) of Observation:  View help for Unit(s) of Observation Establishment

Related Publications

Published Versions

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.