Replication data for: Big Data: New Tricks for Econometrics
Principal Investigator(s): View help for Principal Investigator(s) Hal R. Varian
Version: View help for Version V1
Name | File Type | Size | Last Modified |
---|---|---|---|
ml-data | 10/21/2021 10:23:AM | ||
|
text/plain | 14.6 KB | 10/12/2019 01:54:PM |
Project Citation:
Varian, Hal R. Replication data for: Big Data: New Tricks for Econometrics. Nashville, TN: American Economic Association [publisher], 2014. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2019-10-12. https://doi.org/10.3886/E113925V1
Project Description
Summary:
View help for Summary
Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by economists.
Scope of Project
Subject Terms:
View help for Subject Terms
Causal Inference;
Big Data;
Machine Learning
JEL Classification:
View help for JEL Classification
C55 Large Data Sets: Modeling and Analysis
C55 Large Data Sets: Modeling and Analysis
Time Period(s):
View help for Time Period(s)
5/1/2014 – 5/1/2014
Universe:
View help for Universe
Example datasets to illustrate the methodology
Data Type(s):
View help for Data Type(s)
survey data;
census/enumeration data;
observational data;
program source code;
aggregate data
Methodology
Data Source:
View help for Data Source
Misc
Unit(s) of Observation:
View help for Unit(s) of Observation
Miscellaneous,
Related Publications
Published Versions
Report a Problem
Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.
This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.