[Replication package for] Reconstructing history: Using language to estimate religious spread
Principal Investigator(s): View help for Principal Investigator(s) Julian Dyer, University of Exeter; Arthur Blouin, University of Toronto
Version: View help for Version V1
Name | File Type | Size | Last Modified |
---|---|---|---|
|
Unknown | 151 KB | 06/02/2022 02:18:AM |
|
application/zip | 4.7 GB | 06/16/2025 12:52:PM |
Project Citation:
Dyer, Julian, and Blouin, Arthur. [Replication package for] Reconstructing history: Using language to estimate religious spread. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2025-06-17. https://doi.org/10.3886/E233003V1
Project Description
Summary:
View help for Summary
This is the replication package for "Reconstructing history: Using language to estimate religious spread" Forthcoming in the Journal of Economic History, December 2025 issue.
We introduce a data-driven approach to use language to reconstruct history, and apply the methodology to estimate the geographic origins of religious spread. To validate the approach, we use language data to estimate origins of Islam and Buddhism to within 500km of their true (and uncontested) origins. We then apply the methodology to the more complex (and contested) cases of Christianity, Judaism and Hinduism. We show that language-based estimates, in these cases, are significantly more aligned with the origin of scripture than to the origin of the religion.
We introduce a data-driven approach to use language to reconstruct history, and apply the methodology to estimate the geographic origins of religious spread. To validate the approach, we use language data to estimate origins of Islam and Buddhism to within 500km of their true (and uncontested) origins. We then apply the methodology to the more complex (and contested) cases of Christianity, Judaism and Hinduism. We show that language-based estimates, in these cases, are significantly more aligned with the origin of scripture than to the origin of the religion.
Scope of Project
Subject Terms:
View help for Subject Terms
Economic History;
Cultural Economics
Geographic Coverage:
View help for Geographic Coverage
Global
Universe:
View help for Universe
All languages catalogues in the PanLex dataset, with additional restrictions on coverage in other lexical and linguistic datasets.
Data Type(s):
View help for Data Type(s)
geographic information system (GIS) data;
images: photographs, drawings, graphical representations;
program source code;
text
Collection Notes:
View help for Collection Notes
Contains raw data and replication code that will generate the original predicted loanwords data, as well as the topic clustering and network construction. This also includes the code to produce network measures, solve for predicted origins, and aggregate these predicted origins.
Methodology
Data Source:
View help for Data Source
Please see published article for full details on data sources and how they were used.
Unit(s) of Observation:
View help for Unit(s) of Observation
Neighbouring language-pair,
Language
Geographic Unit:
View help for Geographic Unit
Ethnolinguistic group as recorded in Ethnologue
Related Publications
Published Versions
Report a Problem
Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.
This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.