A machine learning model for assessing IO legitimation in the media
Principal Investigator(s): View help for Principal Investigator(s) Blind Review
Version: View help for Version V1
Name | File Type | Size | Last Modified |
---|---|---|---|
|
application/x-rlang-transport | 16.4 MB | 10/24/2021 01:53:AM |
Project Citation:
Review, Blind. A machine learning model for assessing IO legitimation in the media. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2021-10-24. https://doi.org/10.3886/E152961V1
Project Description
Summary:
View help for Summary
The media channel is crucial for the legitimation of International Organizations (IOs). This paper applies a machine learning framework to develop a new model of IO legitimation. The model uses Natural Language Processing and regression analysis to quantify intensity, tone and narrative in the media as three critical dimensions of legitimation. The model is applied to a corpus of 1.3 million newspaper articles from six Eurasian post-socialist countries and twelve IOs in the period 2018-2020. It yields four key results. First, and contrary to earlier studies, the tone of articles on IOs is predominantly positive. We document, however, significant differences between countries and IOs. The Russian media contribute to the delegitimation of IOs, while the Polish media engage primarily in their legitimation. The tone of articles featuring the WHO and the IMF is more negative than that of those on other IOs, after the topic of the narrative is controlled for. Second, articles mentioning influential domestic politicians contribute to the delegitimation of the IOs featured. Third, we show for the first time that the Covid-19 pandemic reduced the tone for most of the IOs analyzed. The impact is most pronounced for the WHO and is related to China’s influence on the WHO. Fourth, we verify the symmetric hypothesis of guilt by association and document that the world powers contribute significantly to the delegitimation of IOs in the post-socialist press in Eurasia. The model developed is easily scalable and can be applied to any number of countries and IOs, providing that sentiment lexicons exist for the languages spoken in those countries.
The dataset analyzed during
the current study is called replication_datafile.Rdata
and is available at https://www.openicpsr.org/openicpsr/project/152961. All calculations were conducted in R, and the file is saved in the
Rdata format. It contains one data frame called datafile with 1,255,294 rows
(observations) and 77 columns (variables). Each observation describes
characteristics of one article from the corpus of 1,255,294 articles.
List of variables and their
definitions:
sent – sentiment, number of positive inclination words minus the number of negative
inclination words divided by the number of words in the articles
rsent – relative sentiment, sentiment of the article minus the average
sentiment of all articles published in this newspaper
usa, china, european_union – one variable for each of the three world powers, equals zero if a world
power is not mentioned in the articles and 1+log(N) if a world power is
mentioned N times in the article.
un, wto, who, oecd, scun,
nato, imf, ebrd, adb, aiib, wb, cc - one variable
for each of the twelve IOs, equals zero if an IO is not mentioned in the
article and 1+log(N) if an IO is mentioned N times in the article.
dip_ru, dip_kz, dip_bel,
dip_ukr, dip_pl, dip_hu - one variable
for each of six countries, equals zero if no influential domestic politician is
mentioned in the articles and 1+log(N) if influential domestic politicians are mentioned
N times in the article.
Thirteen topic dummies,
the topic dummy equals one is an article was classified as discussing this topic,
zero otherwise. The LDA topic modelling was conducted for k=30 topics. Topics
were grouped by their analytical relevance and semantic similarity into
thirteen categories of: politics and legislation (POL); economy, finance,
various sectors of the economy (ECO); military, war, protests, crime, security
threats (MIL); international affairs, specific issues concerning foreign
countries (INT); technology (TECH); family issues, culture, sport, education
(FAM); regional issues and housing (REG); health issues and the Covid-19
pandemic (HEA); media (MED); accidents (ACC); religion (REL); the Soviet Union
(USSR); and articles for which no topic could be determined (MISC).
Twenty newspaper dummies
“n_newspaper_COUNTRY”, the newspaper dummy equals one if an article was
published in this newspaper, zero otherwise. The twenty media outlets are: iz.ru,
kommersant.ru, novayagazeta.ru, vedomosti.ru,
informburo.kz, nur.kz, tengrinews.kz, zakon.kz, bdg.by, belgazeta.by, sb.by, kp.ua,
segodnya.ua, vesti.ua, gazeta.pl, rp.pl, wpolityce.pl, index.hu, origo.hu, alfahir.hu.
Six country dummies “c_COUNTRY”.
Twelve month dummies
and three year dummies indicate year and month when an article
was published.
Scope of Project
Subject Terms:
View help for Subject Terms
Legitimation;
International organizations;
Computational politics;
Natural language processing;
Relative sentiment analysis;
Post-socialist countries
Geographic Coverage:
View help for Geographic Coverage
Six Eurasian post-socialist countries
Time Period(s):
View help for Time Period(s)
2018 – 2020 (2018-2020)
Collection Date(s):
View help for Collection Date(s)
2018 – 2021
Universe:
View help for Universe
We
selected the following twelve IOs for the analysis: the United Nations, the WTO,
the WHO, the OECD, the United Nations Security Council (SCUN), NATO, the IMF, the
World Bank, the EBRD, the Asian Development Bank (ADB), the Asian
Infrastructure Investment Bank (AIIB), and the United Nations Framework
Convention on Climate Change (UNFCCC).
Data Type(s):
View help for Data Type(s)
text
Collection Notes:
View help for Collection Notes
Data was scraped from the following twenty websites:
iz.ru
kommersant.ru
novayagazeta.ru
vedomosti.ru
informburo.kz
nur.kz
tengrinews.kz
zakon.kz
bdg.by
belgazeta.by
sb.by
kp.ua
segodnya.ua
vesti.ua
gazeta.pl
rp.pl
wpolityce.pl
index.hu
origo.hu
alfahir.hu
Related Publications
Published Versions
Report a Problem
Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.
This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.