Lost in a random forest: Using Big Data to study rare events

Christopher A. Bail


Big Data and Society 2 (2) (2015)

Abstract

Sudden, broad-scale shifts in public opinion about social problems are relatively rare. Until recently, social scientists were forced to conduct post-hoc case studies of such unusual events that ignore the broader universe of possible shifts in public opinion that do not materialize. The vast amount of data that has recently become available via social media sites such as Facebook and Twitter, as well as the mass-digitization of qualitative archives, provides an unprecedented opportunity for scholars to avoid such selection on the dependent variable. Yet the sheer scale of these new data creates a new set of methodological challenges. Conventional linear models, for example, minimize the influence of rare events as “outliers,” especially within analyses of large samples. While more advanced regression models exist to analyze outliers, they suffer from an even more daunting challenge: equifinality, or the likelihood that rare events may occur via different causal pathways. I discuss a variety of possible solutions to these problems, including recent advances in fuzzy set theory and machine learning, but ultimately advocate an ecumenical approach that combines multiple techniques in iterative fashion.


DOI

10.1177/2053951715604333



Citations of this work

What are neural networks not good at? On artificial creativity. Anton Oleinik - 2019 - Big Data and Society 6 (1).


