Experts, coders and crowds: An analysis of substitutability
Marquardt, Kyle Lohse; Pemstein, Daniel; Petrarca, Constanza Sanhueza; Seim, Brigitte; Wilson, Steven Lloyd; Bernhard, Michael; Coppedge, Michael; I. Lindberg, Staffan
Journal article, Peer reviewed
Published version

View/ Open
Date
2024Metadata
Show full item recordCollections
- Department of Comparative Politics [555]
- Registrations from Cristin [12324]
Abstract
Political scientists increasingly use crowdworkers to produce data, predominantly in the context of coding researcher-curated text or to retrieve simple data from the internet. In this article, we provide a theoretical and empirical basis for understanding when crowdworkers can provide data of sufficient quality to substitute for other types of coders. First, we introduce a typology of data-producing actors – experts, trained coders and crowds – and hypothesize factors that affect the substitutability of crowdworkers. We then examine how crowdworkers perform across coding tasks that vary along multiple dimensions of difficulty: information verifiability, availability and complexity. The results provide scope conditions bounding the substitutability of crowdworkers in political science applications. Although crowds can substitute for trained coders in the context of relatively simple information retrieval tasks, there is little evidence that crowdworkers can substitute for experts, whose tasks require both information retrieval and data synthesis.