Oscar Stuhler

I am a sociologist studying discourse by formal, quantitative means. Most of my work centers around how to measure, analyze, and theorize textual representations of social structures.

I am a College Fellow and incoming Assistant Professor at Northwestern University’s Department of Sociology. I completed my PhD in sociology at New York University.

You can access a version of my CV here and reach me at oms@northwestern.edu. Below, you can find summaries of some of my work. If you are interested in analyzing textual representations of social structures, you may want to try out the semgram R package I wrote.


Who Does What to Whom? Making Text Parsers Work for Sociological Inquiry

Published in Sociological Methods & Research (2022) [PDF]

Abstract: Over the past decade, sociologists have become increasingly interested in the formal study of semantic relations within text. Most contemporary studies focus either on mapping concept co-occurrences or on measuring semantic associations via word embeddings. Although conducive to many research goals, these approaches share an important limitation: they abstract away what one can call the event structure of texts, that is, the narrative action that takes place in them. I aim to overcome this limitation by introducing a new framework for extracting semantically rich relations from text that involves three components. First, a semantic grammar structured around textual entities that distinguishes six motif classes: actions of an entity, treatments of an entity, agents acting upon an entity, patients acted upon by an entity, characterizations of an entity, and possessions of an entity; second, a comprehensive set of mapping rules, which make it possible to recover motifs from predictions of dependency parsers; third, an R package that allows researchers to extract motifs from their own texts. The framework is demonstrated in empirical analyses on gendered interaction in novels and constructions of collective identity by U.S. presidential candidates.

Gender association of actions in cross-gender interactions (A) and predictability of character gender in U.S. novels 1880-2000 (B)

What’s in a Category? A New Approach to Discourse Role Analysis

Published in Poetics (2021) [PDF]

Abstract: Building on the work of John Mohr, I propose a new, broadly applicable approach to Discourse Role Analysis (DRA). Whereas the goal of behavioral role analysis is to identify the different kinds of actors that exist in interaction, the goal of DRA is to identify the different kinds of identities that exist in discourse. To do this, I suggest thinking of discourse roles as latent conceptions of identities composed of treatments, actions, and characteristics that are frequently concurrently associated with identities in stories. I propose a method to infer discourse roles from unstructured text data that draws on novel techniques from Natural Language Processing. This framework is leveraged to shed light on German news coverage of refugees (2010-2020), which employs a set of distinct discourse roles such as refugee as claimant of welfare benefits, refugee in distress at sea, and refugee as a criminal. I then assess how different refugee identity categories are situated within this discourse role structure. I pay particular attention to Geflüchtete, a category that emerged only recently in German discourse. Whereas initial use of Geflüchtete was motivated by a language critique that aimed at replacing the general term for refugees (Flüchtlinge), DRA indicates a process of categorical differentiation in which the category increasingly serves to distinguish different kinds of refugees. 

Discourse Role Analysis workflow


Politics as Usual? Measuring Populism, Nationalism, and Authoritarianism in U.S. Presidential Campaigns (1952-2020) with Neural Language Models

Bart Bonikowski, Yuchen Luo, and Oscar Stuhler (equal authorship), published in Sociological Methods & Research (2022) [PDF]

Abstract: Radical-right parties and candidates combine three discursive elements in their electoral appeals: anti-elite populism, exclusionary and declinist nationalism, and illiberal authoritarianism. Recent studies have explored whether these frames have diffused from radical-right to centrist parties in the latter’s effort to compete for the former’s voters. This paper investigates the obverse process: the radical right’s (specifically, Donald Trump’s) reliance on discursive elements that had long been present in mainstream institutional politics. To do so, we identify instances of populism, nationalism (i.e., exclusionary and inclusive definitions of national symbolic boundaries and displays of low and high national pride), and authoritarianism in the speeches of Democratic and Republican presidential nominees between 1952 and 2016. These frames are subtle, infrequent, and polysemic, which makes their quantitative measurement difficult. We overcome this by leveraging the affordances of cutting-edge neural language models; in particular, we combine a variant of bidirectional encoder representations from transformers (RoBERTa) with active learning. As we demonstrate, this approach is considerably more powerful than other methods commonly used by social scientists to measure discursive frames. Our results suggest that what set Donald Trump’s campaign apart from those of mainstream presidential candidates was not its invention of a new form of politics, but its combination of negative evaluations of elites, low national pride, and authoritarianism—all of which had long been present among both parties—with an explicit evocation of exclusionary nationalism, which had previously been used only in coded form. Radical-right discourse therefore appearsto be less a break with the past and more an amplification and creative rearrangement of existing political-cultural tropes.

Use of discursive frames in selected U.S. presidential campaigns

Reclaiming the Past to Transcend the Present: Nostalgic Appeals in U.S. Presidential Elections

Bart Bonikowski and Oscar Stuhler, published in Sociological Forum (2022) [PDF]

Abstract: Nostalgic appeals to an idealized past are a commonly associated with radical-right discourse. They bolster candidates’ critiques of the status quo and promises of a better future, all while mobilizing perceptions of collective status threat among supporters. In this paper, we ask whether nostalgia is a radical-right innovation or whether it has precedents in mainstream politics. We make use of recent advances in natural language processing—specifically transformer-based deep learning models—to identify nostalgic claims in U.S. presidential campaign speeches from 1952 to 2020. We then examine what form nostalgia takes, when it has been most salient, what aspects of the nation it has been used to glorify, and how it relates to populist and nationalist appeals. Our findings suggest that nostalgic rhetoric usually takes the form of brief and multivocal statements with a consistent lexical signature. It is frequently used by challenger candidates from both parties to generate a heightened sense of crisis and to morally indict incumbent opponents, particularly during times of widespread cultural contention. In so doing, nostalgia helps substantiate candidates’ populist claims and expressions of low national pride. Given that these patterns are found throughout our time series, this points to important continuities between the discourse of mainstream and radical-right actors in U.S. politics. Where their respective messaging diverges, however, is in the use of nostalgia to frame exclusionary nationalist and authoritarian appeals, a practice limited to the radical-right (in our data, Donald Trump). Our findings suggest that radical-right actors did not invent their rhetorical strategies de novo, but rather, have adopted frames already widespread in mainstream politics, adapting and creatively recombining them for their own ends.

Prevalence of nostalgic appeals in U.S. presidential campaigns

Relating Social and Symbolic Relations in Quantitative Text Analysis. A Study of Parliamentary Discourse in the Weimar Republic

Jan Fuhse, Oscar Stuhler, Jan Riebling, and John Levi Martin, published in Poetics (2020) [PDF]

Abstract: Social relations between actors and symbolic relations between concepts or ideas are interwoven in discourse. We conceptually distinguish three approaches that construct relations between symbols with different connections to social structures. These three approaches are illustrated empirically with automated text analyses of the parliamentary proceedings of the Weimar Republic in Germany (1919-1933). First, cultural relations between symbols, as reconstructed from co-occurrences of terms in large text corpora, are supposedly widely shared in a social context. In this sense, we analyze a set of key terms in Weimar political discourse around the central term “Volk” (“people”). These fall into five word communities, each of them representing a different way of conceiving politics. Secondly, symbolic practices are related to actors positioning themselves through them in socio-symbolic constellations. We reconstruct such a constellation from the usage of key terms of Weimar parliamentary discourse by the eight major political parties in their speeches, with different parties signaling their ideological positions through these terms. Thirdly, the use of symbols in interaction characterizes social relationships between actors. In this vein, the ties between the Weimar parties show distinct patterns of hostility or support in their interjections and reactions to each other’s speeches. The second and the third analyses reveal a two-dimensional patterning of the Weimar political landscape, with the traditional Left-Right dimension complemented by an opposition of forces supporting or rejecting the republic. Also, the similarities in word usage by parties correspond fairly well to the support or hostility in their interjections and reactions.

Party-to-party relationships in the Weimar Reichstag (1919-1933) based on partisan interjections to speeches