Skip to main content

Main menu

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Accessibility Statement
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home
  • Log in
  • My Cart

Advanced Search

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
Inaugural Article

Lexical shifts, substantive changes, and continuity in State of the Union discourse, 1790–2014

Alix Rule, Jean-Philippe Cointet, and Peter S. Bearman
  1. aInterdisciplinary Center for Innovative Theory and Empirics (INCITE), Columbia University, New York, NY 10025;
  2. bInstitut National de la Recherche Agronomique–Laboratoire Interdisciplinaire Sciences Innovations Sociétés, Université Paris-Est, Marne-la-Vallée, F-77454 Marne-la-Vallée, France

See allHide authors and affiliations

PNAS September 1, 2015 112 (35) 10837-10844; first published August 10, 2015; https://doi.org/10.1073/pnas.1512221112
Alix Rule
aInterdisciplinary Center for Innovative Theory and Empirics (INCITE), Columbia University, New York, NY 10025;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jean-Philippe Cointet
bInstitut National de la Recherche Agronomique–Laboratoire Interdisciplinaire Sciences Innovations Sociétés, Université Paris-Est, Marne-la-Vallée, F-77454 Marne-la-Vallée, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter S. Bearman
aInterdisciplinary Center for Innovative Theory and Empirics (INCITE), Columbia University, New York, NY 10025;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: psb17@columbia.edu
  1. Contributed by Peter S. Bearman, June 30, 2015 (sent for review May 21, 2015; reviewed by Ronald L. Breiger and John Mohr)

  • Article
  • Figures & SI
  • Info & Metrics
  • PDF
Loading

Significance

A synoptic picture of the evolution of American politics is presented, based on analysis of the corpus of presidents’ State of the Union addresses, 1790–2014. The paper presents a strategy for automated text analysis that can identify meaningful categories in textual corpora that span long durées, where terms, concepts and language use changes, and evolution of topical structure is a priori unknown. Discourse streams identified as river networks reveal how change in contents masks continuity in the articulation of the major tasks of governance over US history.

Abstract

This study reveals that the entry into World War I in 1917 indexed the decisive transition to the modern period in American political consciousness, ushering in new objects of political discourse, a more rapid pace of change of those objects, and a fundamental reframing of the main tasks of governance. We develop a strategy for identifying meaningful categories in textual corpora that span long historic durées, where terms, concepts, and language use changes. Our approach is able to account for the fluidity of discursive categories over time, and to analyze their continuity by identifying the discursive stream as the object of interest.

  • State of the Union
  • text analysis
  • networks
  • natural language processing
  • American history

When did modern political discourse emerge in the United States? What is distinctive of basic understandings of the tasks of governance today, in contrast to those that organized the politics of an earlier period? Can the origins of contemporary political understandings be located in the discourse of the past? The annual State of the Union address (hereafter, SoU), in which the US president reports broadly on the progress and challenges of his administration, provides a singular standpoint from which to address the evolution of the tasks of governance. It can thus be used to investigate old questions like those above using network-based text analysis strategies.

This study reveals that the entry into World War I (WWI) in 1917 indexed the decisive transition to the modern period in American political consciousness, ushering in new objects of political discourse, a more rapid pace of change of those objects, and a fundamental reframing of the main tasks of governance. At the same time, this study demonstrates that discourse distinctive to modern politics, although it later crystalized around the liberal welfare state, in fact emerged before the transition to the modern period.

We offer a unique view of American political history, which tracks the articulation of the major tasks of governance in American political and social discourse. To do so, we develop a strategy for identifying meaningful categories in textual corpora that span long historic durées. We are able to account for the fluidity of discursive categories over time, and to analyze their continuity by identifying the discursive stream as the object of interest. The methodological approach developed in this article can be used to meaningfully analyze texts produced over very long historical periods, where terms, concepts, and language use changes—to our knowledge, a problem not satisfactorily solved.

Historical Background

The SoU address is delivered annually by the president to a joint session of Congress, a tradition with its basis in the US Constitution, where it is mandated that the president “shall from time to time give to the Congress information of the SoU, and recommend to their Consideration such Measures as he shall judge necessary and expedient.” Since George Washington’s first presidential address in 1790, the SoU has been given every year, with only one exception in 1933, when incoming president Franklin Roosevelt did not give a speech. The country’s first two presidents appeared in person before Congress to deliver the SoU. Thomas Jefferson, judging that this constituted an imperial gesture, set the precedent of delivering the address to the legislature in written form, a practice that endured until Woodrow Wilson took office in 1913. The latter is sometimes credited with having transformed the address into a direct appeal to the US populace, although presidents who immediately followed him sometimes reverted to written delivery. The SoU was radio broadcast for the first time in 1923, was first televised in 1947; in 1965, Johnson became the first president to cater to a television-viewing audience by delivering the speech in the evening rather than at midday (1).

Research attests to the SoU’s significance in political agenda setting and the reciprocal influence of public opinion on the content of the address. The SoU reflects opinion regarding the salience of issues, while also creating it (2⇓–4). Thanks both to its persistence and its prominence as an institution in US national politics, the SoU has been of perennial interest to researchers seeking to understand various facets of the country’s history (5⇓⇓⇓–9). The main focus of this work has been to pinpoint changes in political discourse to the influence of particular presidents and thus stands in contrast to the focus of this article, which is to represent continuity and change in the structure and content of American social and political thought.

To summarize, as a corpus, the text of SoU mirrors contemporary public understanding of what issues were important. It is nearly unique in the certainty and consistency of its provenance, produced at regular intervals by an individual occupying a well-defined social role, that of the US chief executive. Despite strong a priori reasons for doing so, we do not simply assume that the speech constitutes a stable cultural form, but rather demonstrate that this is the case empirically. The SoU thus provides a unique vantage point from which to reconsider arguments about the timing and nature of critical transition points in US political consciousness.

Revealing the evolution of political discourse requires appreciating how its contents change over time. The method we present relies on the straightforward idea that words acquire meaning through their relations with other words (10). Consequently, we focus on co-occurrence, extracting the local ties between terms in paragraphs to induce categories of discourse from the resulting network structure. By recognizing that the relations between words arise in time, and appropriately defining the period over which co-occurrence is considered, we approximate the semantic standpoint of contemporary observers. We thus consider the categorical structure of discourse over successive, delimited time periods to uncover and analyze continuity and change in social and political thought. Clarifying these methodological points and identifying the insights into American social and political discourse that they permit is the focus of this article.

Methodological Background

Our analysis strategy falls into a class of text analysis methods broadly characterized as co-occurrence approaches (11), which induce categories by relying on terms’ joint appearance over a particular unit of text (12). The central aim of our approach is to parsimoniously identify relevant and interpretable higher-level units of meaning endogenously, and to track their coevolution through time.

The core problem for analysts of text produced over very long historical periods is that key terms change, but for different reasons—language use shifts, new inventions join the world, concepts are recast and reorganized—making it difficult to distinguish meaningful from meaningless change. In general, canonical approaches to text analysis have not been sensitive to the fluidity of meaning over time, either on the level of individual terms or of higher-level context, conceived as categories, topics, classes, or discussions. Fig. 1 illustrates the two main reasons that a co-occurrence approach is uniquely well suited to analysis of the SoU and other historical corpora: first, in contexts where the reasons for changing word use are unclear and hard to disentangle, attention to the relationships between words is crucial for understanding the significance of such changes. Second, the co-occurrence structure, an abstraction of the changing context of use, is itself directly interpretable. In this sense, a frontal approach like co-occurrence analysis is preferable to other methods that identify categories in text, but require additional steps to make those categories accessible to interpretation.

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

The meaning of words is conditional on their co-occurrence with other words and terms. Attending to patterns of co-occurrence over time captures their evolving meaning. Key terms co-occurring with “constitution” are shown for four periods over the SoU corpus, 1790–1834, 1855–1894, 1915–1954, and 1975–2014. For each time period, we build a proximity network where each node (word or term) is linked to its closest neighbors. Nodes are colored according to the connected component or community to which they belong. The target node—“constitution”—and its links to all other pictured terms, is hidden from the visualization. Node size scales with frequency of terms’ occurrence.

In Fig. 1, we observe immediately that some of the terms associated with “constitution” change: “constituents” is present only in the first period, “slavery” only in the second, and “land laws” and “ideals” distinguish the semantic neighborhood of the term in the third and fourth periods, respectively. However, the relationship of these associated terms to one another also changes, strikingly. In the first decades of the country’s history, “people” and the objects associated with it appear as a distinct community (colored blue), indicating one context of the constitution’s meaning. During the Civil War and Reconstruction era, captured in the second period, the largely familiar set of contexts to which “constitution” is related themselves become more closely associated—as the constitution becomes central to a number of key discussions of the era. In the third period, the context of “constitution’s” meaning again becomes more straightforward and more limited—its contents focused on jurisprudence—and even more so in the fourth period, although the related terms again shift. Equally, Fig. 1 allows us to see that terms that reappear successively in connection with “constitution” may undergo semantic transformation. In the first period, “confederacy” is associated with “years” and “members”—referring, in the first decades of the country’s history, to the organization of member states—whereas in the second, it is associated with “state,” “self-government,” and “union.” We do not in fact need to know that the “confederacy” was the name adopted by the seceding southern states, changes in the network of terms alone indicates that such a transformation has occurred. At the risk of being didactic, the changing significance of words—revealed by the structure of co-occurrence with other words and terms—can play havoc with traditional dictionary and topic model approaches. We expand upon the reasons for this below.

In contrast to the approach developed here, dictionary-based methods compare words observed in a corpus against a predefined and often structured set of terms. They thereby fix both a semantic structure and the definition of particular words within it. Such methods thus assume a specific substantive context (13)—for example, a dictionary for political discourse would not capture the meaning of the same terms used in everyday speech. This makes dictionaries inappropriate for corpora that span long time periods, because adopting a “substantive context” entails arbitrarily assuming a fixed historical standpoint—in our example, the political discourse of a given moment (cf. ref. 14). Supervised text classification, by contrast, builds automated classifiers inductively, which learn the characteristics of those categories they apply from a set of preclassified texts. In analyzing language that evolves over time, however, supervised text classification methods present limitations similar to those of dictionary-based approaches in that they assume that categories possess a stable textual signature.

By contrast, topic modeling comprises a set of methods for identifying meaningful categories in textual corpora endogenously. Approaches like latent Dirichlet allocation (LDA), popularized by Blei et al. (15), infer what a unit of text is about by relying on probabilistic models based on observed word distributions. For a given corpus, the analyst sets a number of topics that are then distinguished by probability distributions over words specific to each topic. Topics are unstructured and consist of ranked lists of terms, whose weights are associated with their likelihood of being used when a topic is drawn. A distribution of topics over units of text and the membership of terms in topics are jointly optimized.

Despite their popularity, topic models raise a number of concerns: one is that the topics they identify are not in themselves easy to interpret, and consequently not substantively “trustworthy.” [The fact that topics produced through LDA often lack “actual and perceived accuracy” as meaningful categories (16, 17) has provoked topic modelers to find additional methods of checking for topics that do not fit the data (18).] Concerns about transparency further compound problems of parameterization that topic models inevitably raise for analysts (19), e.g., how many topics to specify (20).

More recent developments in the topic-modeling vein have started to account for the way that meaning arises over time. In contrast to the original probabilistic latent semantic analysis (21) and LDA approaches, some topic models now accommodate hypotheses of topic dependence—allowing for words’ distributions to be correlated (22)—and for topics’ dependence on a range of external variables (23), time among them (24). An example of the latter, the topics-over-time (ToT) algorithm (25) identifies topics that vary in their temporal profiles over time-stamped corpora, arguably making it well suited to detect brief semantic responses to exogenous events. However, ToT is not designed to capture how a changing set of words or terms compose a topic over time. Blei and Lafferty’s (26) continuous dynamic topic modeling (cDTM) does do this. Like LDA, cDTM requires the analyst to specify a set number of topics in advance, through which different sets of words may move over time, as through a tube. Such an approach is, however, unsatisfying in cases where the genealogy of relevant categories is itself unknown, and in fact is the question of interest, as in our study.

To our knowledge, Gao et al. (27) have made the only similar attempt to address this problem, inducing clusters of documents to study the “overall evolution of topics and their critical events.” However, the topical structure that Gao et al. (27) generate is not directly interpretable. A second step is needed to make topics accessible at the level of meaning; this is accomplished through an analysis of word co-occurrence “on top of” the heirarchical Dirichlet process model for retrieving the categories. In contrast to this strategy, our approach is both more parsimonious and transparent. Specifically, as illustrated in Fig. 1, our strategy exploits the co-occurrence structure—assumed absent in topic models (18)—as it unfolds, to track the continuity, discontinuity, and relations between categories across time, relying only on terms’ joint appearance over a particular unit of text to endogenously induce topics.

We analyze the co-occurrence structure using familiar network analysis techniques, relying on community detection to identify and interpret categories of relevance. We then limit consideration of co-occurrence to successive time periods in SoU discourse, and describe the genealogical relationships between categories. To do this, we assess the similarity between network clusters in successive periods.

A variety of techniques for community detection in dynamic networks have recently been developed (28⇓⇓–31)—although they have not been widely applied to problems of semantic detection, with the exception of ref. 32. Growth in this area makes us optimistic that the approach developed here can be applied widely in the analysis of other corpora where change in terms’ use over time is substantial, has multiple sources, and has uncertain bearing for higher-level meaning.

Methods

The SoU corpus includes 227 dated addresses, comprising a total of 1,763,622 words. The speeches vary in length and elegance. Our method is insensitive to such variation. We base our analysis on frequently occurring noun terms, including multiword phrases, e.g., “national security,” “local government,” “fellow citizens,” extracted using natural language processing (NLP) techniques.

Semantic categories are induced by considering the co-occurrence patterns of terms over documents spanning particular periods of time. We define the co-occurrence matrix as the number of joint appearances of terms i and j in the same paragraph in a document published at time t. A proximity score is then computed to measure the relatedness of each pair of terms, yielding a weighted semantic network with terms as nodes connected by edges weighted by their similarity (33). A community detection algorithm (34) is used to identify cohesive subsets. These clusters are what we will refer to as discursive categories, and interpret as such. Inducing discursive categories through co-occurrence provides a rich perspective on the inner structure of each topic, illuminating the individual connections between words, the positioning of terms in the clusters, and proximity between topics.

We first apply the procedures described above to produce the entire (“global”) semantic network on the full 1,000 × 1,000 terms matrix over the SoU’s history. We then apply the same procedure over successive, delimited time periods of the SoU to produce historically specific semantic networks. We refer to these as local networks, in contrast to the global network induced from the entire corpus. To analyze the continuity of discursive categories, we apply an algorithm that captures the movement of terms between discourse clusters in past and successive periods, and thereby allows us to reconstruct the most likely lineage of each discursive category. This makes it possible to represent the SoU as a series of conversation streams, in a river network.

To induce the river network, we generated local semantic networks according to the same procedure described above for 10 successive overlapping periods of 40 y each, evenly spaced to cover the entire SoU corpus. We hence obtained 10 terms maps based on the co-occurrence of the most frequent terms in each period. Clusters on these networks index historically specific categories. Each network provides an objective map to which the social and political discursive categories of contemporary actors corresponded. We then applied an algorithm (Supporting Information) to find the most likely lineage between the discursive categories of a given period and those that preceded it, on the basis of shared terms weighted by their within-cluster centrality. Recall that clusters index discursive categories. For each time period t, the algorithm considers a cluster detected within the given time period and knits it with clusters from the previous time period. To determine which clusters to connect, we compute the Bhattacharyya coefficient between the normalized centrality distribution of terms in a given cluster and each of the normalized centrality distributions of terms in clusters detected at t − 1. Pairs of clusters are intertemporally linked if similar.

Results

In terms of its lexical contents, the SoU is remarkably stable, changing only gradually over time. To assess change over the SoU’s history, we first use a vector space model as an efficient tool for representing the similarity of two given addresses. We treat each speech as a vector of terms weighted by their frequency, and compare each of the dated vectors using the cosine measure, controlling for speech length. Results are displayed in the transition matrix in the upper panel of Fig. 2. Transition matrices considering 500 and 1,500 terms return results consistent with those reported here, as do an otherwise-identical set of matrices obtained using a Euclidean distance measure.

Fig. 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 2.

The SoU is a continuous cultural form, its moments of disruption indexing real world events. The upper panel reports the transition matrix capturing change in key terms used in the SoU over time. Each cell of the matrix compares the terms vectors for two addresses, representing their dissimilarity on a scale of 0–1. Darker regions represent areas of substantial similarity; light bands represent moments in which the speech’s content departed dramatically from that of (all) other years. The lower panel is a homomorphic reduction of the first panel and highlights key transition periods—1917, 1816, and 1950—none of which overlap with changes in mode of delivery (represented as red dotted vertical lines).

Fig. 2 reveals the gradual nature of year-to-year change in the SoU’s contents, which can be seen in the dark colored cells above and below the main diagonal, indicating the similarity of adjacent years. Furthermore, one observes only a few moments of relatively brief disruption, none enduring more than a few years. Notably, the timing of these moments does not index changes in the speech’s mode of delivery or broadcast, but coincides with real world disruptions: the War of 1812 and the First and Second World Wars. In short, the SoU is a stable cultural form, and dramatic departures are brief and respond to exogenous events. [The exception that proves the rule concerning the SoU’s formal consistency comes in 1887, when Cleveland devoted his entire speech to a plea for tariff reform, the central issue of his campaign for reelection in the following year. Cleveland lost to his Republican challenger, which may help account for the fact that departure from the basic format of the SoU was never repeated (35).]

To analyze stability in the SoU’s contents, we compared adjacent years in which the president remained in office, to adjacent years in which a new president delivered the address. Given our orientation to the changing conceptual backdrop of US politics revealed through SoU discourse, our aim was to test the assumption that the objects that feature in political speech change at a rate independent of what is said about them. Indeed, there is no statistically significant difference in the pace of change in terms either when the new president is from his successor’s party or from the political opposition. In adjacent years where speeches were given by the same president, the average change ratio was 0.31. By comparison, years in which a new president from his predecessor’s party gave the address averaged 0.34 (P = 0.95), and those in which the SoU was delivered by a new president from the opposition, 0.33 (P = 0.85).

The observation that the SoU is a stable cultural form, however, does not imply that we cannot describe periods of continuity and discontinuity in the objects of political concern. To detect such periods, we divide the transition matrix into subblocks that maximize the homogeneity of the average values of each pair of years falling within each block. The results of this procedure are illustrated in Fig. 2. One can immediately see that these periods of similarity are not aligned with changes in the mode of the SoU’s delivery, any more than are momentary disruptions. The optimal partition is in 1917; secondary partitions in 1816 and 1950 demarcate unique discourse regimes. Differences in color saturations of blocks along the main diagonal reveal that each of these periods is also characterized by a unique rate of change. Net of further partitions, this is equally true of the history before and following 1917. The longer first period is characterized by less dissimilarity between any two given years than the second (Supporting Information). 1917 was the first year that objects of political discourse resembled our own more than those of the 19th century, and marks off a previous era of slower change from one a subsequent one in which change proceeded faster. It demarcates the transition to modern political consciousness.

Transhistorical Categories in US Social and Political Discourse.

The bag-of-words approach above (ignoring the structure of co-occurrence) provides a baseline picture of turnover in the objects of political discourse. It thus constitutes an efficient way to detect precise moments of change. However, it is inadequate to describe what is being discussed, much less the finer contours of discursive categories or relations between them. Relying on the approach described in Methods, we next identify the main categories over the entire history of the SoU. Fig. 3 displays the semantic map of social and political discourse over the entire period from 1790 to 2014. Filtering the aggregated matrix of co-occurring terms across the entire corpus, results in a network with 887 nodes and 13,302 edges. The nine clusters on the network are labeled on the basis of the lexical contents as they relate to the network structure detected by the Louvain algorithm, and index the main categories of political discourse over US history. Recalling that both the internal structure of the clusters and their position in relation to one another is significant, we describe the categories captured in the terms map, clockwise from Upper Left: “Production” in orange is closely connected to a larger cluster broadly concerning “Domestic Policy” (red), in which economic language mingles with terminology relating to the welfare state, also connected to a discourse of “Foreign Policy” (dark green). Below this, but clearly distinct, is a discourse concerned with the functioning and external affairs of the government, “Statecraft” (blue), connected to a conversation about the “Navy” (light green). Loosely connected above is a large cluster representing “Political Economy,” in which monetary policy appears as a distinct region. “Land” features above this, as a small but distinct discourse over US history.

Fig. 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 3.

The global network structure of the SoU, 1790–2014. The Louvain community detection algorithm reveals cohesive clusters or discursive categories from the semantic network built from the 1,000 × 1,000 terms matrix over the SoU’s history. Some terms lie between clusters and serve as bridges connecting otherwise disjoint discourses. Two clusters contain only a few linked terms: one indexes the set of concepts associated with immigration, the other those associated with crime.

These master categories provide a semantic summary of the entire 200-y durée of the SoU, but they do not reflect the native categories of speakers. No president may have ever uttered a paragraph about “statecraft” or “domestic policy.” These topics are rather a meaningful abstraction of discourses historically situated individuals could have and did perceive.

Three of the seven major clusters, Land, the Navy, and Production, are largely comprised of terms that retain unique meanings over the period from 1790 to 2014. We might thus assume that, despite changing contents, these categories are semantically stable and that over the history of the SoU and that the terms in these clusters reference a single focus of substantive political concern. By contrast, four of these seven clusters, by far the largest, appear to comprise many terms that are historically multivocal. Attention to the structure of co-occurrence suggests that these categories arise from discourse conducted over specific periods of history. Namely, the Statecraft and Political Economy clusters belong to an earlier moment in history than the Foreign Policy and Domestic Policy categories. The two pairs index similar tasks of government, as constructed differently in different epochs.

Discourse Categories over Time.

To obtain a provisional picture of the historical foundations of these master categories in US political discourse, we project the clusters derived from the structure of co-occurrence in the entire corpus onto paragraphs of dated SoU addresses. Specifically, each of the nine clusters is represented as a vector of member terms, weighted by centrality, and is compared with a vectorial representation of the lexical content of each paragraph. We match the content of dated paragraphs in the SoU to the clusters they most closely resemble, assigning paragraphs to one or multiple clusters. The projection procedure shares similarity with LDA topic modeling insofar as the optimal categorization of paragraphs is conditioned on topics as distributions of terms. (However, here, paragraphs’ assignment to topics is accomplished in a second step and does not assume a distribution of topics over paragraphs.) The procedure allows us to track the relative representation of political master categories over time, as in Fig. 4.

Fig. 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 4.

Foreign Policy and Domestic Policy Supplant Statecraft and Political Economy as master categories in American political discourse over time. Projecting global network clusters onto paragraphs of dated addresses allows us to track the historical foundations of basic understandings of the tasks of government revealed in the SoU. Time (1790–2014) runs along the x axis, and the y axis reports the proportion of the SoU devoted to each discourse cluster in given year. Dotted lines demarcate periods.

The results confirm the impression of historicity of the topics represented by four large clusters on the global network. In Fig. 4, continuity and discontinuity are clearly discernible. Some clusters, Land and the Navy, largely disappear over time. Others appear to shape-shift: notably, we observe that the discourses of Foreign and Domestic Policy indeed succeed and eventually replace the discourses of Statecraft and of Political Economy. The latter two categories dominated the SoU discourse during most of the premodern period, along with discussion of Land and the Navy.

The discovery of two sets of historically distinctive categories resonates with and enhances our understanding of the 1917 transition: political discourse changed not only in its objects of concern and their pace of change, but in its construction of the basic tasks of governance. Interestingly, the results displayed in Fig. 4 suggest that the modern categories that began to dominate political discourse after WWI, first emerged before it—around the turn of the century. Although discourses of Foreign and Domestic Policy were hardly hegemonic—neither ranks among the top three categories of SoU discourse in any year before WWI—both understandings gained ground in the first decade of the 20th century.

The implications of the projection procedure remain suggestive as to timing of semantic change, however. This is in part because the abstraction inherent in categories render them remote from those salient to contemporary observers. Furthermore, the procedure reflects the unsatisfying assumption that a single term retains the same meaning over more than 200 y. We transcend this assumption in the next section to obtain a detailed picture of the evolution of political discourse across US history.

Dynamics: River Networks.

How did a different understanding of the fundamental tasks of governance emerge in American political consciousness? Did new topics of discussion appear? Were extant discussions discontinued, or reorganized? To pursue such questions, we need a way to capture the context of meaning in which modern political categories could emerge. We must reconstruct the flow of political discourse and attend to the right moment therein. To achieve this, we induce a river network.

Recall that we generated local semantic networks for successive overlapping periods, retrieving terms maps based on the co-occurrence of the most frequent terms in each. The length of the periods (40 y) reflects what could have been perceived by contemporary actors. The topics indexed by clusters on these local networks are thus are unlike the categories over the full corpus, in that they are meaningful from a particular historic standpoint, and not sensitive to semantic changes that occur in subsequent periods. We then knit these clusters together. The river network that results from this procedure captures the flow of political discourse across US history, as shown in Fig. 5. Topics (clusters) woven together across periods catenate into continuous discourse streams. Clusters so connected at t1 and t3 may comprise none of the same terms (equally, unlike in the master categories derived from clusters on the global network, then projected back onto paragraphs, particular terms may appear in different streams during different periods). A stream remains the same thing from period to period, although it need not remain one thing. The approach can recognize multiple relationships between the structures of adjacent periods. Discourse streams may fork, merge, decline, swell; new streams can always emerge and old ones disappear. Fig. 6 provides one detailed example of the forking processes that the river networks can identify; here for the transition from the cluster labeled “action and law” to the two clusters labeled “departments and recommendations” and “law and interstate commerce” over the period from 1875 to 1914, in which a moralized conversation about the administrative structure of the emergent bureaucratic state is decoupled from the regulatory structure, in this instance focused on railroads.

Fig. 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 5.

A river network captures the flow across history of US political discourse, as perceived by contemporaries. Time moves along the x axis. Clusters on semantic networks of 300 most frequent terms for each of 10 historical periods are displayed as vertical bars. Relations between clusters of adjacent periods are indexed by gray flows, whose density reflects their degree of connection. Streams that connect at any point in history may be considered to be part of the same system, indicated with a single color.

Fig. 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 6.

The discursive categories of a given historical moment may split in subsequent periods. (Right) “Action and law” discourse cluster of p6, and its two successor discourse clusters in p7, “departments and recommendations” and “law and interstate commerce.” Clusters’ shared and unique member terms are displayed in the table at Left.

Two systems of interconnected streams run the full length of US history, one concerning international and the other domestic matters. For most of the country’s history, discourse about fiscal policy, on one hand, and farming and industry, on the other, ran in parallel; these merged only in the mid-20th century (p8) into a unified discourse of the modern, domestic, economy. Conversely, what exists today as two distinct clusters, one concerning the United States’ role as a superpower, and another about national security, both flow from common origin in a mid-20th-century (p8) discursive stream, which in turn flows from one branch of a conversation that forked for the first time in the period centered on WWI (p6) as the United States adopted an internationalist foreign policy. The other branch—the remaining discourse around bilateralism—died out within the next 20 y. Two streams cover a substantial span of the country’s history. One runs from the founding period and is concerned with the country’s defense infrastructure, in particular the navy; it concludes with discussion of the military of WWII (p9). The other begins in period 5 (1875–1914) and splits in the following period where one branch concludes, the other continuing unbroken to the present, tracking the emergence of the contemporary welfare state.

Although generated no less systematically than the static master categories, the streams recapture the unfolding of American political discourse at a high level of historic detail. Also evident in Fig. 5 are a number of local discussions, which are too specific to their moment to map easily to the master categories described above. Two are time-censored: the contemporary discussion around education and communities “schools and help” that regards market-driven social policy has not had a chance to extend into the future; likewise, the early discussion about control of territory “land and limits” was inherited from an earlier period not captured in the SoU. The stream that covers the political crisis culminating in the Civil War and the discourse of Reconstruction spans only two periods, and disappears without disrupting the overall structure. This is consonant with the results of the previous lexical analysis that demonstrated that the long 19th century (from 1816 to 1917) was a period of marked stability with respect to social and political discourse.

Streams capture qualitative continuity in topics of political discussion, but the consistency of their contents across history is uneven. We calculate the rate of new terms, terms pairs, and pairs of terms remaining in the same category of political discourse for the nine transitions between our ten periods, which gives us a comparative picture of how radically the most important objects in political discussions of the day were changing, underneath the continuities revealed in Fig. 5—and much enhancing the low resolution of change provided by the vector space model (Fig. 2). Here again, we find that the moment following WWI (the period 7–8 transition) is an outlier, exhibiting the most radical turnover in the key objects political discussion all three dimensions—but that notably this change occurs without a fundamental reorganization of salient political categories. What was being talked about changed in the political discourse of the post-WWI period, but the framing of basic issues of governance did not.

Discussion

As striking as what is captured in our analysis of the evolution of American political thought is what is missing. The Civil War, often considered in conventional histories to have transformed the country’s political consciousness, while apparent in political discourse of the time, seems not to have made a lasting imprint on the unfolding of the dominant categories of social and political thought in the SoU. Although discordant with the organization of introductory textbooks, the absence of distinct periodization observed in this study around the Civil War is consistent with historical scholarship that identifies the main conversation in the antebellum period as centered on states’ rights, a debate the war, and reconstruction and its collapse, failed to solve. Likewise, our study challenges histories that identify Reconstruction, the New Deal, and WWII as inaugurations of modern political consciousness in the United States. (Although we find some support for the Marshall Plan as marking a secondary transition within the modern period.)

The central finding is that the modern understanding of politics began with the country’s entry into WWI. The year 1917 ushered in modern objects of political concern and an era of rapid change during which such objects began to pass more quickly through the lens of public discourse. It equally marked the decisive ascendancy of today’s basic understanding of governance as consisting in foreign policy, on one hand, and a domestic policy centered on the economy, on the other. However, the stream of political discourse that would come to characterize the modern period emerged before the moment of transition to modernity. Although war marked a transition to a new regime of political discourse, the topics of conversation that featured within it were already familiar to contemporaries. Careful observers of late 19th-century and early 20th-century American intellectual history (36, 37) would find confirmation of their readings in these results.

Our modern discourse emerges from a conversation that persists unbroken from the late 19th century into the present. For late 20th-century observers, this stream is recognized as “about” the functions of the liberal-democratic state—concerned with the regulation of business and the financing of public infrastructure. It began, however, in the period centered on 1894, as a moralized discussion oriented to political and economic reform (Fig. 5). The stream of discussion then split in the following period where the conversation around the regulation of trusts concludes, and another strand oriented to government reform continues eventually becoming a discussion of the welfare state that persists into the present.

The finding is consistent with historical arguments that focus on reformist impulses of the late 19th and early 20th century, as the intellectual, legal, and moral sources of the postwar social order. Progressive ideas did not reshape the political landscape immediately, however; they did so only after the disruption of WWI induced change across multiple domains. The process by which this occurred is beyond this study’s scope, but the fact that only under a new regime could the progressive era discourse institutionalize deserves further consideration in the context of other work (36).

More generally, by providing a map of politically relevant categories as they evolved, our study affords a variety of insights into US history. These insights depend on the production of a previously unidentified object: the discursive stream. Although esoteric to academics, political actors and lay observers readily understand political discourse as continuous in this way. Here, we observe that change in salient contents often masks continuity at a higher level. For example, a discussion that at one point in history is about individual rights, the Fourteenth Amendment, and the Supreme Court, is later a conversation about gag rules and development aid, and still later about health insurance and religious employers; we recognize this as the political discourse around abortion. Today, despite a shifting set of key terms, it remains the same thing—a fact revealed by the methods developed and deployed in this article. Although continuity is observable in the SoU, critical discontinuities in political and social discourse are also present: our study reveals a massive transition from 19th century to modern categories of thought as new framings of domestic and foreign policy emerged despite apparently unbroken discussion of fiscal affairs, industry, and state relations.

Moving beyond the SoU, we show that text analysis methods that are oriented to distinguishing analytic levels of interest, and finding ways to capture continuity at these different levels, may provide solutions to classical problems of historical periodization and for understanding social action that traverses the frontiers of historical regimes. Our current digital context is increasingly replete with old documents, textual corpora spanning hundreds of years—in which word use changes for a complex variety of reasons. The network-based text analysis methods we present here can distinguish meaningful from meaningless change in word use, and render higher-level meanings directly interpretable. They are thus uniquely suited to the analysis of corpora that span long historic durées.

Linguistic Processing

The SoU corpus includes 228 dated speeches, comprising a total of 1,763,622 words covering the time period P spanning from 1790 to 2014. (Below, we use speech and address interchangeably to refer to documents containing the text of each year’s SoU.) We base our analysis on frequently occurring noun terms, including multiword phrases, e.g., “national security,” “local government,” “fellow citizens,” which we extracted using NLP techniques.

The extraction process proceeded in four steps: first, “Treetagger” (39), a part-of-speech tagger was run over the full corpus. A text-chunking algorithm was then used to identify noun phrases corresponding to a predefined set of grammatical sequences (e.g., the pattern “〈DT〉?〈JJ〉∗〈NN〉+” will extract sequences made of one optional determinant and list of adjectives preceding at least one noun). Extracted multiterms were then gathered under the same class when their stemmed version was identical (“city” and “cities” are then indexed as “city”). Similarly, semantically coherent multiterms like “health care costs,” “cost of health care,” “costs of health care,” “health care costs,” and so on, are gathered into the same class. We index different terms classes and denote their frequency f(w). From this list, we then select the set W of the 1,000 most frequent multiterms that define the dimensions of comparisons of SoU speeches, which we refer to here as “terms.”

Periodization

One address is delivered per year, with the following exceptions: in 1933, Franklin Roosevelt’s first year in office, he did not give a speech. In four separate years, 1790, 1953, 1961, and 2001, the same president delivered two addresses, or produced two substantially different versions (e.g., oral and written) of the same address. In these years, we simply concatenate the two addresses.

To capture the global evolution of vocabulary over time, each one of the set of each address published at time t: At is first modeled as a tf.idf vector of dimension 1,000. Vector components are given by the frequency of word w in a particular address ft(w) multiplied by the inverse of its document frequency idf over the entire corpus, that is, log(225/f(w)):At(w)=ft(w)log(225f(i)).

We normalize each speech’s vector such that At sums to 1 and simply define A1933 as a duplicate version of the previous year. A classic cosine similarity measure (40) is then used to compare each pair of speeches over time, resulting in the matrix plotted in Fig. 1. More precisely, the dissimilarity matrix D is a 225×225 matrix with components computed as follows:D(t,t′)=1−∑w∈WAt(w)At′(w).

We then detect periods by defining a partition of the total time range so as to minimize the difference between speeches falling on either side of it (and conversely, maximizing the average difference between two speeches falling on opposite sides). Given a partition dividing the overall time period at year t into two subperiods, p−(t)=[1790:t−1] and p+(t)=[t:2014], defined as strictly longer than 1 y, we compute the average dissimilarity between distinct speeches: H(p)=(∑(t1,t2)∈p2,t1≠t2D(t1,t2))/(|p|(|p|−1)), which measures the heterogeneity of speeches given during period p. We then attempt to find the cutting year that minimizes the global heterogeneity of the induced partition, defined simply as the weighted average of the heterogeneity score for each subblock, where weights are defined by the relative size of each subblock. Once the best cut point is found (1917), we repeat the process to find the secondary cut points before and after 1917 as illustrated in Fig. S1.

Fig. S1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. S1.

A two-step process for identifying primary (Left) and secondary (Right) optimal cut points over SoU history. Once the cut point minimizing the homogeneity has been identified in 1917, we attempt to find the respective secondary best cuts.

The average similarity of speeches before and after 1917 is markedly different. The longer first period features less dissimilarity between any two given speeches than the second—and thus appears notably more homogenous in terms’ use—revealing a difference between the premodern and modern eras in the pace of change in objects of political concern. We considered the average dissimilarity scores of 100 random samples of 50 y from each period, for terms vectors of 500, 1,000, and 1,500. The greater dissimilarity of the modern period is significant and robust when considering different numbers of terms, as shown in Table S1.

View this table:
  • View inline
  • View popup
Table S1.

Average dissimilarity of random pre- and post-1917 subperiods

If we now reduce our original distance matrix to a 4 × 4 matrix corresponding to the four detected periods: p1=[1790:1815],p2=[1816:1916],p3=[1917:1949],p4=[1950:2014], we obtain the resulting matrix plotted Fig. 2. We observe that the boundaries of our four periods do not correlate with the years in which the modality of delivery of the SoU changed—namely, 1801, 1913, 1947, 1956, and 2002—thus increasing confidence in the stability of SoU as a cultural form.

Semantic Network

The process underlying the semantic network construction is fourfold. First, a co-occurrence matrix is built, from which a similarity matrix is induced. This adjacency matrix is then filtered to produce a semantic network. Finally, a community detection algorithm enables us to identify clusters, which we interpret as categories of political discourse.

  • i) We define the initial co-occurrence matrix Ct as the number of joint appearances of distinct terms w1 and w2 in the same paragraph of speeches published at time t. This temporal matrix can be summed over years to yield an aggregated co-occurrence matrix Cp over any given time period. Subsequently, we drop reference to time, as the principles for semantic network construction do not depend on the time period considered. This co-occurrence matrix may also be computed considering any subset of the vocabulary W∈W, based on the frequency of terms at a given period p. For the sake of clarity, in the following pages, notations do not mention the dependence on vocabulary size.

  • ii) A proximity score is then computed to measure the relatedness of each pair of terms, yielding a weighted semantic network with terms as nodes and edge weights indexing similarity. We use a distributional measure that calculates the relatedness of two terms by comparing their respective context. The measure falls into the category of difference-weighted mutual information based models, which exhibit good performance at pseudodisambiguation tasks (33):

S(w1,w2)=∑c∈W\{w1,w2},I(w1,c)>0min(I(w1,c),I(w2,c))∑c∈W\{w1,w2},I(w1,c)>0I(w1,c),where I(w,c), the pointwise mutual information between terms w and c, plays the role of a weight function over the preferential contexts c of w, and where a context c is considered a feature of word w if their co-occurrence is greater than would be expected if distributions were uncorrelated. Note that this measure is not symmetric and that, in the resulting similarity network, ties are weighted and directed.

  • iii) The semantic network is next filtered to conserve only edges above a given threshold θ. This threshold is automatically computed so that the final network is connected, and its total number of edges is minimized, at the expense of loosely connected terms. More precisely, starting from θ=0, we progressively increase the threshold until we reach the critical parameter for which a component larger than two nodes detaches from the principal connected component. (We thus remove single nodes that are only weakly connected to the network, which results in a global network featuring less than 1,000 nodes.) More precisely, starting from θ=0, we progressively increase the threshold until we reach the critical parameter for which a component larger than two nodes detaches from the principal connected component. For the global network, θc=0.39. This high threshold value only conserves significant relationships between terms, and subsequently allows for the emergence or clearly cut clusters. The resulting semantic network S connects every pair of nodes (w1,w2) whose similarity is above this threshold at strength S(w1,w2).

  • iv) A community detection algorithm (34) is applied to the resulting network to identify cohesive subgraphs that we refer to as discursive categories. We then identify the set of categories Φ={ϕ1;ϕ2;…;ϕl}, where l is the number of clusters automatically identified in the current semantic network. Inducing discursive categories through co-occurrence provides a rich perspective on the inner structure of each topic, illuminating the individual connections between words, the positioning of terms in the clusters, and proximity between topics.

Network Mapping.

Network mapping provides a visual aid for interpretation. We use a classic force directed layout to spatialize our semantic networks. The nodes are then colored according to the cluster to which they belong (Fig. S2). For each node, we also define node to cluster contribution score ν→, as the weighted ratio of edges connecting terms that belong to the same cluster. For a term w belonging to a discursive category ϕ, its contribution score will be measured as ν→ϕ(w)=(∑c∈ϕS(w,c))/(∑c∈WS(w,c)). This score measures the extent to which the neighborhood of a term comprises terms that have been assigned to the same community. Conversely, we also define the cluster to node contribution score ν← as the weighted proportion of internal edges a node contribute to: ν←ϕ(w)=(∑c∈ϕS(w,c))/(∑(c1,c2)∈ϕ2S(c1,c2)). Note that the final mapping provides not just a way to access the list of terms “contained” in a given cluster, it also affords a view of the inner structure of each cluster and the connections between clusters. Nodes scale with terms’ frequency in the corpus, and their opacity is correlated with the product of their contribution scores.

Fig. S2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. S2.

The global semantic network of the SoU address, featuring 887 terms.

Several maps have been produced following the same methodology in this paper:

  • The “global” map (Fig. 3 and Fig. S2) featuring the entire vocabulary W over the whole time period P.

  • The four maps corresponding to the four periods detected (plotted in Fig. S4). Here, we restrain the vocabulary to words occurring at least as frequently as the least frequent word over the whole time range, i.e., 20. A different value of θ has been detected and used to filter the networks at each time period.

  • Finally, we produced 10 networks comprising the 300 most frequently occurring terms over a given “local” historical period, covering the following overlapping sequence of 40-y-long periods: [1790:1834];[1815:1854];…;[1955:1994];[1975:2014]. Note that we extended the initial period, such that it covers 45 y. The different temporal networks thus likely feature different vocabularies at each period pi.

In this case, we were interested in comparing discursive categories over time; we thus defined a unique threshold equal to the smallest θc=0.45 value necessary to preserve the connectedness of all local historical networks.

The ego networks produced for illustrative purposes in Fig. 1 are computed using a different proximity measure. More precisely, we first compute the Poisson significance measure (41) between each word in Cp for each period p. We then filter the resulting network considering only the 10 closest neighbors of each word. The ego network featuring every neighbor of word “constitution” is then extracted. “Constitution” is then dropped from the network as it is linked to every other node. A Louvain community detection algorithm (34) is finally run to unveil the cohesive subgraphs of the network, and nodes are colored accordingly. Node size scales with a term’s number of occurrences at a given period.

Paragraphs Projection.

In general, maps also feature circles centered over terms composing each cluster. These circles’ area scales with the number of paragraphs assigned to each cluster according to a projection procedure, which we also rely upon to construct Fig. 4. The procedure is as follows: we first use a vector space model to describe both paragraphs and clusters. Each paragraph is modeled as a tf.idf vector, in which nonnull components are assigned term frequency (tf) in the paragraph times their inverse document frequency (idf). Cluster vectors feature every member term, weighted by its contribution score ν→. The scalar product of these two vectors measures how well a topic describes the content of a paragraph. The size of a discursive category as it features on the terms maps is scaled simply to the number of paragraphs that have been assigned to this category. Note that this only provides complementary information to the terms maps. As such, the projection procedure affects neither the structure of the semantic network nor the shape of the streams of the river network. The results of the projection procedure, both in terms of the relative representation of clusters over the corpus, and the temporal trends displayed in Fig. 4, are robust to different threshold choices and node-weighting strategies.

River Networks.

The maps computed for each of our 10 periods pi provide the set of discursive categories particular to their historical moment. Identifying how those categories evolve through time entails comparing the composition of clusters in successive time periods. We assign each discursive category a vector over its constitutive terms, weighted by their individual contribution score ν←. We then compute the Bhattacharyya distance between any pair of temporally successive clusters considering their respective contribution vectors. A category ϕi at period p is then linked to categories in the previous period with the following intertemporal proximity measure:r(ϕip,ϕjp−1)=1−12∑w∈(ϕip∪ϕjp−1)(ν←ϕi(w)−ν←ϕj(w))2.

This process creates a weighted network in which edges connect clusters in time. This intertemporal structure is then filtered to keep only most significant transitions. We plot the distribution of intertemporal strengths (Fig. S3). The minimum of the distribution helps us to determine a threshold value separating on one side the spurious and on the other side significant intertemporal connection strengths. Examining the distribution, we identify a range of weights between 0.33 and 0.38 where no connections are found. We thus choose 0.38 as the threshold under which any intertemporal connection will be considered spurious. A thorough manual exploration of every transition show that this threshold indeed selects only pertinent connections as exemplified in Fig. 6, which shows transition close to the threshold. We also compute a series of network-level measures to appraise the pace of change of objects that structure local political discourse. We also define a series of measures at the level of the local network: new nodes, new edges, and dyads changing their community membership. We apply these measures to the nine transitions between our 10 periods. On all measures, the change ratio between periods 7 and 8 always attains the most extreme value, the node change ratio in this period being an outlier according to a Grubb’s test (P = 0.05).

Fig. S3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. S3.

Intertemporal proximity distribution. The strengths r of intertemporal links connecting successive categories shows a bimodal distribution, which allows us to define a threshold that separates spurious and significant connections at the minimum. Note that, for the sake of clarity, we plot the distribution of strengths starting from 0.05.

Fig. S4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. S4.

Semantic networks for four identified periods. For each period, featured terms are restricted to those that occur 20 times, i.e., at least as frequently as the rarest of the 1,000 terms for the whole period. Clockwise from Upper Left: 1790–1816 (featuring 137 terms); 1817–1916 (836 terms); 1917–1949 (400 terms); 1950–2014 (529 terms).

Acknowledgments

We thank Christopher Muller, Adam Reich, Shamus Khan, Suresh Naidu, Christopher Blattman, Timothy Shenk, David Hollinger, David Bearman, and the XS workshop at Columbia’s Department of Sociology for helpful comments. Maps were produced using the CorText platform. Support from the Interdisciplinary Center for Theory and Empirics at Columbia University is gratefully acknowledged.

Footnotes

  • ↵1To whom correspondence should be addressed. Email: psb17{at}columbia.edu.
  • This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected in 2014.

  • Author contributions: A.R. and P.S.B. designed research; A.R., J.-P.C., and P.S.B. performed research; J.-P.C. contributed new reagents/analytic tools; A.R., J.-P.C., and P.S.B. analyzed data; and A.R. and P.S.B. wrote the paper.

  • Reviewers: R.L.B., University of Arizona; and J.M., University of California, Santa Barbara.

  • The authors declare no conflict of interest.

  • This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1512221112/-/DCSupplemental.

Freely available online through the PNAS open access option.

References

  1. ↵
    1. Tulis J
    (1987) The Rhetorical Presidency (Princeton Univ Press, Princeton)
    .
  2. ↵
    1. Cohen JE
    (1995) Presidential rhetoric and the public agenda. Am J Pol Sci 39(1):87–107
    .
    OpenUrlCrossRef
  3. ↵
    1. Hill KQ
    (1998) The policy agendas of the president and the mass public: A research validation and extension. Am J Pol Sci 42(4):1328–1334
    .
    OpenUrlCrossRef
  4. ↵
    1. Edwards GC,
    2. Wood BD
    (1999) Who influences whom? The President, Congress, and the media. Am Polit Sci Rev 93(02):327–344
    .
    OpenUrlCrossRef
  5. ↵
    1. Zarefsky D
    (2008) Strategic maneuvering in political argumentation. Argumentation 22(3):317–330
    .
    OpenUrlCrossRef
  6. ↵
    1. Bimes T,
    2. Mulroy Q
    (2004) The rise and decline of presidential populism. Stud Am Polit Dev 18(2):136–159
    .
    OpenUrl
  7. ↵
    1. Lazar A,
    2. Lazar MM
    (2004) The discourse of the new world order: Out-casting the double face of threat. Discourse Soc 15(2-3):223–242
    .
    OpenUrlAbstract
  8. ↵
    1. Teten RL
    (2003) Evolution of the modern rhetorical presidency: Presidential presentation and development of the State of the Union address. Pres Stud Q 33(2):333–346
    .
    OpenUrlCrossRef
  9. ↵
    1. Laracey M
    (2009) The rhetorical presidency today: How does it stand up? Pres Stud Q 39(4):908–931
    .
    OpenUrlCrossRef
  10. ↵
    1. Mohr JW
    (2013) Graphing the grammar of motives in U.S. national security strategies: Cultural interpretation, automated text analysis and the drama of global politics. Poetics 41(6):670–700
    .
    OpenUrlCrossRef
  11. ↵
    1. Callon M,
    2. Courtial JP,
    3. Laville F
    (1991) Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry. Scientometrics 22(1):155–205
    .
    OpenUrlCrossRef
  12. ↵
    1. Cancho RF,
    2. Solé RV
    (2001) The small world of human language. Proc R Soc Lond B Biol Sci 268(1482):2261–2265
    .
    OpenUrlAbstract/FREE Full Text
  13. ↵
    1. Grimmer J,
    2. Stewart BM
    (2013) Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Polit Anal 21(3):267–297
    .
    OpenUrlAbstract/FREE Full Text
  14. ↵
    1. Klingenstein S,
    2. Hitchcock T,
    3. DeDeo S
    (2014) The civilizing process in London’s Old Bailey. Proc Natl Acad Sci USA 111(26):9419–9424
    .
    OpenUrlAbstract/FREE Full Text
  15. ↵
    1. Blei DM,
    2. Ng AY,
    3. Jordan MI
    (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
    .
    OpenUrlCrossRef
  16. ↵
    1. Chuang J,
    2. Ramage D,
    3. Manning C,
    4. Heer J
    (2012) Interpretation and trust: Designing model-driven visualizations for text analysis. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Association for Computing Machinery, New York), pp 443–452
    .
  17. ↵
    1. Newman D,
    2. Noh Y,
    3. Talley E,
    4. Karimi S,
    5. Baldwin T
    (2010) Evaluating topic models for digital libraries. Proceedings of the 10th Annual Conference on Digital Libraries (Association for Computing Machinery, New York), pp 215–224
    .
  18. ↵
    1. Mimno D,
    2. Blei D
    (2011) Bayesian checking for topic models. Proceedings of the Conference on Empirical Methods in Natural Language Processing (Association for Computational Linguistics, Stroudsburg, PA), pp 227–237
    .
  19. ↵
    1. Tang J,
    2. Meng Z,
    3. Nguyen X,
    4. Mei Q,
    5. Zhang M
    (2014) Understanding the limiting factors of topic modeling via posterior contraction analysis. Proceedings of the 31st International Conference on Machine Learning (Journal of Machine Learning Research, Microtome Publishing, Brookline, MA), Vol 32
    .
  20. ↵
    1. Schmidt BM
    (2012) Words alone: Dismantling topic models in the humanities. Journal of Digital Humanities 2(1):49–65
    .
    OpenUrl
  21. ↵
    1. Hofmann T
    (1999) Probabilistic latent semantic indexing. Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, eds Xing EP, Jebara T (Association for Computing Machinery, New York), pp 50–57
    .
  22. ↵
    1. Inouye D,
    2. Ravikumar P,
    3. Dhillon I
    (2014) Admixture of Poisson MRFs: A topic model with word dependencies. Proceedings of the 31st International Conference on Machine Learning (Journal of Machine Learning Research, Microtome Publishing, Brookline, MA), Vol 32
    .
  23. ↵
    1. Rosen-Zvi M,
    2. Griffiths T,
    3. Steyvers M,
    4. Smyth P
    (2004) The author-topic model for authors and documents. Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (AUAI Press, Arlington, VI), pp 487–494
    .
  24. ↵
    1. Wang C,
    2. Blei D,
    3. Heckerman D
    (2012) Continuous time dynamic topic models. arXiv:1206.3298
    .
  25. ↵
    1. Wang X,
    2. McCallum A
    (2006) Topics over time: A non-Markov continuous-time model of topical trends. Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, New York), pp 424–433
    .
  26. ↵
    1. Blei DM,
    2. Lafferty JD
    (2006) Dynamic topic models. Proceedings of the 23rd International Conference on Machine Learning (ACM, New York), pp 113–120
    .
  27. ↵
    1. Gao Z, et al.
    (2011) Tracking and connecting topics via incremental hierarchical dirichlet processes. Data Mining (ICDM), 2011 IEEE 11th International Conference, eds Cook D, Pei J, Wang W, ZaÏane O, Wu X (IEEE and CPS Conference Publishing Services, Los Alamitos, CA), pp 1056–1061
    .
  28. ↵
    1. Palla G,
    2. Barabási AL,
    3. Vicsek T
    (2007) Quantifying social group evolution. Nature 446(7136):664–667
    .
    OpenUrlCrossRefPubMed
  29. ↵
    1. Hopcroft J,
    2. Khan O,
    3. Kulis B,
    4. Selman B
    (2004) Tracking evolving communities in large linked networks. Proc Natl Acad Sci USA 101(Suppl 1):5249–5253
    .
    OpenUrlAbstract/FREE Full Text
  30. ↵
    1. Rosvall M,
    2. Bergstrom CT
    (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci USA 105(4):1118–1123
    .
    OpenUrlAbstract/FREE Full Text
  31. ↵
    1. Mucha PJ,
    2. Richardson T,
    3. Macon K,
    4. Porter MA,
    5. Onnela JP
    (2010) Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980):876–878
    .
    OpenUrlAbstract/FREE Full Text
  32. ↵
    1. Chavalarias D,
    2. Cointet JP
    (2013) Phylomemetic patterns in science evolution—the rise and fall of scientific fields. PLoS One 8(2):e54847
    .
    OpenUrlCrossRefPubMed
  33. ↵
    1. Weeds J,
    2. Weir D
    (2005) Co-occurrence retrieval: A flexible framework for lexical distributional similarity. Comput Linguist 31(4):439–475
    .
    OpenUrlCrossRef
  34. ↵
    1. Blondel VD,
    2. Guillaume JL,
    3. Lambiotte R,
    4. Lefebvre E
    (2008) Fast unfolding of communities in large networks. J Stat Mech 2008(10):10008
    .
    OpenUrlCrossRef
  35. ↵
    1. Brodsky A
    (2000) Grover Cleveland: A Study in Character (MacMillan, New York)
    .
  36. ↵
    1. Hollinger DA
    (1989) In the American Province: Studies in the History and Historiography of Ideas (JHU Press, Baltimore)
    .
  37. ↵
    1. Rodgers DT
    (1998) Atlantic crossings (Harvard Univ Press, Cambridge, MA)
    .
    1. Weinstein J
    (1968) The Corporate Ideal in the Liberal State, 1900–1918 (Beacon Press, Boston), pp 105–110
    .
  38. ↵
    1. Schmid H
    (1995) Treetagger: A language independent part-of-speech tagger. Institut fr Maschinelle Sprachverarbeitung. Universitas (Stuttg) 43:28
    .
    OpenUrl
  39. ↵
    1. Salton G,
    2. Buckley C
    (1988) Term-weighting approaches in automatic text retrieval. Inf Process Manage 24(5):513–523
    .
    OpenUrlCrossRef
  40. ↵
    1. Bordag S
    (2008) A comparison of co-occurrence and similarity measures as simulations of context. Computational Linguistics and Intelligent Text Processing, ed Gelbukh A (Springer, Berlin, Heidelberg), Vol 4919, pp 52–63
    .
PreviousNext
Back to top
Article Alerts
Email Article

Thank you for your interest in spreading the word on PNAS.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Lexical shifts, substantive changes, and continuity in State of the Union discourse, 1790–2014
(Your Name) has sent you a message from PNAS
(Your Name) thought you would like to see the PNAS web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
State of the Union
Alix Rule, Jean-Philippe Cointet, Peter S. Bearman
Proceedings of the National Academy of Sciences Sep 2015, 112 (35) 10837-10844; DOI: 10.1073/pnas.1512221112

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
State of the Union
Alix Rule, Jean-Philippe Cointet, Peter S. Bearman
Proceedings of the National Academy of Sciences Sep 2015, 112 (35) 10837-10844; DOI: 10.1073/pnas.1512221112
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Mendeley logo Mendeley

Article Classifications

  • Social Sciences
  • Social Sciences
Proceedings of the National Academy of Sciences: 112 (35)
Table of Contents

Submit

Sign up for Article Alerts

Jump to section

  • Article
    • Abstract
    • Historical Background
    • Methodological Background
    • Methods
    • Results
    • Discussion
    • Linguistic Processing
    • Periodization
    • Semantic Network
    • Acknowledgments
    • Footnotes
    • References
  • Figures & SI
  • Info & Metrics
  • PDF

You May Also be Interested in

Water from a faucet fills a glass.
News Feature: How “forever chemicals” might impair the immune system
Researchers are exploring whether these ubiquitous fluorinated molecules might worsen infections or hamper vaccine effectiveness.
Image credit: Shutterstock/Dmitry Naumov.
Reflection of clouds in the still waters of Mono Lake in California.
Inner Workings: Making headway with the mysteries of life’s origins
Recent experiments and simulations are starting to answer some fundamental questions about how life came to be.
Image credit: Shutterstock/Radoslaw Lecyk.
Cave in coastal Kenya with tree growing in the middle.
Journal Club: Small, sharp blades mark shift from Middle to Later Stone Age in coastal Kenya
Archaeologists have long tried to define the transition between the two time periods.
Image credit: Ceri Shipton.
Mouse fibroblast cells. Electron bifurcation reactions keep mammalian cells alive.
Exploring electron bifurcation
Jonathon Yuly, David Beratan, and Peng Zhang investigate how electron bifurcation reactions work.
Listen
Past PodcastsSubscribe
Panda bear hanging in a tree
How horse manure helps giant pandas tolerate cold
A study finds that giant pandas roll in horse manure to increase their cold tolerance.
Image credit: Fuwen Wei.

Similar Articles

Site Logo
Powered by HighWire
  • Submit Manuscript
  • Twitter
  • Facebook
  • RSS Feeds
  • Email Alerts

Articles

  • Current Issue
  • Special Feature Articles – Most Recent
  • List of Issues

PNAS Portals

  • Anthropology
  • Chemistry
  • Classics
  • Front Matter
  • Physics
  • Sustainability Science
  • Teaching Resources

Information

  • Authors
  • Editorial Board
  • Reviewers
  • Subscribers
  • Librarians
  • Press
  • Cozzarelli Prize
  • Site Map
  • PNAS Updates
  • FAQs
  • Accessibility Statement
  • Rights & Permissions
  • About
  • Contact

Feedback    Privacy/Legal

Copyright © 2021 National Academy of Sciences. Online ISSN 1091-6490