[OC] Don’t count the lies; count the… theys

    by wannagowest

    2 Comments

    1. While Trump did use the word “I” 43% more than Harris, he used the word “they” 23 times as much.

      Tools: Python, Seaborn, Matplotlib, Spacy

      Data: [ABC’s debate transcript from September 10, 2024](https://abcnews.go.com/Politics/harris-trump-presidential-debate-transcript/story?id=113560542), [US Presidential Debate Transcripts 1960-2020 [Kaggle]](https://www.kaggle.com/datasets/arenagrenade/us-presidential-debate-transcripts-19602020)

      Methods: I calculated the enrichment for each word in each speaker’s vocabulary as the log of the ratio between the frequency of that word in Tuesday’s debate for each candidate and the frequency of that word across debates from 1960-2012 (excluding elections with Trump as a candidate).

      Word clouds for each candidate were generated from this enrichment score. I plotted the number of times each candidate used a selection of those enriched words.

    Leave A Reply