Latent Topic Synthesis: Leveraging LLMs for Electoral Ad Analysis

Alexander Brady, Tunazzina Islam. Preprint 2025.

Abstract

Social media platforms play a pivotal role in shaping political discourse, but analyzing their vast and rapidly evolving content remains a major challenge. We introduce an end-to-end framework for automatically generating an interpretable topic taxonomy from an unlabeled corpus. By combining unsupervised clustering with prompt-based labeling, our method leverages large language models to iteratively construct a taxonomy without requiring seed sets or domain expertise. We apply this framework to a large corpus of Meta (previously known as Facebook) political ads from the month ahead of the 2024 U.S. Presidential election. Our approach uncovers latent discourse structures, synthesizes semantically rich topic labels, and annotates topics with moral framing dimensions. We show quantitative and qualitative analysis to demonstrate the effectiveness of our framework. This work supports scalable, interpretable analysis of political messaging on social media, with implications for NLP and computational social science.