Influencing Elections with Statistics: Targeting Voters with Logistic Regression Trees

Rusch, Thomas and Lee, Ilro and Hornik, Kurt ORCID: and Jank, Wolfgang and Zeileis, Achim (2012) Influencing Elections with Statistics: Targeting Voters with Logistic Regression Trees. Research Report Series / Department of Statistics and Mathematics, 117. WU Vienna University of Economics and Business, Vienna.


Download (716kB)


Political campaigning has become a multi-million dollar business. A substantial proportion of a campaign's budget is spent on voter mobilization, i.e., on identifying and influencing as many people as possible to vote. Based on data, campaigns use statistical tools to provide a basis for deciding who to target. While the data available is usually rich, campaigns have traditionally relied on a rather limited selection of information, often including only previous voting behavior and one or two demographical variables. Statistical procedures that are currently in use include logistic regression or standard classification tree methods like CHAID, but there is a growing interest in employing modern data mining approaches. Along the lines of this development, we propose a modern framework for voter targeting called LORET (for logistic regression trees) that employs trees (with possibly just a single root node) containing logistic regressions (with possibly just an intercept) in every leaf. Thus, they contain logistic regression and classification trees as special cases and allow for a synthesis of both techniques under one umbrella. We explore various flavors of LORET models that (a) compare the effect of using the full set of available variables against using only limited information and (b) investigate their varying effects either as regressors in the logistic model components or as partitioning variables in the tree components. To assess model performance and illustrate targeting, we apply LORET to a data set of 19,634 eligible voters from the 2004 US presidential election. We find that augmenting the standard set of variables (such as age and voting history) together with additional predictor variables (such as the household composition in terms of party affiliation and each individual's rank in the household) clearly improves predictive accuracy. We also find that LORET models based on tree induction outbeat the unpartitioned competitors. Additionally, LORET models using both partitioning variables and regressors in the resulting nodes can improve the efficiency of allocating campaign resources while still providing intelligible models.

Item Type: Paper
Keywords: voter targeting / model tree / model-based recursive partitioning / get-out-the-vote / logistic regression / classification tree / campaigning / political marketing / voter identification / voter segmentation
Divisions: Departments > Finance, Accounting and Statistics > Statistics and Mathematics
Depositing User: Josef Leydold
Date Deposited: 12 Mar 2012 17:33
Last Modified: 24 Oct 2019 13:41


View Item View Item


Downloads per month over past year

View more statistics