Unbiased Recursive Partitioning: A Conditional Inference Framework

Hothorn, Torsten and Hornik, Kurt ORCID: https://orcid.org/0000-0003-4198-9911 and Zeileis, Achim (2004) Unbiased Recursive Partitioning: A Conditional Inference Framework. Research Report Series / Department of Statistics and Mathematics, 8. Institut für Statistik und Mathematik, WU Vienna University of Economics and Business, Vienna.


Download (553kB)


Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time: Overfitting and a selection bias towards covariates with many possible splits or missing values. While pruning procedures are able to solve the overfitting problem, the variable selection bias still seriously effects the interpretability of tree-structured regression models. For some special cases unbiased procedures have been suggested, however lacking a common theoretical foundation. We propose a unified framework for recursive partitioning which embeds tree-structured regression models into a well defined theory of conditional inference procedures. Stopping criteria based on multiple test procedures are implemented and it is shown that the predictive performance of the resulting trees is as good as the performance of established exhaustive search procedures. It turns out that the partitions and therefore the models induced by both approaches are structurally different, indicating the need for an unbiased variable selection. The methodology presented here is applicable to all kinds of regression problems, including nominal, ordinal, numeric, censored as well as multivariate response variables and arbitrary measurement scales of the covariates. Data from studies on animal abundance, glaucoma classification, node positive breast cancer and mammography experience are re-analyzed.

Item Type: Paper
Keywords: permutation tests / variable selection / multiple testing / ordinal regression trees / multivariate regression trees
Divisions: Departments > Finance, Accounting and Statistics > Statistics and Mathematics
Depositing User: Repository Administrator
Date Deposited: 22 Jul 2004 15:46
Last Modified: 24 Oct 2019 13:41
URI: https://epub.wu.ac.at/id/eprint/676


View Item View Item


Downloads per month over past year

View more statistics