Unbiased Recursive Partitioning: A Conditional Inference Framework

Hothorn, Torsten and Hornik, Kurt and Zeileis, Achim (2004) Unbiased Recursive Partitioning: A Conditional Inference Framework. Research Report Series / Department of Statistics and Mathematics, 8. Institut für Statistik und Mathematik, WU Vienna University of Economics and Business, Vienna.

Abstract

Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of the exhaustive search procedures usually applied to fit such models have been known for a long time: overfitting and a selection bias towards covariates with many possible splits or missing values. While pruning procedures are able to solve the overfitting problem, the variable selection bias still seriously affects the interpretability of tree-structured regression models. Unbiased procedures have been suggested for some special cases, but they lack a common theoretical foundation. We propose a unified framework for recursive partitioning which embeds tree-structured regression models into a well-defined theory of conditional inference procedures. Stopping criteria based on multiple test procedures are implemented, and it is shown that the predictive performance of the resulting trees is as good as the performance of established exhaustive search procedures. It turns out that the partitions, and therefore the models, induced by the two approaches are structurally different, indicating the need for an unbiased variable selection. The methodology presented here is applicable to all kinds of regression problems, including nominal, ordinal, numeric, censored as well as multivariate response variables and arbitrary measurement scales of the covariates. Data from studies on animal abundance, glaucoma classification, node-positive breast cancer and mammography experience are re-analyzed.
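
The idea of separating variable selection from split search, with a stopping criterion based on a multiple test procedure, can be illustrated with a minimal sketch. It is not the implementation from the report: the association statistic (absolute sample covariance), the Monte Carlo permutation test, the Bonferroni adjustment, and all function and parameter names (abs_covariance, permutation_p_value, select_split_variable, alpha, n_perm) are assumptions chosen for illustration only.

```python
import random


def abs_covariance(x, y):
    """Absolute sample covariance, used here as a simple association statistic."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    return abs(sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n)


def permutation_p_value(x, y, n_perm=999, rng=None):
    """Monte Carlo permutation p-value for independence of x and y."""
    rng = rng or random.Random(0)
    observed = abs_covariance(x, y)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)  # break any association between x and y
        if abs_covariance(x, y_perm) >= observed:
            hits += 1
    # The +1 terms count the observed arrangement itself.
    return (hits + 1) / (n_perm + 1)


def select_split_variable(covariates, y, alpha=0.05, n_perm=999, seed=0):
    """Pick the covariate with the smallest Bonferroni-adjusted p-value.

    Returns (None, adjusted) when no adjusted p-value is at most alpha,
    which plays the role of the stopping criterion in this sketch.
    """
    rng = random.Random(seed)
    m = len(covariates)
    adjusted = {
        name: min(1.0, m * permutation_p_value(x, y, n_perm, rng))
        for name, x in covariates.items()
    }
    best = min(adjusted, key=adjusted.get)
    if adjusted[best] > alpha:
        return None, adjusted  # stop: no significant association
    return best, adjusted


# Hypothetical data: y depends on x1 only, x2 is pure noise.
x1 = list(range(30))
noise_rng = random.Random(1)
x2 = [noise_rng.random() for _ in range(30)]
y = [2.0 * v for v in x1]

best, adjusted = select_split_variable({"x1": x1, "x2": x2}, y)
print(best, adjusted)
```

Because the covariate is chosen by comparing p-values rather than by the best achievable split, a covariate with many possible split points gains no advantage in this sketch. A split point would then be searched only within the selected covariate.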

Item Type: Paper
Keywords: permutation tests / variable selection / multiple testing / ordinal regression trees / multivariate regression trees
Divisions: Departments > Finance, Accounting and Statistics > Statistics and Mathematics
Depositing User: Repository Administrator
Date Deposited: 22 Jul 2004 17:46
Last Modified: 27 Feb 2017 11:14
URI: http://epub.wu.ac.at/id/eprint/676
