
Implications of probabilistic data modeling for rule mining

Hahsler, Michael and Hornik, Kurt and Reutterer, Thomas (2005) Implications of probabilistic data modeling for rule mining. Research Report Series / Department of Statistics and Mathematics, 14. Institut für Statistik und Mathematik, WU Vienna University of Economics and Business, Vienna.



Mining association rules is an important technique for discovering meaningful patterns in transaction databases. The current literature discusses the properties of algorithms for mining associations in great detail. In this paper we investigate the properties of transaction data sets from a probabilistic point of view. We present a simple probabilistic framework for transaction data and its implementation in the R statistical computing environment. The framework can be used to simulate transaction data in which no associations are present. We use such data to explore how well confidence and lift, two popular interest measures used for rule mining, filter noise. Based on the framework, we develop the measure hyperlift and compare it to lift using simulated data and a real-world grocery database.
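
To make the idea of association-free transaction data concrete, the following is a minimal Python sketch, not the authors' R implementation. It assumes that each item enters a transaction independently with a fixed probability, which is one simple way to obtain data without associations. The function names, the item labels, and the probabilities are hypothetical and chosen only for illustration. Confidence and lift are computed from their standard definitions: confidence(X => Y) = supp(X and Y) / supp(X), and lift(X => Y) = confidence(X => Y) / supp(Y).

```python
import random


def simulate_transactions(item_probs, n_transactions, seed=0):
    """Draw transactions in which every item occurs independently.

    Assumption for this sketch: item i appears in a transaction with
    probability item_probs[i], independently of all other items.
    """
    rng = random.Random(seed)
    return [
        {item for item, p in item_probs.items() if rng.random() < p}
        for _ in range(n_transactions)
    ]


def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(transactions, lhs, rhs):
    """Estimate of P(rhs | lhs); NaN if lhs never occurs."""
    supp_lhs = support(transactions, lhs)
    if supp_lhs == 0:
        return float("nan")
    return support(transactions, set(lhs) | set(rhs)) / supp_lhs


def lift(transactions, lhs, rhs):
    """Confidence divided by the support of rhs; NaN if undefined."""
    supp_rhs = support(transactions, rhs)
    if supp_rhs == 0:
        return float("nan")
    return confidence(transactions, lhs, rhs) / supp_rhs


# Hypothetical item probabilities: two frequent and two rare items.
item_probs = {"a": 0.3, "b": 0.2, "c": 0.01, "d": 0.01}
transactions = simulate_transactions(item_probs, n_transactions=20000)

conf_ab = confidence(transactions, {"a"}, {"b"})
lift_ab = lift(transactions, {"a"}, {"b"})
lift_cd = lift(transactions, {"c"}, {"d"})

print("confidence(a => b):", conf_ab)
print("lift(a => b):", lift_ab)
print("lift(c => d):", lift_cd)
```

Because the items are independent by construction, the lift of a rule between the two frequent items should be close to 1 and its confidence close to the support of the right-hand side. For the rare items, only a handful of co-occurrences are expected, so the estimated lift can land far from 1 purely by chance; this is the kind of noise an interest measure would ideally filter out.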

Item Type: Paper
Keywords: data mining / transaction data model / association rules / interest measures
Divisions: Departments > Finance, Accounting and Statistics > Statistics and Mathematics
Depositing User: Repository Administrator
Date Deposited: 02 Mar 2005 11:44
Last Modified: 01 Mar 2017 19:02
URI: http://epub.wu.ac.at/id/eprint/764

