UR Research > Computer Science Department > CS Systems Technical Reports >

New Algorithms for Fast Discovery of Association Rules

URL to cite or link to: http://hdl.handle.net/1802/501

97.tr651.New_algorithms_for_fast_discovery_of_association_rul.ps   395.27 KB (No. of downloads : 256)
Association rule discovery has emerged as an important problem in knowledge discovery and data mining. The association mining task consists of identifying the frequent itemsets, and then forming conditional implication rules among them. In this paper we present efficient algorithms for the discovery of frequent itemsets, which forms the compute intensive phase of the task. The algorithms utilize the structural properties of frequent itemsets to facilitate fast discovery. The related database items are grouped together into clusters representing the potential maximal frequent itemsets in the database. Each cluster induces a sub-lattice of the itemset lattice. Efficient lattice traversal techniques are presented, which quickly identify all the true maximal frequent itemsets, and all their subsets if desired. We also present the effect of using different database layout schemes combined with the proposed clustering and traversal techniques. The proposed algorithms scan a (pre-processed) database only once, addressing the open question in association mining, whether all the rules can be efficiently extracted in a single database pass. We experimentally compare the new algorithms against the previous approaches, obtaining improvements of more than an order of magnitude for our test databases.
Contributor(s):
Mohammed Javeed Zaki - Author

Srinivasan Parthasarathy - Author

Mitsunori Ogihara (1963 - ) - Author

Wei Li - Author

Primary Item Type:
Technical Report
Series/Report Number:
UR CSD / TR651
Language:
English
Subject Keywords:
frequent itemsets;association rules;knowledge discovery and data mining;parallel algorithms;maximal hypergraph cliques;lattice traversals
First presented to the public:
7/22/2004
Original Publication Date:
7/1997
Previously Published By:
University of Rochester. Computer Science Department.
License Grantor / Date Granted:
Suzanne S. Bell / 2004-07-22 21:39:32.0 ( View License )
Date Deposited
2004-07-22 21:39:32.0
Date Last Updated
2012-09-26 16:35:14.586719
Submitter:
Suzanne S. Bell

Copyright © This item is protected by copyright, with all rights reserved.

All Versions

Thumbnail Name Version Created Date
New Algorithms for Fast Discovery of Association Rules1 2004-07-22 21:39:32.0