keyboard_arrow_up
The Bases of Association Rules of High Confidence

Authors

Oren Segal1, Justin Cabot-Miller2, Kira Adaricheva2, J.B.Nation3 and Anuar Sharafudinov4, 1,2Hofstra University, USA, 3University of Hawaii, USA and 4AILabs, Kazakhstan

Abstract

We develop a new approach for distributed computing of the association rules of high confidence in a binary table. It is derived from the D-basis algorithm [1], which is performed on multiple sub-tables of a table given by removing several rows at a time. The set of rules is then aggregated using the same approach as the D-basis is retrieved from a larger set of implications. This allows to obtain a basis of association rules of high confidence, which can be used for ranking all attributes of the table with respect to a given fixed attribute using the relevance parameter introduced in [2]. This paper focuses on the technical implementation of the new algorithm. Some testing results are performed on transaction data and medical data.

Keywords

Association rules, implication, binary table, D-basis, parallel computing

Full Text  Volume 8, Number 11