Data Mining with Optimized Two-Dimensional Association Rules

Takeshi Fukuda, Yasuhiko Morimoto, Shinichi Morishita, Takeshi Tokuyama

Research output: Contribution to journalArticle

87 Citations (Scopus)

Abstract

We discuss data mining based on association rules for two numeric attributes and one Boolean attribute. For example, in a database of bank customers, "Age" and "Balance" are two numeric attributes, and "CardLoan" is a Boolean attribute. Taking the pair (Age, Balance) as a point in two-dimensional space, we consider an association rule of the form ((Age, Balance) ∈ P) ⇒ (CardLoan = Yes), which implies that bank customers whose ages and balances fall within a planar region P tend to take out credit card loans with a high probability. We consider two classes of regions, rectangles and admissible (i.e., connected and x-monotone) regions. For each class, we propose efficient algorithms for computing the regions that give optimal association rules for gain, support, and confidence, respectively. We have implemented the algorithms for admissible regions as well as several advanced functions based on them in our data mining system named SONAR (System for Optimized Numeric Association Rules), where the rules are visualized by using a graphic user interface to make it easy for users to gain an intuitive understanding of rules.

Original languageEnglish
Pages (from-to)179-213
Number of pages35
JournalACM Transactions on Database Systems
Volume26
Issue number2
DOIs
Publication statusPublished - 2001 Jun

Keywords

  • F.2.2 [Analysis of Algorithms and Problem Complexity]: Nonnumerical Algorithms and Problems-geometrical problems and computations
  • G.2.1 [Discrete Mathematics]: Combinatorics-combinatorial algorithms

ASJC Scopus subject areas

  • Information Systems

Fingerprint Dive into the research topics of 'Data Mining with Optimized Two-Dimensional Association Rules'. Together they form a unique fingerprint.

  • Cite this