A Dynamic Itemset Counting Based Two-Phase Algorithm for Mining High Utility Itemsets

B. Anup Bhat, S. V. Harish, M. Geetha

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

High Utility Itemset Mining (HUIM) discovers itemsets in a transactional database based on the quantity and unit price of their items. Since its inception, HUIM has evolved as a generalized form of Frequent Itemset Mining (FIM). Unlike the support of an itemset, which is antimonotone and is exploited by algorithms for mining frequent itemsets, the utility measure is neither antimonotone nor monotone. This makes the problem of mining High Utility Itemsets (HUIs) interesting. In the current study, a novel method based on Dynamic Itemset Counting (DIC) is proposed to optimize the Apriori-like Two-Phase (TP) algorithm for mining HUIs. Although the TP algorithm uses the antimonotonicity of the Transaction Weighted Utility (TWU) of itemsets to prune the search space, it generates candidates in a level-wise manner, which requires multiple database scans to test the candidates. The proposed method tests and generates higher-order candidates at different stops during the database scan and segregates the itemsets for further evaluation. Experiments performed on real-time datasets show significant improvement in the execution time of the DIC method when compared to the TP algorithm.
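The quantities named in the abstract can be illustrated with a minimal Python sketch of the baseline Two-Phase idea (not the DIC-based method the paper proposes). The toy database, the unit profits, the threshold, and all function names are hypothetical and chosen only for illustration; the definitions of utility and TWU follow the standard ones in the HUIM literature. For simplicity the sketch rescans the database for every candidate, whereas an actual level-wise implementation would use one scan per level.

```python
from itertools import combinations

# Hypothetical toy data: unit profit per item, and item quantities per transaction.
unit_profit = {"a": 5, "b": 2, "c": 1}
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 6},
    {"b": 4, "c": 3},
    {"a": 1, "b": 1, "c": 1},
]

def transaction_utility(t):
    """Total utility of one transaction: sum of quantity * unit profit."""
    return sum(q * unit_profit[i] for i, q in t.items())

def utility(itemset, db):
    """Exact utility of an itemset over the transactions that contain it."""
    return sum(
        sum(t[i] * unit_profit[i] for i in itemset)
        for t in db
        if all(i in t for i in itemset)
    )

def twu(itemset, db):
    """Transaction Weighted Utility: an antimonotone upper bound on utility."""
    return sum(transaction_utility(t) for t in db if all(i in t for i in itemset))

def two_phase(db, min_util):
    """Phase 1: level-wise candidates pruned by TWU. Phase 2: exact utility check."""
    items = sorted({i for t in db for i in t})
    level = [frozenset([i]) for i in items if twu([i], db) >= min_util]
    candidates = []
    k = 1
    while level:
        candidates.extend(level)
        prev = set(level)
        live_items = sorted({i for s in level for i in s})
        k += 1
        # Keep a k-itemset only if all its (k-1)-subsets survived and its TWU qualifies.
        level = [
            frozenset(c)
            for c in combinations(live_items, k)
            if all(frozenset(sub) in prev for sub in combinations(c, k - 1))
            and twu(c, db) >= min_util
        ]
    return {s: utility(s, db) for s in candidates if utility(s, db) >= min_util}

if __name__ == "__main__":
    # Utility is neither monotone nor antimonotone on this toy data:
    print(utility(["c"], transactions), utility(["a", "c"], transactions))  # 10 22
    print(utility(["a"], transactions), utility(["a", "b"], transactions))  # 20 16
    print(two_phase(transactions, min_util=20))
```

On this toy data, adding an item raises the utility in one case ({c} to {a, c}) and lowers it in another ({a} to {a, b}), which is why utility itself cannot drive Apriori-style pruning, while TWU never increases when an itemset grows.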

Original language: English
Title of host publication: INDICON 2018 - 15th IEEE India Council International Conference
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781538682357
Publication status: Published - 12-2018
Event: 15th IEEE India Council International Conference, INDICON 2018 - Coimbatore, India
Duration: 16-12-2018 to 18-12-2018

Publication series

Name: INDICON 2018 - 15th IEEE India Council International Conference

Conference

Conference: 15th IEEE India Council International Conference, INDICON 2018
Country/Territory: India
City: Coimbatore
Period: 16-12-18 to 18-12-18

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering
  • Electronic, Optical and Magnetic Materials
