Parallelizing pincer search using CUDA-A conceptual idea

Anchit Bhatia, Divyanshu Sharma, S. Chethan

Research output: Contribution to journal › Article

Abstract

Recent years have seen a meteoric increase in available data, from which automated, data-driven techniques can extract useful knowledge. Data mining is an important step in this process of knowledge discovery. One of the key problems in most data mining applications is discovering frequent item sets. Scanning very large data sets to discover frequent item sets is computationally expensive. A conventional multi-core processor might not have multi-threading capabilities effective enough to process large amounts of data, which leads to the sequential execution of a considerably large number of processes. Such sequential execution leads to high computation times due to pipeline latency and other issues. Because of this limitation, there is increasing interest among researchers in developing parallel data mining algorithms for faster execution and efficient use of available GPU architectures. Pincer search is one of the data mining algorithms used to discover the maximum frequent sets. The pincer search algorithm reduces both the number of times the database is scanned and the number of candidates considered. In this study, we discuss a way to parallelize the pincer search algorithm to further speed up the process of discovering maximum frequent sets.
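
As a rough illustration of the problem the abstract describes, the Python sketch below counts the support of candidate item sets and keeps the maximal frequent ones. It is not the authors' method, which the abstract does not detail. The function names, the toy transactions, and the brute-force enumeration of candidates are all assumptions made for illustration; pincer search itself prunes candidates rather than enumerating every subset. The point of the sketch is that each candidate's support count is independent of the others, which is the kind of work that could plausibly be assigned to separate GPU threads.

```python
from itertools import combinations

# Toy database (hypothetical): each transaction is a set of items.
TRANSACTIONS = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c"},
]


def support_counts(transactions, candidates):
    """Count, for each candidate item set, the transactions that contain it.

    Every candidate is counted independently of the others, so this loop is
    the natural data-parallel step (one candidate per thread, for example).
    """
    tx_sets = [frozenset(t) for t in transactions]
    return {c: sum(1 for t in tx_sets if c <= t) for c in candidates}


def all_candidates(transactions):
    """Brute-force enumeration of every non-empty item set (illustration only)."""
    items = sorted(set().union(*transactions))
    return [
        frozenset(combo)
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


def maximal_frequent_sets(transactions, min_support):
    """Return the frequent item sets that have no frequent proper superset."""
    counts = support_counts(transactions, all_candidates(transactions))
    frequent = {c for c, n in counts.items() if n >= min_support}
    return {s for s in frequent if not any(s < other for other in frequent)}
```

With the toy data and a minimum support of 3, `maximal_frequent_sets(TRANSACTIONS, 3)` yields the three pairs {a, b}, {a, c}, and {b, c}, because the full set {a, b, c} appears in only two transactions.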

Original language: English
Pages (from-to): 3098-3101
Number of pages: 4
Journal: Journal of Engineering and Applied Sciences
Volume: 12
Issue number: 12
DOI: 10.3923/jeasci.2017.3098.310
Publication status: Published - 01-01-2017
Externally published: Yes

Fingerprint

Data mining
Pipelines
Scanning

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Bhatia, Anchit ; Sharma, Divyanshu ; Chethan, S. / Parallelizing pincer search using CUDA-A conceptual idea. In: Journal of Engineering and Applied Sciences. 2017 ; Vol. 12, No. 12. pp. 3098-3101.
@article{702137f3a36d4ae2bbb060b76b109fa4,
title = "Parallelizing pincer search using CUDA-A conceptual idea",
author = "Anchit Bhatia and Divyanshu Sharma and S. Chethan",
year = "2017",
month = "1",
day = "1",
doi = "10.3923/jeasci.2017.3098.310",
language = "English",
volume = "12",
pages = "3098--3101",
journal = "Journal of Engineering and Applied Sciences",
issn = "1816-949X",
publisher = "Medwell Journals",
number = "12",

}

TY - JOUR

T1 - Parallelizing pincer search using CUDA-A conceptual idea

AU - Bhatia, Anchit

AU - Sharma, Divyanshu

AU - Chethan, S.

PY - 2017/1/1

Y1 - 2017/1/1

UR - http://www.scopus.com/inward/record.url?scp=85029213830&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85029213830&partnerID=8YFLogxK

U2 - 10.3923/jeasci.2017.3098.310

DO - 10.3923/jeasci.2017.3098.310

M3 - Article

VL - 12

SP - 3098

EP - 3101

JO - Journal of Engineering and Applied Sciences

JF - Journal of Engineering and Applied Sciences

SN - 1816-949X

IS - 12

ER -