Hybrid data mining algorithm in cloud computing using MapReduce framework

Siddharth Sahay, Suruchi Khetarpal, Tribikram Pradhan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

'Data mining' has transformed into a ubiquitous term in the world of IT and Computer Science in recent times. Developments in this field have been countless. Using one of Apriori algorithm's numerous variants with a couple of insightful additions can significantly improve upon the existing standard of Data Mining. In this paper a new approach to considerably reduce the time complexity of the database scan has been proposed. This has been achieved by using the MapReduce framework for Hadoop Distributed File System (HDFS). Coupled with Cloud computing, which handles large data sets and processing remotely, the resultant system - that uses MapReduce for the full table scan, the Pincer-Search Algorithm, and Cloud Computing - is a force to reckon with.

Original languageEnglish
Title of host publicationProceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages507-511
Number of pages5
ISBN (Electronic)9781467395458
DOIs
Publication statusPublished - 24-01-2017
Event2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016 - Ramanathapuram, Tamil Nadu, India
Duration: 25-05-201627-05-2016

Conference

Conference2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016
CountryIndia
CityRamanathapuram, Tamil Nadu
Period25-05-1627-05-16

Fingerprint

Cloud computing
Data mining
Computer science
Processing

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Signal Processing
  • Computer Networks and Communications
  • Computer Science Applications

Cite this

Sahay, S., Khetarpal, S., & Pradhan, T. (2017). Hybrid data mining algorithm in cloud computing using MapReduce framework. In Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016 (pp. 507-511). [7831691] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICACCCT.2016.7831691
Sahay, Siddharth ; Khetarpal, Suruchi ; Pradhan, Tribikram. / Hybrid data mining algorithm in cloud computing using MapReduce framework. Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 507-511
@inproceedings{abd712cf3f424bf1b0308c1cb80a7369,
title = "Hybrid data mining algorithm in cloud computing using MapReduce framework",
abstract = "'Data mining' has transformed into a ubiquitous term in the world of IT and Computer Science in recent times. Developments in this field have been countless. Using one of Apriori algorithm's numerous variants with a couple of insightful additions can significantly improve upon the existing standard of Data Mining. In this paper a new approach to considerably reduce the time complexity of the database scan has been proposed. This has been achieved by using the MapReduce framework for Hadoop Distributed File System (HDFS). Coupled with Cloud computing, which handles large data sets and processing remotely, the resultant system - that uses MapReduce for the full table scan, the Pincer-Search Algorithm, and Cloud Computing - is a force to reckon with.",
author = "Siddharth Sahay and Suruchi Khetarpal and Tribikram Pradhan",
year = "2017",
month = "1",
day = "24",
doi = "10.1109/ICACCCT.2016.7831691",
language = "English",
pages = "507--511",
booktitle = "Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

Sahay, S, Khetarpal, S & Pradhan, T 2017, Hybrid data mining algorithm in cloud computing using MapReduce framework. in Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016., 7831691, Institute of Electrical and Electronics Engineers Inc., pp. 507-511, 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016, Ramanathapuram, Tamil Nadu, India, 25-05-16. https://doi.org/10.1109/ICACCCT.2016.7831691

Hybrid data mining algorithm in cloud computing using MapReduce framework. / Sahay, Siddharth; Khetarpal, Suruchi; Pradhan, Tribikram.

Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016. Institute of Electrical and Electronics Engineers Inc., 2017. p. 507-511 7831691.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Hybrid data mining algorithm in cloud computing using MapReduce framework

AU - Sahay, Siddharth

AU - Khetarpal, Suruchi

AU - Pradhan, Tribikram

PY - 2017/1/24

Y1 - 2017/1/24

N2 - 'Data mining' has transformed into a ubiquitous term in the world of IT and Computer Science in recent times. Developments in this field have been countless. Using one of Apriori algorithm's numerous variants with a couple of insightful additions can significantly improve upon the existing standard of Data Mining. In this paper a new approach to considerably reduce the time complexity of the database scan has been proposed. This has been achieved by using the MapReduce framework for Hadoop Distributed File System (HDFS). Coupled with Cloud computing, which handles large data sets and processing remotely, the resultant system - that uses MapReduce for the full table scan, the Pincer-Search Algorithm, and Cloud Computing - is a force to reckon with.

AB - 'Data mining' has transformed into a ubiquitous term in the world of IT and Computer Science in recent times. Developments in this field have been countless. Using one of Apriori algorithm's numerous variants with a couple of insightful additions can significantly improve upon the existing standard of Data Mining. In this paper a new approach to considerably reduce the time complexity of the database scan has been proposed. This has been achieved by using the MapReduce framework for Hadoop Distributed File System (HDFS). Coupled with Cloud computing, which handles large data sets and processing remotely, the resultant system - that uses MapReduce for the full table scan, the Pincer-Search Algorithm, and Cloud Computing - is a force to reckon with.

UR - http://www.scopus.com/inward/record.url?scp=85014159853&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85014159853&partnerID=8YFLogxK

U2 - 10.1109/ICACCCT.2016.7831691

DO - 10.1109/ICACCCT.2016.7831691

M3 - Conference contribution

AN - SCOPUS:85014159853

SP - 507

EP - 511

BT - Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

Sahay S, Khetarpal S, Pradhan T. Hybrid data mining algorithm in cloud computing using MapReduce framework. In Proceedings of 2016 International Conference on Advanced Communication Control and Computing Technologies, ICACCCT 2016. Institute of Electrical and Electronics Engineers Inc. 2017. p. 507-511. 7831691 https://doi.org/10.1109/ICACCCT.2016.7831691