TY - GEN
T1 - A study of various varieties of distributed data mining architectures
AU - Paul, Sukriti
AU - Shetty, Nisha P.
AU - Balachandra, null
PY - 2018/1/1
Y1 - 2018/1/1
N2 - Owing to the explosion of data in today’s world, datasets are enormous, geographically distributed and heterogeneous. Data mining aims extracting useful information from voluminous repositories where data is stored. Predictive analysis of hidden patterns in massive datasets poses to be a challenge. The problems faced while using the data warehousing model for such datasets were privacy, centralization of the data present at multiple independent sites, bandwidth limitation, complexity of integration, and analysis of the data at a global level. Distributed algorithms have been designed to address the same. Distributed data mining (DDM) techniques regard the distributed datasets as one virtual table and assume the existence of a global model which could be designed if the data were combined centrally. This paper presents distributed data mining systems and frameworks for analyzing data and mining the required knowledge from it. Emphasis has been laid on the architectures of such models. Factors like computation resources, communication, hardware, and usage of distributed resources of data have been considered while analyzing or designing distributed algorithms. Such algorithms primarily aim at memory expense and average distribution of working load. Distributed data finds its application in e-commerce, e-business, intrusion detection systems, and sensor networks.
AB - Owing to the explosion of data in today’s world, datasets are enormous, geographically distributed and heterogeneous. Data mining aims extracting useful information from voluminous repositories where data is stored. Predictive analysis of hidden patterns in massive datasets poses to be a challenge. The problems faced while using the data warehousing model for such datasets were privacy, centralization of the data present at multiple independent sites, bandwidth limitation, complexity of integration, and analysis of the data at a global level. Distributed algorithms have been designed to address the same. Distributed data mining (DDM) techniques regard the distributed datasets as one virtual table and assume the existence of a global model which could be designed if the data were combined centrally. This paper presents distributed data mining systems and frameworks for analyzing data and mining the required knowledge from it. Emphasis has been laid on the architectures of such models. Factors like computation resources, communication, hardware, and usage of distributed resources of data have been considered while analyzing or designing distributed algorithms. Such algorithms primarily aim at memory expense and average distribution of working load. Distributed data finds its application in e-commerce, e-business, intrusion detection systems, and sensor networks.
UR - http://www.scopus.com/inward/record.url?scp=85045659728&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85045659728&partnerID=8YFLogxK
U2 - 10.1007/978-981-10-7563-6_9
DO - 10.1007/978-981-10-7563-6_9
M3 - Conference contribution
AN - SCOPUS:85045659728
SN - 9789811075629
T3 - Advances in Intelligent Systems and Computing
SP - 77
EP - 88
BT - Information and Decision Sciences - Proceedings of the 6th International Conference on FICTA
PB - Springer Verlag
T2 - 6th International Conference on Frontiers of Intelligent Computing: Theory and Applications, FICTA 2017
Y2 - 14 October 2017 through 15 October 2017
ER -