Missing Data Imputation using Machine Learning Algorithm for Supervised Learning

D. Cenitta, R. Vijaya Arjunan, Prema K V

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

With a transience rate of over 18 million per year, Heart Disease (HD) has emerged out to be the lethal disease of the world. Data mining-based heart disease diagnosis systems can surely aid cardiac professionals in a timely diagnosis of the patient's condition. In this proposed work, a Python-based data mining system capable of diagnosing the HD using a Decision Tree has been developed. In the methodology, the UCI data repository was taken into consideration with 14 Attributes. In the dataset, there are few missing values (yet found to be hyperparameter), and pre-processing with such missing values is a common yet challenging problem. A mere substitution will give biased results from the data to be observed for HD diagnosis and will certainly affect the value of the learning process in Machine Learning. Therefore, in the proposed work, a missing value imputation is done, which gave better accuracy, and it is trustable.

Original languageEnglish
Title of host publication2021 International Conference on Computer Communication and Informatics, ICCCI 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728158754
DOIs
Publication statusPublished - 27-01-2021
Event2021 International Conference on Computer Communication and Informatics, ICCCI 2021 - Coimbatore, India
Duration: 27-01-202129-01-2021

Publication series

Name2021 International Conference on Computer Communication and Informatics, ICCCI 2021

Conference

Conference2021 International Conference on Computer Communication and Informatics, ICCCI 2021
CountryIndia
CityCoimbatore
Period27-01-2129-01-21

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Biomedical Engineering
  • Health Informatics
  • Instrumentation

Fingerprint Dive into the research topics of 'Missing Data Imputation using Machine Learning Algorithm for Supervised Learning'. Together they form a unique fingerprint.

Cite this