Detection of acute lymphoblastic leukemia using image segmentation and data mining algorithms

Vasundhara Acharya, Preetham Kumar

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Blood is composed of white blood cells, red blood cells, and platelets. Segmentation of the blood smear cells and extraction of features of the cells is essential in the field of medicine. Acute lymphoblastic leukemia is a form of blood cancer caused due to the abnormal increase in the production of immature white blood cells in the bone marrow. It mostly affects the children below 5 years and adults above 50 years of age. Due to the late diagnosis and cost of the devices used for the determination, the mortality rate has increased drastically. Flow cytometry technique that performs automated counting fails to identify the abnormal cells. Manual recount performed using hemocytometer are prone to errors and are imprecise. The proposed work aims to survey different computer-aided system techniques used to segment the blood smear image. The primary objective here is to derive knowledge from the different methodologies used for extracting features from white blood cells and develop a system that would accurately segment the blood smear image by overcoming the drawbacks of the previous works. The objective mentioned above is achieved in two ways. Firstly, a novel algorithm is developed to segment the nucleus and cytoplasm of white blood cell. Secondly, a model is built to extract the features and train the model. The different supervised classifiers are compared, and the one with the highest accuracy is used for the classification. Six hundred images are used in the experimentation. InfoGainAttributeEval and the Ranker Search method are used to achieve the feature selection which in turn helps in improvising the classifier performance. The result shows the classification of the acute lymphoblastic leukemia into its three respective categories namely: ALL-L1, ALL-L2, ALL-L3. The model can differentiate between a normal peripheral blood smear and an abnormal blood smear. The extracted feature values of a cancerous cell and a normal cell are also shown. The performance of the model is evaluated using the test images stained with various stains. The proposed algorithm achieved an overall accuracy of 98.6%. The promising results show that it can be used as a diagnostic tool by the pathologists. [Figure not available: see fulltext.].

Original languageEnglish
Pages (from-to)1783-1811
Number of pages29
JournalMedical and Biological Engineering and Computing
Volume57
Issue number8
DOIs
Publication statusPublished - 01-08-2019

Fingerprint

Image segmentation
Data mining
Blood
Cells
Classifiers
Flow cytometry
Platelets
Medicine
Feature extraction
Bone

All Science Journal Classification (ASJC) codes

  • Biomedical Engineering
  • Computer Science Applications

Cite this

@article{e075474c580349c48ca6bc2d71ea21a6,
title = "Detection of acute lymphoblastic leukemia using image segmentation and data mining algorithms",
abstract = "Blood is composed of white blood cells, red blood cells, and platelets. Segmentation of the blood smear cells and extraction of features of the cells is essential in the field of medicine. Acute lymphoblastic leukemia is a form of blood cancer caused due to the abnormal increase in the production of immature white blood cells in the bone marrow. It mostly affects the children below 5 years and adults above 50 years of age. Due to the late diagnosis and cost of the devices used for the determination, the mortality rate has increased drastically. Flow cytometry technique that performs automated counting fails to identify the abnormal cells. Manual recount performed using hemocytometer are prone to errors and are imprecise. The proposed work aims to survey different computer-aided system techniques used to segment the blood smear image. The primary objective here is to derive knowledge from the different methodologies used for extracting features from white blood cells and develop a system that would accurately segment the blood smear image by overcoming the drawbacks of the previous works. The objective mentioned above is achieved in two ways. Firstly, a novel algorithm is developed to segment the nucleus and cytoplasm of white blood cell. Secondly, a model is built to extract the features and train the model. The different supervised classifiers are compared, and the one with the highest accuracy is used for the classification. Six hundred images are used in the experimentation. InfoGainAttributeEval and the Ranker Search method are used to achieve the feature selection which in turn helps in improvising the classifier performance. The result shows the classification of the acute lymphoblastic leukemia into its three respective categories namely: ALL-L1, ALL-L2, ALL-L3. The model can differentiate between a normal peripheral blood smear and an abnormal blood smear. The extracted feature values of a cancerous cell and a normal cell are also shown. The performance of the model is evaluated using the test images stained with various stains. The proposed algorithm achieved an overall accuracy of 98.6{\%}. The promising results show that it can be used as a diagnostic tool by the pathologists. [Figure not available: see fulltext.].",
author = "Vasundhara Acharya and Preetham Kumar",
year = "2019",
month = "8",
day = "1",
doi = "10.1007/s11517-019-01984-1",
language = "English",
volume = "57",
pages = "1783--1811",
journal = "Medical and Biological Engineering and Computing",
issn = "0140-0118",
publisher = "Springer Verlag",
number = "8",

}

Detection of acute lymphoblastic leukemia using image segmentation and data mining algorithms. / Acharya, Vasundhara; Kumar, Preetham.

In: Medical and Biological Engineering and Computing, Vol. 57, No. 8, 01.08.2019, p. 1783-1811.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Detection of acute lymphoblastic leukemia using image segmentation and data mining algorithms

AU - Acharya, Vasundhara

AU - Kumar, Preetham

PY - 2019/8/1

Y1 - 2019/8/1

N2 - Blood is composed of white blood cells, red blood cells, and platelets. Segmentation of the blood smear cells and extraction of features of the cells is essential in the field of medicine. Acute lymphoblastic leukemia is a form of blood cancer caused due to the abnormal increase in the production of immature white blood cells in the bone marrow. It mostly affects the children below 5 years and adults above 50 years of age. Due to the late diagnosis and cost of the devices used for the determination, the mortality rate has increased drastically. Flow cytometry technique that performs automated counting fails to identify the abnormal cells. Manual recount performed using hemocytometer are prone to errors and are imprecise. The proposed work aims to survey different computer-aided system techniques used to segment the blood smear image. The primary objective here is to derive knowledge from the different methodologies used for extracting features from white blood cells and develop a system that would accurately segment the blood smear image by overcoming the drawbacks of the previous works. The objective mentioned above is achieved in two ways. Firstly, a novel algorithm is developed to segment the nucleus and cytoplasm of white blood cell. Secondly, a model is built to extract the features and train the model. The different supervised classifiers are compared, and the one with the highest accuracy is used for the classification. Six hundred images are used in the experimentation. InfoGainAttributeEval and the Ranker Search method are used to achieve the feature selection which in turn helps in improvising the classifier performance. The result shows the classification of the acute lymphoblastic leukemia into its three respective categories namely: ALL-L1, ALL-L2, ALL-L3. The model can differentiate between a normal peripheral blood smear and an abnormal blood smear. The extracted feature values of a cancerous cell and a normal cell are also shown. The performance of the model is evaluated using the test images stained with various stains. The proposed algorithm achieved an overall accuracy of 98.6%. The promising results show that it can be used as a diagnostic tool by the pathologists. [Figure not available: see fulltext.].

AB - Blood is composed of white blood cells, red blood cells, and platelets. Segmentation of the blood smear cells and extraction of features of the cells is essential in the field of medicine. Acute lymphoblastic leukemia is a form of blood cancer caused due to the abnormal increase in the production of immature white blood cells in the bone marrow. It mostly affects the children below 5 years and adults above 50 years of age. Due to the late diagnosis and cost of the devices used for the determination, the mortality rate has increased drastically. Flow cytometry technique that performs automated counting fails to identify the abnormal cells. Manual recount performed using hemocytometer are prone to errors and are imprecise. The proposed work aims to survey different computer-aided system techniques used to segment the blood smear image. The primary objective here is to derive knowledge from the different methodologies used for extracting features from white blood cells and develop a system that would accurately segment the blood smear image by overcoming the drawbacks of the previous works. The objective mentioned above is achieved in two ways. Firstly, a novel algorithm is developed to segment the nucleus and cytoplasm of white blood cell. Secondly, a model is built to extract the features and train the model. The different supervised classifiers are compared, and the one with the highest accuracy is used for the classification. Six hundred images are used in the experimentation. InfoGainAttributeEval and the Ranker Search method are used to achieve the feature selection which in turn helps in improvising the classifier performance. The result shows the classification of the acute lymphoblastic leukemia into its three respective categories namely: ALL-L1, ALL-L2, ALL-L3. The model can differentiate between a normal peripheral blood smear and an abnormal blood smear. The extracted feature values of a cancerous cell and a normal cell are also shown. The performance of the model is evaluated using the test images stained with various stains. The proposed algorithm achieved an overall accuracy of 98.6%. The promising results show that it can be used as a diagnostic tool by the pathologists. [Figure not available: see fulltext.].

UR - http://www.scopus.com/inward/record.url?scp=85068131227&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85068131227&partnerID=8YFLogxK

U2 - 10.1007/s11517-019-01984-1

DO - 10.1007/s11517-019-01984-1

M3 - Article

VL - 57

SP - 1783

EP - 1811

JO - Medical and Biological Engineering and Computing

JF - Medical and Biological Engineering and Computing

SN - 0140-0118

IS - 8

ER -