Disease detection and identification using sequence data and information retrieval methods

Sankranti Joshi, Pai M. Radhika, Pai M.M. Manohara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Current clinical methods base disease detection and identification heavily on the description of symptoms by the patient. This leads to inaccuracy because of the errors that may arise in the quantification of the symptoms and also does not give a complete idea about the presence of any particular disease. The prediction of cellular diseases is still more challenging; for we have no measure on the exact quantity, quality and extremeness. The typical symptoms for these diseases are visible at a later stage allowing the disease to silently progress. This paper provides an efficient and novel way of detection and identification of pancreatitis and breast cancer using a combination of sequence data and information retrieval algorithms to provide the most accurate result. The developed system maintains a knowledge base of the mutations of the diseases causing breast cancer and pancreatitis and thus uses techniques of protein sequence scoring and information retrieval for providing the best match of patient protein sequence with the mutations stored. The system has been tested with mutations available online and gives 98 % accurate results.

Original languageEnglish
Title of host publicationProceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015
PublisherSpringer Science and Business Media Deutschland GmbH
Pages565-572
Number of pages8
Volume43
ISBN (Print)9788132225379
DOIs
Publication statusPublished - 2016
Event3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015 - Bhubaneshwar, India
Duration: 23-06-201525-06-2015

Publication series

NameSmart Innovation, Systems and Technologies
Volume43
ISSN (Print)2190-3018
ISSN (Electronic)2190-3026

Conference

Conference3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015
CountryIndia
CityBhubaneshwar
Period23-06-1525-06-15

Fingerprint

Information retrieval
Proteins
Mutation

All Science Journal Classification (ASJC) codes

  • Decision Sciences(all)
  • Computer Science(all)

Cite this

Joshi, S., Radhika, P. M., & Manohara, P. M. M. (2016). Disease detection and identification using sequence data and information retrieval methods. In Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015 (Vol. 43, pp. 565-572). (Smart Innovation, Systems and Technologies; Vol. 43). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-81-322-2538-6_58
Joshi, Sankranti ; Radhika, Pai M. ; Manohara, Pai M.M. / Disease detection and identification using sequence data and information retrieval methods. Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015. Vol. 43 Springer Science and Business Media Deutschland GmbH, 2016. pp. 565-572 (Smart Innovation, Systems and Technologies).
@inproceedings{96e4e53f787643d0946d9004f7d1650a,
title = "Disease detection and identification using sequence data and information retrieval methods",
abstract = "Current clinical methods base disease detection and identification heavily on the description of symptoms by the patient. This leads to inaccuracy because of the errors that may arise in the quantification of the symptoms and also does not give a complete idea about the presence of any particular disease. The prediction of cellular diseases is still more challenging; for we have no measure on the exact quantity, quality and extremeness. The typical symptoms for these diseases are visible at a later stage allowing the disease to silently progress. This paper provides an efficient and novel way of detection and identification of pancreatitis and breast cancer using a combination of sequence data and information retrieval algorithms to provide the most accurate result. The developed system maintains a knowledge base of the mutations of the diseases causing breast cancer and pancreatitis and thus uses techniques of protein sequence scoring and information retrieval for providing the best match of patient protein sequence with the mutations stored. The system has been tested with mutations available online and gives 98 {\%} accurate results.",
author = "Sankranti Joshi and Radhika, {Pai M.} and Manohara, {Pai M.M.}",
year = "2016",
doi = "10.1007/978-81-322-2538-6_58",
language = "English",
isbn = "9788132225379",
volume = "43",
series = "Smart Innovation, Systems and Technologies",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "565--572",
booktitle = "Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015",
address = "Germany",

}

Joshi, S, Radhika, PM & Manohara, PMM 2016, Disease detection and identification using sequence data and information retrieval methods. in Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015. vol. 43, Smart Innovation, Systems and Technologies, vol. 43, Springer Science and Business Media Deutschland GmbH, pp. 565-572, 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015, Bhubaneshwar, India, 23-06-15. https://doi.org/10.1007/978-81-322-2538-6_58

Disease detection and identification using sequence data and information retrieval methods. / Joshi, Sankranti; Radhika, Pai M.; Manohara, Pai M.M.

Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015. Vol. 43 Springer Science and Business Media Deutschland GmbH, 2016. p. 565-572 (Smart Innovation, Systems and Technologies; Vol. 43).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Disease detection and identification using sequence data and information retrieval methods

AU - Joshi, Sankranti

AU - Radhika, Pai M.

AU - Manohara, Pai M.M.

PY - 2016

Y1 - 2016

N2 - Current clinical methods base disease detection and identification heavily on the description of symptoms by the patient. This leads to inaccuracy because of the errors that may arise in the quantification of the symptoms and also does not give a complete idea about the presence of any particular disease. The prediction of cellular diseases is still more challenging; for we have no measure on the exact quantity, quality and extremeness. The typical symptoms for these diseases are visible at a later stage allowing the disease to silently progress. This paper provides an efficient and novel way of detection and identification of pancreatitis and breast cancer using a combination of sequence data and information retrieval algorithms to provide the most accurate result. The developed system maintains a knowledge base of the mutations of the diseases causing breast cancer and pancreatitis and thus uses techniques of protein sequence scoring and information retrieval for providing the best match of patient protein sequence with the mutations stored. The system has been tested with mutations available online and gives 98 % accurate results.

AB - Current clinical methods base disease detection and identification heavily on the description of symptoms by the patient. This leads to inaccuracy because of the errors that may arise in the quantification of the symptoms and also does not give a complete idea about the presence of any particular disease. The prediction of cellular diseases is still more challenging; for we have no measure on the exact quantity, quality and extremeness. The typical symptoms for these diseases are visible at a later stage allowing the disease to silently progress. This paper provides an efficient and novel way of detection and identification of pancreatitis and breast cancer using a combination of sequence data and information retrieval algorithms to provide the most accurate result. The developed system maintains a knowledge base of the mutations of the diseases causing breast cancer and pancreatitis and thus uses techniques of protein sequence scoring and information retrieval for providing the best match of patient protein sequence with the mutations stored. The system has been tested with mutations available online and gives 98 % accurate results.

UR - http://www.scopus.com/inward/record.url?scp=84951753110&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84951753110&partnerID=8YFLogxK

U2 - 10.1007/978-81-322-2538-6_58

DO - 10.1007/978-81-322-2538-6_58

M3 - Conference contribution

SN - 9788132225379

VL - 43

T3 - Smart Innovation, Systems and Technologies

SP - 565

EP - 572

BT - Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015

PB - Springer Science and Business Media Deutschland GmbH

ER -

Joshi S, Radhika PM, Manohara PMM. Disease detection and identification using sequence data and information retrieval methods. In Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, ICACNI 2015. Vol. 43. Springer Science and Business Media Deutschland GmbH. 2016. p. 565-572. (Smart Innovation, Systems and Technologies). https://doi.org/10.1007/978-81-322-2538-6_58