Advanced text documents information retrieval system for search services

Chiranjeevi H S, Manjula K. Shenoy

Research output: Contribution to journal › Article › peer-review

Abstract

Advances in information technology have driven rapid growth of text document data in many organizations, and organizing such voluminous data is a complex task. Handling text document data is challenging: it involves not only training models but also numerous additional procedures, e.g., data pre-processing, transformation, and dimensionality reduction. In this paper, we describe the system’s architecture, the technical challenges, and the novel solution we have built. We propose a Recurrent Convolutional Neural Network (RCNN)-based text information retrieval system that efficiently retrieves text documents and information in response to a user query. The system implements pre-processing using tokenization and stemming, retrieval using TF-IDF (Term Frequency-Inverse Document Frequency), and an RCNN classifier that captures contextual information. A real-time advanced search system is developed on a large MAHE University dataset. The performance of the proposed text document retrieval system is compared with that of existing algorithms, and the efficacy of the method is discussed. The proposed RCNN-based model performs better in terms of precision, recall, and F-measure. The result is a high-quality, high-performance text document retrieval search system.
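
The pre-processing and TF-IDF retrieval stages named in the abstract, together with the precision, recall, and F-measure calculation, can be illustrated with a minimal sketch. This is not the authors' implementation: the suffix-stripping `stem` function is a crude stand-in for a real stemmer, the smoothed IDF formula and cosine ranking are assumed choices, the sample documents in `DOCS` are invented for illustration, and the RCNN classifier stage is not covered.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus; the MAHE University dataset is not reproduced here.
DOCS = [
    "Admission rules for engineering courses",
    "Library opening hours and borrowing rules",
    "Hostel fees and admission deadlines",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def stem(token):
    """Strip one common suffix (assumed stand-in for a real stemmer)."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, then stem every token."""
    return [stem(t) for t in tokenize(text)]


def tfidf_vector(tokens, idf):
    """Map each known term to (relative term frequency) * IDF."""
    counts = Counter(tokens)
    total = len(tokens) or 1
    return {t: (c / total) * idf[t] for t, c in counts.items() if t in idf}


def build_index(docs):
    """Return one TF-IDF vector per document plus the IDF table."""
    tokenized = [preprocess(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF (an assumed variant): log((1 + N) / (1 + df)) + 1
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [tfidf_vector(toks, idf) for toks in tokenized], idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, vectors, idf, top_k=3):
    """Return indices of the best-matching documents, best first."""
    q = tfidf_vector(preprocess(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for score, i in scored[:top_k] if score > 0.0]


def precision_recall_f1(retrieved, relevant):
    """Compute precision, recall, and F-measure for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    vectors, idf = build_index(DOCS)
    ranked = search("admission deadlines", vectors, idf)
    print(ranked)
    print(precision_recall_f1(ranked, relevant=[2]))
```

In this sketch a query is pre-processed with the same functions as the documents, so stemmed query terms line up with stemmed index terms. Documents with zero similarity are dropped from the result list rather than ranked last.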

Original language: English
Article number: 1856467
Journal: Cogent Engineering
Volume: 7
Issue number: 1
DOIs
Publication status: Published - 2020

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
  • Chemical Engineering(all)
  • Engineering(all)
