Reformulation of natural language queries on source code base using NLP techniques

B. P. Swathi, R. Anju

Research output: Contribution to journal (Article)

Abstract

Source code retrieval is a branch of text retrieval that helps a developer find a piece of code in a code base by issuing a query against it. A developer who has worked on the code base for a long time generally knows how to formulate a query that returns good search results. A developer who is new to the code base does not know which terms to include in the query. A retrieval system should therefore allow developers to issue natural language queries. This creates a need for query reformulation, which optimizes the developer's query when it does not contain terms from the code base. This work presents an extensive study of the areas where natural language queries are applied and of the various reformulation techniques, and it applies a semantic query reformulation technique to natural language queries on a source code base. Our discussion and results show how a query term that is not present in the source code base can be replaced by a word that is both semantically appropriate and in the context of the source code.
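The replacement idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the synonym table, the identifier list, and the function names `split_identifier`, `build_vocabulary`, and `reformulate` are all hypothetical, and a real system would presumably draw related words from a lexical resource rather than a hand-written dictionary. The sketch only shows the general shape: build a vocabulary from code identifiers, then swap each query term missing from that vocabulary for a related word that does occur in it.

```python
import re

# Hypothetical, hand-written synonym table (an assumption for illustration;
# it stands in for whatever semantic resource a real system would use).
SYNONYMS = {
    "remove": {"delete", "erase", "drop"},
    "customer": {"client", "user"},
    "locate": {"find", "search"},
}


def split_identifier(name):
    """Split a camelCase or snake_case identifier into lowercase words."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", name)
    return [part.lower() for part in parts]


def build_vocabulary(identifiers):
    """Collect every word that occurs inside the given code identifiers."""
    vocab = set()
    for identifier in identifiers:
        vocab.update(split_identifier(identifier))
    return vocab


def reformulate(query, vocab, synonyms=SYNONYMS):
    """Replace query terms absent from the code vocabulary.

    A term already in the vocabulary is kept. Otherwise the first related
    word (alphabetically, an arbitrary tie-break) that appears in the
    vocabulary is used; if none exists, the term is left unchanged.
    """
    result = []
    for term in query.lower().split():
        if term in vocab:
            result.append(term)
            continue
        candidates = sorted(synonyms.get(term, set()) & vocab)
        result.append(candidates[0] if candidates else term)
    return " ".join(result)


# Hypothetical identifiers standing in for a real code base.
identifiers = ["deleteUserAccount", "find_by_id", "OrderRepository"]
vocab = build_vocabulary(identifiers)
print(reformulate("remove customer account", vocab))  # delete user account
```

Restricting candidates to words found in the identifiers is what keeps the replacement "in the context of the source code"; the tie-break and the absence of stop-word handling are simplifications of this sketch.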

Original language: English
Pages (from-to): 3047-3052
Number of pages: 6
Journal: Compusoft
Volume: 8
Issue number: 2
Publication status: Published - 01-01-2019
Externally published: Yes

Fingerprint

Query languages
Semantics

All Science Journal Classification (ASJC) codes

  • Computer Science (all)

Cite this

Swathi, B. P. ; Anju, R. / Reformulation of natural language queries on source code base using NLP techniques. In: Compusoft. 2019 ; Vol. 8, No. 2. pp. 3047-3052.
@article{ed9858b74fb142e4870c2cdab5295662,
title = "Reformulation of natural language queries on source code base using NLP techniques",
author = "Swathi, {B. P.} and Anju, R.",
year = "2019",
month = "1",
day = "1",
language = "English",
volume = "8",
pages = "3047--3052",
journal = "Compusoft",
issn = "2320-0790",
publisher = "National Institute of Science Communication and Information Resources (NISCAIR)",
number = "2",
}

Swathi, BP & Anju, R 2019, 'Reformulation of natural language queries on source code base using NLP techniques', Compusoft, vol. 8, no. 2, pp. 3047-3052.

TY  - JOUR
T1  - Reformulation of natural language queries on source code base using NLP techniques
AU  - Swathi, B. P.
AU  - Anju, R.
PY  - 2019/1/1
Y1  - 2019/1/1
UR  - http://www.scopus.com/inward/record.url?scp=85063726676&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=85063726676&partnerID=8YFLogxK
M3  - Article
AN  - SCOPUS:85063726676
VL  - 8
SP  - 3047
EP  - 3052
JO  - Compusoft
JF  - Compusoft
SN  - 2320-0790
IS  - 2
ER  -