Detection of malicious URLs using machine learning techniques

Immadisetti Naga Venkata Durga Naveen, K. Manamohana, Rohit Verma

Research output: Contribution to journalArticle

Abstract

The primitive usage of URL (Uniform Resource Locator) is to use as a Web Address. However, some URLs can also be used to host unsolicited content that can potentially result in cyber attacks. These URLs are called malicious URLs. The inability of the end user system to detect and remove the malicious URLs can put the legitimate user in vulnerable condition. Furthermore, usage of malicious URLs may lead to illegitimate access to the user data by adversary. The main motive for malicious URL detection is that they provide an attack surface to the adversary. It is vital to counter these activities via some new methodology. In literature, there have been many filtering mechanisms to detect the malicious URLs. Some of them are Black-Listing, Heuristic Classification etc. These traditional mechanisms rely on keyword matching and URL syntax matching. Therefore, these conventional mechanisms cannot effectively deal with the ever evolving technologies and web-access techniques. Furthermore, these approaches also fall short in detecting the modern URLs such as short URLs, dark web URLs. In this paper, we propose a novel classification method to address the challenges faced by the traditional mechanisms in malicious URL detection. The proposed classification model is built on sophisticated machine learning methods that not only takes care about the syntactical nature of the URL, but also the semantic and lexical meaning of these dynamically changing URLs. The proposed approach is expected to outperform the existing techniques.

Original languageEnglish
Pages (from-to)389-393
Number of pages5
JournalInternational Journal of Innovative Technology and Exploring Engineering
Volume8
Issue number4S2
Publication statusPublished - 01-01-2019
Externally publishedYes

Fingerprint

Learning systems
Websites
World Wide Web
Semantics

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering
  • Civil and Structural Engineering
  • Mechanics of Materials
  • Computer Science(all)

Cite this

Naveen, Immadisetti Naga Venkata Durga ; Manamohana, K. ; Verma, Rohit. / Detection of malicious URLs using machine learning techniques. In: International Journal of Innovative Technology and Exploring Engineering. 2019 ; Vol. 8, No. 4S2. pp. 389-393.
@article{758d66e2943b4ebca4134e9e5370e1dc,
title = "Detection of malicious URLs using machine learning techniques",
abstract = "The primitive usage of URL (Uniform Resource Locator) is to use as a Web Address. However, some URLs can also be used to host unsolicited content that can potentially result in cyber attacks. These URLs are called malicious URLs. The inability of the end user system to detect and remove the malicious URLs can put the legitimate user in vulnerable condition. Furthermore, usage of malicious URLs may lead to illegitimate access to the user data by adversary. The main motive for malicious URL detection is that they provide an attack surface to the adversary. It is vital to counter these activities via some new methodology. In literature, there have been many filtering mechanisms to detect the malicious URLs. Some of them are Black-Listing, Heuristic Classification etc. These traditional mechanisms rely on keyword matching and URL syntax matching. Therefore, these conventional mechanisms cannot effectively deal with the ever evolving technologies and web-access techniques. Furthermore, these approaches also fall short in detecting the modern URLs such as short URLs, dark web URLs. In this paper, we propose a novel classification method to address the challenges faced by the traditional mechanisms in malicious URL detection. The proposed classification model is built on sophisticated machine learning methods that not only takes care about the syntactical nature of the URL, but also the semantic and lexical meaning of these dynamically changing URLs. The proposed approach is expected to outperform the existing techniques.",
author = "Naveen, {Immadisetti Naga Venkata Durga} and K. Manamohana and Rohit Verma",
year = "2019",
month = "1",
day = "1",
language = "English",
volume = "8",
pages = "389--393",
journal = "International Journal of Innovative Technology and Exploring Engineering",
issn = "2278-3075",
publisher = "Blue Eyes Intelligence Engineering and Sciences Publication",
number = "4S2",

}

Detection of malicious URLs using machine learning techniques. / Naveen, Immadisetti Naga Venkata Durga; Manamohana, K.; Verma, Rohit.

In: International Journal of Innovative Technology and Exploring Engineering, Vol. 8, No. 4S2, 01.01.2019, p. 389-393.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Detection of malicious URLs using machine learning techniques

AU - Naveen, Immadisetti Naga Venkata Durga

AU - Manamohana, K.

AU - Verma, Rohit

PY - 2019/1/1

Y1 - 2019/1/1

N2 - The primitive usage of URL (Uniform Resource Locator) is to use as a Web Address. However, some URLs can also be used to host unsolicited content that can potentially result in cyber attacks. These URLs are called malicious URLs. The inability of the end user system to detect and remove the malicious URLs can put the legitimate user in vulnerable condition. Furthermore, usage of malicious URLs may lead to illegitimate access to the user data by adversary. The main motive for malicious URL detection is that they provide an attack surface to the adversary. It is vital to counter these activities via some new methodology. In literature, there have been many filtering mechanisms to detect the malicious URLs. Some of them are Black-Listing, Heuristic Classification etc. These traditional mechanisms rely on keyword matching and URL syntax matching. Therefore, these conventional mechanisms cannot effectively deal with the ever evolving technologies and web-access techniques. Furthermore, these approaches also fall short in detecting the modern URLs such as short URLs, dark web URLs. In this paper, we propose a novel classification method to address the challenges faced by the traditional mechanisms in malicious URL detection. The proposed classification model is built on sophisticated machine learning methods that not only takes care about the syntactical nature of the URL, but also the semantic and lexical meaning of these dynamically changing URLs. The proposed approach is expected to outperform the existing techniques.

AB - The primitive usage of URL (Uniform Resource Locator) is to use as a Web Address. However, some URLs can also be used to host unsolicited content that can potentially result in cyber attacks. These URLs are called malicious URLs. The inability of the end user system to detect and remove the malicious URLs can put the legitimate user in vulnerable condition. Furthermore, usage of malicious URLs may lead to illegitimate access to the user data by adversary. The main motive for malicious URL detection is that they provide an attack surface to the adversary. It is vital to counter these activities via some new methodology. In literature, there have been many filtering mechanisms to detect the malicious URLs. Some of them are Black-Listing, Heuristic Classification etc. These traditional mechanisms rely on keyword matching and URL syntax matching. Therefore, these conventional mechanisms cannot effectively deal with the ever evolving technologies and web-access techniques. Furthermore, these approaches also fall short in detecting the modern URLs such as short URLs, dark web URLs. In this paper, we propose a novel classification method to address the challenges faced by the traditional mechanisms in malicious URL detection. The proposed classification model is built on sophisticated machine learning methods that not only takes care about the syntactical nature of the URL, but also the semantic and lexical meaning of these dynamically changing URLs. The proposed approach is expected to outperform the existing techniques.

UR - http://www.scopus.com/inward/record.url?scp=85063340966&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063340966&partnerID=8YFLogxK

M3 - Article

VL - 8

SP - 389

EP - 393

JO - International Journal of Innovative Technology and Exploring Engineering

JF - International Journal of Innovative Technology and Exploring Engineering

SN - 2278-3075

IS - 4S2

ER -