Phishing webpage detection using weighted URL tokens for identity keywords retrieval

Tan, Choon Lin and Chiew, Kang Leng and Sze, San Nah (2017) Phishing webpage detection using weighted URL tokens for identity keywords retrieval. Lecture Notes in Electrical Engineering, 398. pp. 133-139. ISSN 18761100

Phishing is a form of online identity theft that has threatened Internet users for more than a decade. This paper proposes an anti-phishing technique based on a weighted URL token system, which extracts identity keywords from a query webpage. Using the identity keywords as search terms, a search engine is invoked to pinpoint the target domain name, which is then used to determine the legitimacy of the query webpage. Experiments were conducted over 1000 datasets, achieving 99.20% true positives and 92.20% true negatives. The results suggest that the proposed system can detect phishing webpages effectively without relying on conventional language-dependent keyword extraction algorithms.
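
The pipeline described in the abstract (tokenise the URL, weight the tokens, use the top-weighted tokens as identity keywords, query a search engine, compare domains) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the weighting scheme, so the sketch assumes a token's weight is its frequency in the page text. The stop list `GENERIC_TOKENS`, the parameter `top_n`, the treatment of pages with no keywords, and the caller-supplied `search` function are all hypothetical.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Hypothetical stop list of generic URL tokens; not taken from the paper.
GENERIC_TOKENS = {"www", "http", "https", "com", "net", "org",
                  "html", "htm", "php", "index", "login"}


def url_tokens(url):
    """Split a URL into lowercase alphabetic tokens, dropping generic ones."""
    tokens = re.findall(r"[a-z]+", url.lower())
    return [t for t in tokens if len(t) > 2 and t not in GENERIC_TOKENS]


def identity_keywords(url, page_text, top_n=2):
    """Rank URL tokens by an assumed weight: their frequency in the page text."""
    word_counts = Counter(re.findall(r"[a-z]+", page_text.lower()))
    weights = {t: word_counts[t] for t in set(url_tokens(url))}
    # Highest weight first; ties broken alphabetically for determinism.
    ranked = sorted(weights, key=lambda t: (-weights[t], t))
    return [t for t in ranked if weights[t] > 0][:top_n]


def is_phishing(url, page_text, search):
    """Return True if the query host matches none of the domains found by search.

    `search` is a caller-supplied function mapping a list of keywords to a
    list of result domains (a stand-in for a real search engine).
    Returns None when no identity keywords can be extracted (an assumption).
    """
    keywords = identity_keywords(url, page_text)
    if not keywords:
        return None
    host = urlparse(url).hostname or ""
    for domain in search(keywords):
        if host == domain or host.endswith("." + domain):
            return False
    return True


if __name__ == "__main__":
    page = "Welcome to PayPal. Log in to your PayPal account."
    fake_search = lambda keywords: ["paypal.com"]  # stubbed search results
    suspect = "http://secure-paypal.example-host.com/paypal/login.php"
    print(identity_keywords(suspect, page))
    print(is_phishing(suspect, page, fake_search))
    print(is_phishing("https://www.paypal.com/signin", page, fake_search))
```

Because the keywords come from URL tokens rather than from a linguistic analysis of the page, the sketch needs no language-specific resources, which is the property the abstract highlights. A real system would replace `fake_search` with an actual search engine query.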

Item Type: Article
Uncontrolled Keywords: Identity keywords; Keywords retrieval; Phishing detection; Search engine; Weighted URL tokens
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Ibrahim
Date Deposited: 24 Jan 2017 01:06
Last Modified: 17 Aug 2020 17:53
