skip to main content
Language:
Search Limited to: Search Limited to: Resource type Show Results with: Show Results with: Search type Index

Phishing Detection System through Hybrid Machine Learning Based on URL

IEEE access, 2023-01, Vol.11, p.1-1 [Peer Reviewed Journal]

Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023 ;ISSN: 2169-3536 ;EISSN: 2169-3536 ;DOI: 10.1109/ACCESS.2023.3252366 ;CODEN: IAECCG

Full text available

Citations Cited by
  • Title:
    Phishing Detection System through Hybrid Machine Learning Based on URL
  • Author: Karim, Abdul ; Shahroz, Mobeen ; Mustofa, Khabib ; Belhaouari, Samir Brahim ; Joga, S Ramana Kumar
  • Subjects: Accuracy ; Algorithms ; Blocklists ; Classifiers ; Crime ; Cyber Security ; Cybercrime ; Datasets ; Decision trees ; Electronic mail ; Ensemble Classifier ; Evaluation ; Hybrid systems ; Internet ; Logistic regression ; Machine learning ; Optimization ; Optimization techniques ; Phishing ; Protocol ; Protocols ; Social networks ; support vector machine ; Support vector machine and Decision tree (LSD) ; Support vector machines ; Uniform resource locator (URL) ; Uniform resource locators ; Voting Classifier ; Websites
  • Is Part Of: IEEE access, 2023-01, Vol.11, p.1-1
  • Description: Currently, numerous types of cybercrime are organized through the internet. Hence, this study mainly focuses on phishing attacks. Although phishing was first used in 1996, it has become the most severe and dangerous cybercrime on the internet. Phishing utilizes email distortion as its underlying mechanism for tricky correspondences, followed by mock sites, to obtain the required data from people in question. Different studies have presented their work on the precaution, identification, and knowledge of phishing attacks; however, there is currently no complete and proper solution for frustrating them. Therefore, machine learning plays a vital role in defending against cybercrimes involving phishing attacks. The proposed study is based on the phishing URL-based dataset extracted from the famous dataset repository, which consists of phishing and legitimate URL attributes collected from 11000+ website datasets in vector form. After preprocessing, many machine learning algorithms have been applied and designed to prevent phishing URLs and provide protection to the user. This study uses machine learning models such as decision tree (DT), linear regression (LR), random forest (RF), naive Bayes (NB), gradient boosting classifier (GBM), K-neighbors classifier (KNN), support vector classifier (SVC), and proposed hybrid LSD model, which is a combination of logistic regression, support vector machine, and decision tree (LR+SVC+DT) with soft and hard voting, to defend against phishing attacks with high accuracy and efficiency. The canopy feature selection technique with cross fold valoidation and Grid Search Hyperparameter Optimization techniques are used with proposed LSD model. Furthermore, to evaluate the proposed approach, different evaluation parameters were adopted, such as the precision, accuracy, recall, F1-score, and specificity, to illustrate the effects and efficiency of the models. The results of the comparative analyses demonstrate that the proposed approach outperforms the other models and achieves the best results.
  • Publisher: Piscataway: IEEE
  • Language: English
  • Identifier: ISSN: 2169-3536
    EISSN: 2169-3536
    DOI: 10.1109/ACCESS.2023.3252366
    CODEN: IAECCG
  • Source: IEEE Xplore Open Access Journals
    DOAJ Directory of Open Access Journals

Searching Remote Databases, Please Wait