Comparative evaluation of machine learning algorithms for phishing site detection.

Researchers

Mohammed Alshehri, Mohd Anul Haq, Noura Fahad Almujahid

Models

Convolutional Neural Network (CNN), Decision Tree (DT), Deep Learning (DL), Extreme Gradient Boosting (XGBoost), k-Nearest Neighbors (KNN), Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM)

Abstract

The advent of Internet technologies has led to the proliferation of electronic trading and online transactions, and with it a rise in unauthorized access to sensitive user information and the depletion of enterprise resources. As a consequence, there has been a marked increase in phishing, which is now considered one of the most common types of online theft. Phishing attacks typically aim to obtain confidential information, such as login credentials for online banking platforms and sensitive systems, which attackers then use for financial gain or identity theft. Recent studies have sought to combat phishing attacks by examining characteristics such as website addresses, website content, and combinations of both that draw on the website and its source code. However, businesses require more effective anti-phishing technologies to identify phishing URLs and safeguard their users. The present research evaluates the effectiveness of eight machine learning (ML) and deep learning (DL) algorithms in identifying phishing: support vector machine (SVM), k-nearest neighbors (KNN), random forest (RF), decision tree (DT), Extreme Gradient Boosting (XGBoost), logistic regression (LR), convolutional neural network (CNN), and a DL model. The study uses two real datasets, Mendeley and UCI, and reports accuracy, precision, recall, false positive rate (FPR), and F1 score. Notably, the CNN exhibits superior accuracy, underscoring its efficacy. Contributions include using purpose-specific datasets, meticulous feature engineering, introducing SMOTE to address class imbalance, incorporating the novel CNN model, and rigorous hyperparameter tuning. The study demonstrates consistent model performance across both datasets, highlighting stability and reliability.
© 2024 Almujahid et al.
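
The five metrics named in the abstract can all be derived from the four cells of a binary confusion matrix. The following is a minimal sketch of the standard definitions, assuming "phishing" is the positive class; the function name `binary_metrics` and the example counts are hypothetical and are not taken from the study.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts.

    tp/fp/tn/fn are counts of true positives, false positives,
    true negatives, and false negatives (positive = phishing, by assumption).
    """
    def safe_div(num, den):
        # Return 0.0 instead of raising when a denominator is zero.
        return num / den if den else 0.0

    precision = safe_div(tp, tp + fp)
    recall = safe_div(tp, tp + fn)
    return {
        "accuracy": safe_div(tp + tn, tp + fp + tn + fn),
        "precision": precision,
        "recall": recall,
        "fpr": safe_div(fp, fp + tn),  # share of legitimate sites flagged as phishing
        "f1": safe_div(2 * precision * recall, precision + recall),
    }


# Hypothetical counts for illustration only, not results from the paper.
metrics = binary_metrics(tp=90, fp=5, tn=95, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Reporting FPR alongside accuracy matters in this setting because a detector that blocks legitimate sites carries a cost that accuracy alone does not reveal.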
