Optimizing Phishing Detection with Advanced Feature Vectorization and Supervised Machine Learning Techniques

EasyChair Preprint 15168
14 pages • Date: September 29, 2024

Abstract

Phishing attacks are among the most common cybersecurity threats, taking advantage of users' trust to gain access to sensitive information. Detecting these attacks effectively is essential for protecting both individuals and organizations. This study focuses on improving phishing detection by developing an optimized framework for feature vectorization, combined with supervised machine learning techniques. By carefully selecting and designing features from email and website data, the goal is to enhance the accuracy of identifying phishing attempts. The analysis includes various text-based, URL-based, and metadata features, emphasizing their role in improving classification performance. Machine learning models such as Support Vector Machines (SVM), Random Forest, and Gradient Boosting are trained and tested on a dataset of legitimate and phishing samples. The study also examines the impact of feature scaling, selection, and dimensionality reduction methods like Principal Component Analysis (PCA) to determine which factors most effectively boost detection accuracy. Experimental findings show that an optimized feature set, combined with strong machine learning algorithms, greatly enhances phishing detection rates while reducing false positives. This approach highlights the potential for reliable, automated phishing detection systems, contributing to stronger cybersecurity defenses.

Keyphrases: Feature Vectorization, Phishing Detection, Supervised Machine Learning
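
As a rough illustration of the URL-based feature vectorization and feature scaling that the abstract refers to, the following minimal sketch turns a URL into a small numeric vector and min-max scales a batch of such vectors. The preprint page does not list its actual features, so the feature set, the names `FEATURE_NAMES`, `url_to_vector`, and `min_max_scale`, and the sample URLs are all assumptions made for illustration, not the authors' method.

```python
import ipaddress
from typing import List
from urllib.parse import urlparse

# Hypothetical feature set: the preprint does not enumerate its features.
FEATURE_NAMES = [
    "url_length", "num_dots", "num_hyphens",
    "has_at_symbol", "host_is_ip", "uses_https",
]


def _is_ip(host: str) -> bool:
    """Return True when the host is a literal IPv4/IPv6 address."""
    try:
        ipaddress.ip_address(host)
        return True
    except ValueError:
        return False


def url_to_vector(url: str) -> List[float]:
    """Map a URL to a numeric vector ordered as in FEATURE_NAMES."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return [
        float(len(url)),                  # overall URL length
        float(host.count(".")),           # dots in the host (subdomain depth)
        float(host.count("-")),           # hyphens in the host
        float("@" in url),                # '@' anywhere in the URL
        float(_is_ip(host)),              # raw IP address used as host
        float(parsed.scheme == "https"),  # HTTPS scheme
    ]


def min_max_scale(rows: List[List[float]]) -> List[List[float]]:
    """Scale each column to [0, 1]; constant columns become 0.0."""
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


if __name__ == "__main__":
    urls = ["http://192.168.0.1/login", "https://example.com"]
    vectors = [url_to_vector(u) for u in urls]
    for name, column in zip(FEATURE_NAMES, zip(*min_max_scale(vectors))):
        print(name, column)
```

Vectors of this kind could then be passed to a supervised classifier such as an SVM, Random Forest, or Gradient Boosting model. Which features and scaling method the study actually uses is not stated on this page.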