Abstract
Phishing is the largest network security issue among global cybercrimes in 2022. Its frequency of occurrence has maintained rapid growth and has become one of the most important network security issues. In the state of the art of this research field, there was a trade-off between high-precision discriminant models and huge consumption of computing resources. Therefore, the research purpose of this article is mainly to balance the relationship between accuracy and computing resources (performance) to achieve accuracy and computing efficiency at the same time. This article uses principal component analysis (PCA) as a tool, uses its excellent dimensionality reduction ability to process sample data, compresses the original feature set, and then uses different machine learning models to conduct experiments. In the end, the random forest model after PCA achieved a discrimination accuracy of 97.157% with a performance improvement of 25.1%, effectively achieving a win-win balance between accuracy and performance.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 397-403 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798350308693 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 - Jiangsu, China Duration: 2 Nov 2023 → 4 Nov 2023 |
Publication series
| Name | Proceedings - 2023 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 |
|---|
Conference
| Conference | 15th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2023 |
|---|---|
| Country/Territory | China |
| City | Jiangsu |
| Period | 2/11/23 → 4/11/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 16 Peace, Justice and Strong Institutions
Keywords
- Cyber Crime
- Machine Learning
- PCA
- Phishing Detection
- Random Forest
Fingerprint
Dive into the research topics of 'Classification and Identification of Phishing Websites based on Machine Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver