Big Data-Driven Phishing Detection in Smart Devices Using Chi-Square and Optimized Gradient Boosting

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Phishing attacks are a major cybersecurity threat, especially in smart devices, where attackers exploit vulnerabilities to steal sensitive information. As the complexity of phishing techniques grows, the need for robust detection methods becomes critical. This paper presents a Big Data based model to identify phishing in smart devices. Following PySpark for data preprocessing and Chi-Square feature selection, the suggested model optimizes a Gradient Boosting model using the Probabilistic Bees Algorithm (BeesA). With high accuracy, recall, and F1-score, the model was assessed on a dataset including more than 11,000 webpages. Comparative study with conventional classifiers like Random Forest, SVM, and Naive Bayes shows the better performance of the suggested model. The results show how well integrating Big Data methods with sophisticated optimization algorithms improves phishing detection in smart device scenarios.

Original languageEnglish
Title of host publicationUbi-Media Computing, Pervasive Systems, Algorithms and Networks - 13th International Conference, Ubi-Media 2025, and 17th International Symposium, I-SPAN 2025, Proceedings
EditorsLin Hui, Ching-Hsien Hsu, Somchoke Ruengittinun
Pages204-217
Number of pages14
DOIs
Publication statusPublished - 2026
Event13th International Conference on Ubi-Media Computing, Ubi-Media 2025 and 17th International Symposium on Pervasive Systems, Algorithms, and Networks, I-SPAN 2025 - Bangkok, Thailand
Duration: 19 Jan 202523 Jan 2025

Publication series

NameCommunications in Computer and Information Science
Volume2379 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference13th International Conference on Ubi-Media Computing, Ubi-Media 2025 and 17th International Symposium on Pervasive Systems, Algorithms, and Networks, I-SPAN 2025
Country/TerritoryThailand
CityBangkok
Period19/01/2523/01/25

Keywords

  • Big Data
  • Gradient Boosting
  • Phishing Detection
  • Probabilistic Bees Algorithm
  • Smart Devices

Fingerprint

Dive into the research topics of 'Big Data-Driven Phishing Detection in Smart Devices Using Chi-Square and Optimized Gradient Boosting'. Together they form a unique fingerprint.

Cite this