Enhancing the CIC IoT Dataset 2023 for Improved Attack Detection through GANs Augmentation and Federated Learning
- 1 Information Technology Department, College of Computing and Informatics, Saudi Electronic University, Riyadh, Saudi Arabia
Abstract
The escalating frequency and sophistication of cyber-attacks on Internet of Things (IoT) devices present a pressing challenge to cybersecurity. With IoT device connections projected to exceed 42 billion by 2025, the vulnerability of these devices to cyber-attacks has never been more evident. This paper investigates the integration of Machine Learning (ML) and data augmentation, specifically Generative Adversarial Networks (GAN) and Federated Learning (FL), as innovative measures to fortify IoT security. The study aims to balance the CIC IoT Dataset 2023 using GAN-generated synthetic data and to enhance ML model performance through FL, with eXtreme Gradient Boosting (XGBoost) as the FL framework's backbone. The utilization of GAN for data augmentation addresses the persistent challenge of data imbalances in datasets. The comparison between the FL and traditional approaches in IoT security analytics reveals distinctad vantages of FL, particularly in data privacy, scalability, and handling imbalanced data. While FL consistently delivers high accuracy, precision, recall, and F1-scores, the traditional approach varies more, often requiring additional data balancing and model tuning.
DOI: https://doi.org/10.3844/jcssp.2025.1688.1704
Copyright: © 2025 Shahad Alahmari and Noura Aleisa. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 37 Views
- 10 Downloads
- 0 Citations
Download
Keywords
- Internet of Things
- Privacy Preserving Mode
- Security
- Federated Learning
- Data Augmentation