Distributed denial of service attack detection using machine learning and class oversampling
- Authors: Shafin, Sakib , Prottoy, Shafin , Abbas, Saif , Hakim, Safayat , Chowdhury, Abdullahi , Rashid, Md Mamanur
- Date: 2021
- Type: Text , Conference paper
- Relation: First International Conference on Applied Intelligence and Informatics, AII 2021, Nottingham, UK, July 30-31, 2021 Vol. 1435, p. 247-259
- Full Text: false
- Reviewed:
- Description: Distributed Denial of Services (DDoS) attack, one of the most dangerous types of cyber attack, has been reported to increase during the COVID-19 pandemic. Machine learning techniques have been proposed in the literature to build models to detect DDoS attacks. Existing works in literature tested their models with old datasets where DDoS attacks are not specific. These works mainly focus on detecting the presence of an attack rather than the type of DDoS attacks. However, detection of the attack type is vital for the review and analysis of enterprise-level security policy. Cyber-attacks are inherently an imbalanced data problem, but none of the models treated DDoS attack detection from this perspective. In this work, we present a machine learning model that takes the imbalance nature of the DDoS attack data into consideration for both presence/absence and the type of DDoS attack detection. Extensive experiment analysis with the recent and DDoS attack-specific dataset shows that the proposed technique can effectively identify DDoS attacks. © 2021, Springer Nature Switzerland AG.
Detection of android malware using tree-based ensemble stacking model
- Authors: Shafin, Sakib , Ahmed, Md Maroof , Pranto, Mahmud , Chowdhury, Abdullahi
- Date: 2021
- Type: Text , Conference paper
- Relation: 2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2021, Brisbane, 8-10 December 2021
- Full Text: false
- Reviewed:
- Description: Increasing use of smartphones for everyday activities from banking, education to social networking is putting our personal information at risk as smartphone operating systems and applications are vulnerable to various types of attacks including malware attack. To this end Android operating system is particularly targeted as it is the most widely used mobile operating system. Building a robust detection system that can provide protection against recent attacks and can deliver not only accurate detection but also the type of the attack in order to protect the system is vital. In this study, we propose a twolayer Machine Learning detection model based on Ensemble Learning and Stacked Generalization method to accurately predict and classify the growing attacks on Android smartphones. We evaluated the proposed model on a very recent dataset, named CIC-Maldroid-2020, which contains 11,598 samples with various malicious attack types. The performance of our proposed model was evaluated on widely used metrics, like accuracy, precision, recall & F1-score. It outperforms previous studies done on the same dataset and achieves an accuracy of 99.49% in classifying each attack type. © IEEE 2022.
Churn prediction in telecom industry using machine learning ensembles with class balancing
- Authors: Chowdhury, Abdullahi , Kaisar, Shahriar , Rashid, Md Mamunur , Shafin, Sakib , Kamruzzaman, Joarder
- Date: 2021
- Type: Text , Conference paper
- Relation: 2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering, CSDE 2021, Brisbane, 8-10 December 2021
- Full Text: false
- Reviewed:
- Description: Telecommunication service providers are going through a very competitive and challenging time to retain existing customers by offering new and attractive services (e.g., unlimited local and international calls, high-speed internet, new phones). It is therefore imperative to analyse and predict customer churn behaviour more accurately. One of the major challenges to analyse churn data and build better prediction model is the imbalance nature of the data. Customer behaviour for churn and non-churn scenarios may contain resembling features. Using a single classifier or simple oversampling method to handle data imbalance often struggles to identify the minority (churn) class data. To overcome the issue, we introduce a model that uses sophisticated oversampling technique in conjunction with ensemble methods, namely Random Forest, Gradient Boost, Extreme Gradient Boost, and AdaBoost. The hyperparameters of the baseline ensemble methods and the oversampling methods were tuned in several ways to investigate their impact on prediction performances. Using a widely used publicly available customer churn dataset, prediction performance of the proposed model was evaluated in term of various metrics, namely, accuracy, precision, recall, F-1 score, AUC under ROC curve. Our model outperformed the existing models and significantly reduced both false positive and false negative prediction. © IEEE 2022.