skip to main content

DDoS attack detection with feature engineering and machine learning: the framework and performance evaluation

Aamir, Muhammad ; Zaidi, Syed Mustafa Ali

International journal of information security, 2019-12, Vol.18 (6), p.761-785 [Periódico revisado por pares]

Berlin/Heidelberg: Springer Berlin Heidelberg

Texto completo disponível

Citações Citado por
  • Título:
    DDoS attack detection with feature engineering and machine learning: the framework and performance evaluation
  • Autor: Aamir, Muhammad ; Zaidi, Syed Mustafa Ali
  • Assuntos: Algorithms ; Artificial intelligence ; Coding and Information Theory ; Collinearity ; Communications Engineering ; Computer Communication Networks ; Computer Science ; Cryptology ; Datasets ; Denial of service attacks ; Engineering ; Engineering education ; Experimentation ; Machine learning ; Management of Computing and Information Systems ; Mathematical models ; Networks ; Operating Systems ; Performance evaluation ; Reduction ; Regular Contribution ; Support vector machines
  • É parte de: International journal of information security, 2019-12, Vol.18 (6), p.761-785
  • Descrição: This paper applies an organized flow of feature engineering and machine learning to detect distributed denial-of-service (DDoS) attacks. Feature engineering has a focus to obtain the datasets of different dimensions with significant features, using feature selection methods of backward elimination, chi2, and information gain scores. Different supervised machine learning models are applied on the feature-engineered datasets to demonstrate the adaptability of datasets for machine learning under optimal tuning of parameters within given sets of values. The results show that substantial feature reduction is possible to make DDoS detection faster and optimized with minimal performance hit. The paper proposes a strategic-level framework which incorporates the necessary elements of feature engineering and machine learning with a defined flow of experimentation. The models are also validated with cross-validation and evaluated for area-under-curve analyses. It provides comprehensive solutions which can be trusted to avoid the overfitting and collinearity problems of data while detecting DDoS attacks. In the case study of DDoS datasets, K-nearest neighbors algorithm overall exhibits the best performance followed by support vector machine, whereas low-dimensional datasets of discrete feature types perform better under the Random Forest model as compared to high dimensions with numerical features. The accuracy scores of dataset with the lowest number of features remain competitive with other datasets under all machine learning models, leading to a substantially reduced processing overhead. The experiments show that approximately 68% reduction in the feature space is possible with an impact of only about 0.03% on accuracy.
  • Editor: Berlin/Heidelberg: Springer Berlin Heidelberg
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.