论文标题
IGRF-RFE:用于基于MLP的网络入侵检测的混合功能选择方法在UNSW-NB15数据集上
IGRF-RFE: A Hybrid Feature Selection Method for MLP-based Network Intrusion Detection on UNSW-NB15 Dataset
论文作者
论文摘要
机器学习模型的有效性受数据集大小的显着影响,并且功能的质量是冗余和无关的功能可以从根本上降低性能。本文提出了IGRF-RFE:一种使用多层perceptron(MLP)网络的多级网络异常的混合功能选择方法。 IGRF-RFE可以视为基于滤波器特征选择方法和包装器特征选择方法的功能还原技术。在我们提出的方法中,我们使用滤波器特征选择方法,即信息增益和随机森林重要性的组合来减少特征子集搜索空间。然后,我们将递归功能消除(RFE)作为包装特征选择方法,以进一步消除还原的特征子集上的冗余特征。我们基于UNSW-NB15数据集获得的实验结果证实,我们提出的方法可以提高异常检测的准确性,同时降低特征维度。结果表明,该特征维度从42降低到23,而MLP的多分类精度从82.25%提高到84.24%。
The effectiveness of machine learning models is significantly affected by the size of the dataset and the quality of features as redundant and irrelevant features can radically degrade the performance. This paper proposes IGRF-RFE: a hybrid feature selection method tasked for multi-class network anomalies using a Multilayer perceptron (MLP) network. IGRF-RFE can be considered as a feature reduction technique based on both the filter feature selection method and the wrapper feature selection method. In our proposed method, we use the filter feature selection method, which is the combination of Information Gain and Random Forest Importance, to reduce the feature subset search space. Then, we apply recursive feature elimination(RFE) as a wrapper feature selection method to further eliminate redundant features recursively on the reduced feature subsets. Our experimental results obtained based on the UNSW-NB15 dataset confirm that our proposed method can improve the accuracy of anomaly detection while reducing the feature dimension. The results show that the feature dimension is reduced from 42 to 23 while the multi-classification accuracy of MLP is improved from 82.25% to 84.24%.