Please wait a minute...
  2021, Vol. 2 Issue (1): 66-82    doi: 10.23919/ICN.2020.0023
Networks     
CNN and MLP neural network ensembles for packet classification and adversary defense
Bruce Hartpence*(),Andres Kwasinski()
GCCIS i-School at the Rochester Institute of Technology, Rochester, New York, NY 14623, USA
Department of Computer Engineering, KGCOE at the Rochester Institute of Technology, Rochester, New York, NY 14623, USA
Download: PDF (12614 KB)      HTML
Export: BibTeX | EndNote (RIS)      

Abstract  

Machine learning techniques such as artificial neural networks are seeing increased use in the examination of communication network research questions. Central to many of these research questions is the need to classify packets and improve visibility. Multi-Layer Perceptron (MLP) neural networks and Convolutional Neural Networks (CNNs) have been used to successfully identify individual packets. However, some datasets create instability in neural network models. Machine learning can also be subject to data injection and misclassification problems. In addition, when attempting to address complex communication network challenges, extremely high classification accuracy is required. Neural network ensembles can work towards minimizing or even eliminating some of these problems by comparing results from multiple models. After ensembles tuning, training time can be reduced, and a viable and effective architecture can be obtained. Because of their effectiveness, ensembles can be utilized to defend against data poisoning attacks attempting to create classification errors. In this work, ensemble tuning and several voting strategies are explored that consistently result in classification accuracy above 99%. In addition, ensembles are shown to be effective against these types of attack by maintaining accuracy above 98%.



Key wordsConvolutional Neural Network (CNN)      Multi-Layer Perception (MLP)      ensemble      classification      adversary     
Received: 03 September 2020      Online: 19 August 2021
Corresponding Authors: Bruce Hartpence     E-mail: bhhics@rit.edu;axkeec@rit.edu
About author: Bruce Hartpence received the PhD degree in computing and information sciences from the Rochester Institute of Technology in 2020, and the MS degree in information technology from the same institute from in 1998. He is currently a professor for the GCCIS i-School at the Rochester Institute of Technology (RIT) where he teaches networking and communication. For the last several years, he has explored the application of neural networks to communication challenges. He has also authored several books and video series for O’Reilly publishing including the Packet Guide series. He is also a co-developer of the IEEE 1910.1 Meshed Tree standard for the IEEE. His current areas of research include neural network ensembles applied wired and wireless, real time communication, and intelligent networking.|Andres Kwasinski received the diploma degree in electrical engineering from the Buenos Aires Institute of Technology, Buenos Aires, Argentina, in 1992 and the MS and PhD degrees in electrical and computer engineering from the University of Maryland, College Park, MD, USA, in 2000 and 2004, respectively. He is currently a professor at the Department of Computer Engineering, Rochester Institute of Technology, Rochester, NY, USA. He has co-authored over 90 publications in peer-reviewed journals and international conferences. He has also co-authored the books Cooperative Communications and Networking (Cambridge University Press, 2009) and 3D Visual Communications (Wiley, 2013). His current areas of research include cognitive radios and wireless networks, cross-layer techniques in wireless communications, and signal processing applied to smart infrastructures. He is also the chief editor of the Signal Processing Repository (SigPort) and an area editor for Special Initiatives of the IEEE Signal Processing Magazine.
Cite this article:

Bruce Hartpence,Andres Kwasinski. CNN and MLP neural network ensembles for packet classification and adversary defense. , 2021, 2: 66-82.

URL:

http://icn.tsinghuajournals.com/10.23919/ICN.2020.0023     OR     http://icn.tsinghuajournals.com/Y2021/V2/I1/66

Fig. 1 Architecture overview.
GeneralTCPUDP
EmptyPort 443DNS
STPPort 80DHCP
CDPPort 8080SSDP
Loopback-NBNS
ARP-RTP 1
ICMP Echo Req-RTP 2
ICMP Echo Reply-RTP 3
IGMP-RTP 4
UDP--
TCP--
Table 1 Breakdown of the class structure.
Fig. 2 Raw packet vs. Cleaned packets.
Fig. 3 Packet image.
Fig. 4 Base structure of convolutional neural network.
Fig. 5 MLP neural network.
Fig. 6 General stage.
Fig. 7 UDP stage.
InputLayersHidden nodeBatch sizeOptimizer
128220128Adam
14423064Adam
156220128Adam
Table 2 MLP model configuration.
InputFilter 1Filter 2Fully-connected layersOptimizer
19682080-40-20AdaDelta
196122496-48-24AdaDelta
19682080-40-20Adam
Table 3 CNN model configuration.
DataCNN 1CNN 2MLP 1MLP 2Ensemble
00.99490.99761.00001.00000.9984
10.96100.97590.97980.99830.9734
20.98620.98980.98920.99590.9913
30.98380.98350.97820.99760.9974
40.98820.98650.99230.99830.9956
50.97440.96890.99410.98960.9687
60.94590.97110.98940.99830.9846
Ave.0.97630.98190.98900.99690.9871
Table 4 Early model comparison.
DataCNN 1CNN 2MLP 1MLP 2Ensemble
00.99820.99771.00001.00001.0000
10.98570.9740.98830.99110.9926
20.99230.98670.99150.99260.9931
30.96850.98180.98220.99110.9911
40.97720.99010.99420.99580.9953
50.99290.97280.99310.99250.9930
60.99090.96840.99120.99340.9945
Ave.0.98650.98160.99150.99380.9942
Table 5 Elimination model comparison.
DataCNN 1CNN 2MLP 1MLP 2Ensemble
00.99980.99991.00001.00001.0000
10.97980.97740.99830.99890.9989
20.98940.9940.99380.98960.9948
30.98010.96080.98780.99660.9981
40.99080.99410.99250.99580.9963
50.99400.98580.99220.99260.9933
60.97030.97100.99090.99830.9909
Ave.0.98630.98330.99360.99600.9960
Table 6 Elimination model—1000 iterations.
DataCNN 1CNN 2MLP 1MLP 2Ensemble
00.99980.99991.00001.00001.0000
10.98640.99110.99890.99810.9994
20.99730.99750.99540.99660.9969
30.97490.99720.9950.99420.9974
40.99590.99510.99580.99630.9974
50.99160.99910.99140.99590.9935
60.98060.99790.99820.99830.9995
Ave.0.98950.99680.99640.99710.9977
Table 7 Modified models—1000 iterations.
DataCNNMLPEnsemble
123123
00.99890.99980.99931.00001.00001.00001.0000
10.98000.99000.98870.99180.98860.99780.9913
20.98480.99710.99330.99550.99520.99520.9965
30.99150.98710.96970.99250.99750.99810.9969
40.99290.99290.99530.99880.99880.99860.9993
50.99350.99670.99350.99490.99520.99670.9959
60.95870.99670.98520.9980.99690.99580.9991
Ave.0.98580.99430.98930.99590.99600.99750.9970
Table 8 Adding a model—500 iterations.
DatasetCNNMLPEnsemble
123123
00.99990.99990.99991.00001.00001.00001.0000
10.98580.98540.99540.99710.99920.99810.9991
20.99600.99680.99630.99180.9990.99460.9974
30.99920.99920.99780.9990.99980.99710.9998
40.99240.99550.99620.99890.99860.99830.9989
50.99380.99430.99670.99470.99920.99490.9976
60.98010.98030.98170.99690.9980.99720.9990
71.00001.00001.00001.00001.00001.00001.0000
81.00001.00001.00001.00001.00001.00001.0000
Ave.0.99410.99460.99600.99760.99930.99780.9991
Iteration8711231419-
Table 9 Final ensemble.
DatasetCNNMLPEnsemble
123123
01.00001.00001.00001.00001.00001.00001.0000
10.98720.95870.99000.92210.89450.88950.9890
20.99350.95810.98520.52470.54670.54430.9808
30.97580.99311.00000.72320.81310.78890.9931
40.9890.92170.98710.47420.50740.4190.9494
50.99850.97660.99680.88940.87470.87940.9895
60.99270.93500.98610.63500.49570.49040.9569
71.00001.00001.00001.00001.00001.00001.0000
81.00001.00001.00001.00001.00001.00001.0000
Ave.0.99300.97150.99390.79650.79250.77910.9843
Iteration224134652562-
Table 10 Final UDP ensemble.
DatasetCNNMLPEnsemble
123123
01.00001.00001.00001.00001.00001.00001.0000
10.75800.73210.82100.89720.90580.88930.9251
20.80330.69880.86920.84310.89540.89790.9218
30.90840.93270.94020.97180.98210.97640.9861
40.75100.51590.64030.86860.86250.64030.8695
50.30510.93220.94921.00000.77971.00001.0000
60.58110.63490.57690.91010.93200.83790.9041
7-------
8-------
Ave.0.72960.77810.82810.92730.90820.89170.9438
Iteration221188353117-
Table 11 Final TCP ensemble.
StageCNNMLPEnsemble
123123
Gen0.99410.99460.99600.99760.99930.99780.9991
UDP0.99300.97150.99390.79650.79250.77910.9843
TCP0.72960.77810.82810.92730.90820.89170.9438
Ave.0.90560.91470.93930.90710.90000.88950.9757
Table 12 Overall performance.
StageCNNMLP
123123
General0.98580.98540.99540.99710.99920.9981
UDP0.98720.95870.99000.92210.89450.8895
TCP0.75800.73210.82100.89720.90580.8893
Table 13 Initial validation based weights.
StageCNNMLP
123123
General0.99000.97440.99180.99660.99760.9946
UDP0.98240.96910.97180.90710.88720.9008
TCP0.76470.81250.79710.89720.89380.8892
Table 14 Final validation based weights.
StageInitialCycle 1Cycle 2Cycle 3Final
General0.99910.99910.99790.99790.9987
UDP0.98430.94640.97850.95080.9384
TCP0.94380.92140.91720.93910.9262
Table 15 Validation weight performance.
StageCNNMLP
123123
General0.99120.99190.9940.99640.99890.9967
UDP0.98940.95720.99080.69470.68860.6685
TCP0.68440.74110.79940.91510.89290.8736
Table 16 Initial model average based weights.
StageCNNMLP
123123
General0.99350.99290.99360.99650.99700.9967
UDP0.95160.94780.96020.66930.68150.7156
TCP0.73960.75560.71620.91000.89590.8866
Table 17 Final model average based weights.
StageInitialCycle 1Cycle 2Cycle 3Final
General0.99910.99860.99910.99820.9979
UDP0.98430.93600.97220.99040.9717
TCP0.94380.92810.93310.92980.9246
Table 18 Ensemble average weight performance.
StageCNNMLP
123123
General0.99870.91910.99320.99740.99570.9976
UDP0.93470.98770.96640.71750.83110.8729
TCP0.78630.80160.77260.92150.90980.8757
Table 19 Actual model Ave Wt performance.
LoopSTPCDP
290714 534484
Table 20 Dataset 5—Layer 2.
IGMPDNSDHCPSSDPNBNS
235686867311127
Table 21 Dataset 5—Layers 3 and 4.
TestIGMPDNSNBNSTCPGeneral accuracyUDP accuracy
DS 5235673890670.99720.9843
zero’d23588009120.99980.9843
9’s235678088270.99890.9849
ARP235773189750.99690.9845
TCP2356579682480.98980.9893
Table 22 Dataset 5—Poisoned.
[1]   Hartpence B. and Kwasinski A., Fast internet packet and flow classification based on artificial neural networks, in Proc. 2019 SoutheastCon, Huntsville, AL, USA, 2019, p. 19433953.
[2]   Hartpence B. and Kwasinski A., A convolutional neural network approach to improving network visibility, in Proc. 2020 29th Wireless and Optical Communications Conf. (WOCC), Newark, NJ, USA, 2020, pp. 1-6.
[3]   Sharkey A. J. C., Sharkey N. E., Gerecke U., and Chandroth G. O., The “test and select” approach to ensemble combination, in Proc. 1st Int. Workshop on Multiple Classifier Systems, Cagliari, Italy, 2000, pp. 30-44.
[4]   Dietterich T. G., Ensemble methods in machine learning, in Proc. 1st Int. Workshop on Multiple Classifier Systems, Cagliari, Italy, 2000, pp. 1-15.
[5]   Yaman M. A., Subasi A., and Rattay F., Comparison of random subspace and voting ensemble machine learning methods for face recognition, Symmetry, vol. 10, no. 11, p. 651, 2018.
[6]   Knauer U., Von Rekowski C. S., Stecklina M., Krokotsch T., Minh T. P., Hauffe V., Kilias D., Ehrhardt I., Sagischewski H., Chmara S., et al., Tree species classification based on hybrid ensembles of a convolutional neural network (CNN) and random forest classifiers, Remote Sens., vol. 11, no. 23, p. 2788, 2019.
[7]   Tasci E., Voting combinations-based ensemble of fine-tuned convolutional neural networks for food image recognition, Multimed. Tools Appl., vol. 79, no. 41, pp. 30 397-30 418, 2020.
[8]   Livieris I. E., Kanavos A., Tampakas V., and Pintelas P., A weighted voting ensemble self-labeled algorithm for the detection of lung abnormalities from x-rays, Algorithms, vol. 12, no. 3, p. 64, 2019.
[9]   Strauss T., Hanselmann M., Junginger A., and Ulmer H., Ensemble methods as a defense to adversarial perturbations against deep neural networks, arXiv preprint arXiv: 1709.03423, 2017.
[10]   Wei W. Q., Liu L., Loper M., Chow K. H., Gursoy E., Truex S., and Wu Y. Z., Cross-layer strategic ensemble defense against adversarial examples, in Proc. 2020 Int. Conf. Computing, Networking and Communications (ICNC), Big Island, HI, USA, 2020, pp. 456-460.
[11]   He K. M., Zhang X. Y., Ren S. Q., and Sun J., Deep residual learning for image recognition, in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778.
[12]   Simonyan K. and Zisserman A., Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv: 1409.1556, 2014.
[13]   Pan B., Shi Z. W., and Xu X., Hierarchical guidance filtering-based ensemble classification for hyperspectral images, IEEE Trans. Geosci. Remote Sens., vol. 55, no. 7, pp. 4177-4189, 2017.
[14]   Gunderson S. and Jagodzinski F., Ensemble voting schemes that improve machine learning models for predicting the effects of protein mutations, in Proc. 2018 ACM Int. Conf. Bioinformatics, Computational Biology, and Health Informatics, New York, NY, USA, 2018, pp. 211-219.
[15]   Sin E. and Wang L. P., Bitcoin price prediction using ensembles of neural networks, in Proc. 2017 13th Int. Conf. Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNCFSKD), Guilin, China, 2017, pp. 666-671.
[16]   Xu L. Y., Zhou X., Ren Y. M., and Qin Y. F., A traffic classification method based on packet transport layer payload by ensemble learning, in Proc. 2019 IEEE Sympo. Computers and Communications (ISCC), Barcelona, Spain, 2019, pp. 1-6.
[17]   Gómez S. E., Hernández-Callejo L., Martínez B. C., and Sánchez-Esguevillas A. J., Exploratory study on class imbalance and solutions for network traffic classification, Neurocomputing, vol. 343, pp. 100-119, 2019.
[18]   Gómez S. E., Martínez B. C., Sánchez-Esguevillas A. J., and Callejo L. H., Ensemble network traffic classification: Algorithm comparison and novel ensemble scheme proposal, Comput. Networks, vol. 127, pp. 68-80, 2017.
[19]   Gargiulo F., Kuncheva L. I., and Sansone C., Network protocol verification by a classifier selection ensemble, in Proc. 8th Int. Workshop on Multiple Classifier Systems, Reykjavik, Iceland, 2009, pp. 314-323.
[20]   CAIDA, .
[21]   Szegedy C., Vanhoucke V., Ioffe S., Shlens J., and Wojna Z., Rethinking the inception architecture for computer vision, in Proc. 2016 IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 2818-2826.
[22]   Tramèr F., Kurakin A., Papernot N., Goodfellow I., Boneh D., and McDaniel P., Ensemble adversarial training: Attacks and defenses, arXiv preprint arXiv: 1705.07204, 2017.
[23]   Nasr M., Bahramali A., and Houmansadr A., Blind adversarial network perturbations, arXiv preprint arXiv: 2002.06495, 2020.
No related articles found!