Elsevier

Computer Communications

Volume 176, 1 August 2021, Pages 249-257
Computer Communications

ShuffleNet-inspired lightweight neural network design for automatic modulation classification methods in ubiquitous IoT cyber–physical systems

https://doi.org/10.1016/j.comcom.2021.05.005Get rights and content

Abstract

Automatic modulation classification (AMC) is one of the most important technologies of cognitive radios and ubiquitous internet of things (IoT) cyber–physical systems, and it can be adopted to recognize unknown signals. Recently, deep learning (DL) has been applied into AMC for the advanced classification performance. However, DL-based AMC methods generally have high computation complexity and large model sizes, which means that these methods can be rarely implemented into some IoT devices. In this paper, inspired by ShuffleNet, we design a lightweight convolutional neural network (CNN), which is named as ShuffleCNN, and a ShuffleCNN-based AMC (ShffuleAMC) method is proposed for the ubiquitous IoT cyber–physical systems with orthogonal frequency division multiplexing (OFDM). Besides, we also introduce fast Fourier transform (FFT) to pre-process the OFDM signals for the classification performance improvement, and apply 2 regularization to avoid overfitting. It is demonstrated by simulation results that our proposed ShuffleAMC method has little performance loss, when compared with the common CNN-based AMC methods. More importantly, our proposed ShuffleAMC method also has the strengths of low computation complexity and few model sizes.

Introduction

Advanced signal processing techniques are required to investigate to develop reliable and secure wireless communications and internet of things (IoT) [1], [2], [3], [4], [5]. Automatic modulation classification (AMC) is one of key technologies of ubiquitous internet of things (IoT) cyber–physical systems and cognitive radios [6], [7], [8], [9], which has been introduced into military and civilian scenarios for analyzing the characterizes of unknown signals. Most of traditional AMC methods are modeled as classification problems based on manmade features [10], [11], such as instantaneous feature (IF) [12], high order cumulants (HOC) [13] and cyclic spectrum-based feature [14], and the original classification algorithms rely on multiple thresholds, but with the development of machine learning (ML), ML-based classifiers [15], [16], such as decision tree (DT), random forest (RF), K-nearest neighbor (KNN) and support vector machine (SVM), are introduced into AMC for better classification performance. In recent years, deep learning (DL) has been applied to various communication technologies [17], [18], [19], [20], [21], [22], [23], [24], including AMC [25], [26], [27], [28], [29], because of its powerful feature extraction and advanced classification capability. DL-based AMC methods has been demonstrated to outperform the feature-based AMC methods, and it also has outstanding robustness in various communication scenarios [28], [29], [30], [31], [32], [33], [34].

However, the previously proposed AMC methods are based on DL models with huge model parameters and high complexity [25], [26], [27], [28], [29], and they must be supported by powerful graph process unit (GPU), which mean that it is difficult for these methods to apply into the practical ubiquitous IoT cyber–physical systems. Thus, we redesign a lightweight convolution neural network (CNN) model for AMC in the orthogonal frequency division multiplexing (OFDM) systems, which is inspired by the famous ShuffleNet [35]. In detail, fast Fourier transform (FFT) is first to pre-process the received OFDM signals. Then, we apply efficient convolution, channel shuffle and global pooling to reconstruct the CNN, which is named as ShuffleCNN. Finally, based on the pre-processed OFDM signals, the ShuffleCNN-based AMC method. i.e., ShuffleAMC, is trained with 2 regularization, which can avoid overfitting and slightly improving the classification performance. The contributions of this paper are listed as follows.

  • (1)

    The FFT is introduced as the pre-processing method to improve the classification performance at low signal-to-noise ratios (SNRs).

  • (2)

    We design and implement a lightweight CNN (LCNN)-based AMC methods, i.e., ShuffleAMC, and our proposed ShuffleAMC method has the similar classification performance with the original AMC method, and its main advances are the fewer model parameters and lower computation complexity, when compared with the original AMC method.

  • (3)

    The 2 regularization is introduced into the training processing, and the experimental results show that 2 regularization with the appropriate regularization factor can prevent overfitting, improve the classification performance and accelerate the training process.

Section snippets

Related works

In the related works, we introduce the classical ML-based AMC methods [36] with traditional features, the existed DL-based AMC methods, and the basic principle of lightweight neural network (LNN) design with its applications in communication systems, respectively.

Signal model in OFDM systems

The signal generation is shown in Fig. 1. In the transmitter, transmitted symbols X(k) are modulated and then these modulation signal inserted with pilots is converted into multiple signal streams by serial-to-parallel (S/P) conversion. Next, the inverse fast Fourier transform (IFFT) is applied to transform every signal stream from frequency domain into time domain. After IFFT, the cyclic prefix (CP) is inserted to every signal stream for mitigating inter symbol interface. Finally, multiple

CNN structure

In this paper, we adopt a CNN with two convolution layers and three fully-connected (FC) layers [26]. Convolution layers are followed by batch normalization (BN), rectified linear unit (ReLU) and dropout, while ReLU and dropout follow the former two FC layers and the last FC layer use Softmax as activation function (see Fig. 2).

Loss function with 2 regularization

AMC is modeled as a multi-class classification problem, and cross entropy (CE) is generally applied as its loss function. Suppose that the training samples and labels

Simulation parameters and implementation platform

The generation of IQ dataset and traditional features relies on Matlab, and DL and ML algorithms are based on Python with Tensorflow and Scikit-learn library. The simulations about DL are implemented by NVIDIA GeForce GTX1080Ti, and other simulation parameters are given in Table 1.

No FFT pre-processing vs. FFT pre-processing

Different from paper [26], the received OFDM signal is pre- processed by FFT before classification. Here, we give the simulation results of the CNN-based AMC method with FFT pre-processing or without FFT

Conclusion

In this paper, we proposed a ShuffleAMC method for the IoT cyber–physical systems, which is based on a lightweight model (named as ShuffleCNN), and our proposed ShuffleAMC has far fewer parameters and less complexity than the CNN-based AMC method, but there is almost no performance gap between the CNN-based AMC method and ShuffleAMC methods. Except this, we also introduced the FFT to pre-process the received OFDM signals for the classification performance improvement and the training

CRediT authorship contribution statement

Jie Yin: Conceived of the presented idea, Developed the algorithm part of shuffleNet-inspired lightweight neural network design for automatic modulation classification methods. Liang Guo: Developed the algorithm part of shuffleNet-inspired lightweight neural network design for automatic modulation classification methods. Wei Jiang: Conceived of the presented idea. Sheng Hong: Conducted the simulation experiments and verified the proposed methods. Jie Yang: Conducted the simulation experiments

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

All authors discussed the results and contributed to the final manuscript.

References (48)

  • HanX. et al.

    Congestion recognition for hybrid urban road systems via digraph convolutional network

    Transp. Res. C

    (2020)
  • S. Jiang, J. Cao, H. Wu, Y. Yang, Fairness-based packing of industrial IoT data in permissioned blockchains, IEEE...
  • SunJ. et al.

    Aviation data lake: using side information to enhance future air-ground vehicle networks

    IEEE Veh. Technol. Mag.

    (2021)
  • ZhaoY. et al.

    Visual analytics for electromagnetic situation awareness in radio monitoring and management

    IEEE Trans. Vis. Comput. Graphics

    (2020)
  • Y. Zhao, H. Jiang, Q. Chen, Y. Qin, H. Xie, Y. Wu, S. Liu, Z. Zhou, J. Xia, F. Zhou, Preserving minority structures in...
  • GuiG. et al.

    6G: Opening new horizons for integration of comfort, security and intelligence

    IEEE Wirel. Commun. Mag.

    (2020)
  • RamkumarB.

    Automatic modulation classification for cognitive radios using cyclic feature detection

    IEEE Circuits Syst. Mag.

    (2009)
  • Z. Guo, Y. Shen, A.K. Bashir, M. Imran, N. Kumar, D. Zhang, K. Yu, Robust spammer detection using collaborative neural...
  • W. Wang, X. Zhao, et al. An attention-based deep learning framework for trip destination prediction of sharing bike,...
  • X. Kong, S. Tong, H. Gao, G. Shen, K. Wang, M. Collotta, I. You, S. Das, Mobile edge cooperation optimization for...
  • WangY.

    Deep learning-based cooperative automatic modulation classification method for MIMO systems

    IEEE Trans. Veh. Technol.

    (2020)
  • A. Hazza, M. Shoaib, S.A. Alshebeili, A. Fahad, An overview of feature-based methods for digital modulation...
  • A. Hazza, M. Shoaib, S.A. Alshebeili, A. Fahad, An overview of feature-based methods for digital modulation...
  • XieW. et al.

    Deep learning in digital modulation recognition using high order cumulants

    IEEE Access

    (2019)
  • H. Zhao, Y. Zhou, B. Sun, L. Tian, J. Shi, Cyclic spectrum based intelligent modulation recognition with machine...
  • Abu-RomohM. et al.

    Automatic modulation classification using moments and likelihood maximization

    IEEE Commun. Lett.

    (2018)
  • AhmedN. et al.

    Internet of things (IoT) for smart precision agriculture and farming in rural areas

    IEEE Internet Things J.

    (2018)
  • YeH. et al.

    Deep learning-based end-to-end wireless communication systems with conditional GANs as unknown channels

    IEEE Trans. Wireless Commun.

    (2020)
  • LuoC. et al.

    Channel state information prediction for 5g wireless communications: A deep learning approach

    IEEE Trans. Netw. Sci. Eng.

    (2020)
  • LeeH. et al.

    Deep learning framework for wireless systems: Applications to optical wireless communications

    IEEE Commun. Mag.

    (2019)
  • HuangH. et al.

    Fast beamforming design via deep learning

    IEEE Trans. Veh. Technol.

    (2020)
  • WangT. et al.

    Deep learning for wireless physical layer: Opportunities and challenges

    China Commun.

    (2017)
  • Z. Guo, K. Yu, Y. Li, G. Srivastava, J. Lin, Deep learning-embedded social internet of things for ambiguity-aware...
  • ZhangJ. et al.

    3D reconstruction for motion blurred images using deep learning-based intelligent systems

    Comput. Mater. Contin.

    (2021)
  • Cited by (0)

    This work was supported by the Major Project of the Ministry of Industry and Information Technology of China under Grant TC190A3WZ-2, the Open Project Program of the State Key Lab of CAD&CG, China (A2102), Zhejiang University, the project of the Key Laboratory of Universal Wireless Communications (BUPT) of Ministry of Education of China under Grant KFKT-2020106.

    View full text