Abstract
Signal detection scheme is the key technology to the implementation of multiple-input multiple-output (MIMO) wireless communication system, while the spatial-multiplexing coded MIMO systems cause a severe design challenge for signal detection algorithms. Although many researches focus on searching the solution space for optimal solution based on more efficient searching algorithm, the signal detection of MIMO system does not regarded as a classification problem. In this paper, the detection problem is considered as a feature classification, and a novel signal detection scheme of MIMO system based on extreme learning machine auto encoder (ELM-AE) is proposed. The proposed algorithm can efficiently extract the features of input data by ELM-AE and classify these representations to corresponding groups rapidly by using extreme learning machine (ELM). This paper has constructed a theoretical model of the proposed signal detector for MIMO system and carried out simulations to evaluating its performance. Simulation results indicate that the proposed detector outperforms many traditional schemes and state-of-the-art algorithms.
Similar content being viewed by others
Keywords
1 Introduction
Multiple-input multiple-output (MIMO) technology has been one of the most significant techniques of wireless communication system. The attractions of MIMO are the high performance gain and channel capacity which linear increase with the number of transmitter and receiver antennas due to the multipath parallel data transmission in the same frequency band [1]. The key to the implementation of MIMO system is the signal detection algorithm. Although many reliable signal detection methods for MIMO system have been proposed, it is still urgent for researchers to seek more efficient signal detection schemes to improve the current systems’ performances and adapt to the applications of massive MIMO and super massive MIMO systems in the future. Therefore, it is a hot topic of MIMO system that how to search the better signal detection algorithms balancing the relationship of complexity and performance of detection scheme.
There are many signal detection algorithms for MIMO system available, and maximum likelihood (ML) detector is the optimal detector which can minimize the bit-error-rate (BER) of detection but it has the highest computational complexity at the same time [2]. Zero-forcing (ZF) and minimum-mean-square-error (MMSE) algorithms are typical linear detection schemes with much lower complexities but less BER performances than ML [3]. Zero-forcing successive-interference-cancellation (ZF-SIC) [4] and minimum-mean-squared-error successive-interference-cancellation (MMSE-SIC) [5] are improved algorithms of ZF and MMSE respectively based on SIC method, they utilize the cancellation of the former detected symbol’s information to decrease BER. Radial basis function network optimized by quantum genetic algorithm (QGA-RBF) [6] and quantum ant colony algorithm (QACA) [7] are intelligent optimization algorithm that can shrink the searching region where the optimal solution exists, but the computational complexity will increase concurrently with respect to the population size of QGA. Though there are many signal detection algorithms, but few of them have applied machine learning methods to signal detection of MIMO system, and none of them regards the problem as pattern recognition or feature classification problem, while this kind of issue transformation might be necessary in the process of high dimensional received data of massive MIMO systems in the future.
Deep learning and machine learning algorithms have been successfully applied in feature extraction and classification tasks in decades, and extreme learning machine algorithm stands out from others in virtue of its fast training speed, good generalization and universal approximation capability [8, 9]. In this paper, a novel signal detection algorithm based on extreme learning machine auto encoder (ELM-AE) [10] has been proposed. The proposed algorithm consists of two parts; the feature extractor based on ELM-AE can obtain useful feature representations of input samples by separating the channel state information via the projection of the learnt connection weights parameters from the unsupervised learning of the ELM-AE, the trained ELM classifier is used to recognize the corresponding original symbols of the features. Due to the fast learning speed and high classification accuracy of ELM, simulation results indicate that the proposed algorithm is more efficient than ML, QGA-RBF and the performance outperforms ZF, MMSE, and ZF-SIC, MMSE-SIC algorithms.
The rest of this paper is organized as follows. Section 2 introduces the MIMO system model, and ELM, ELM-AE algorithms. The proposed signal detection algorithm is described in Sect. 3. Section 4 presents the simulation results. Finally, conclusions are drawn in Sect. 5.
2 MIMO System, ELM and ELM Auto-Encoder
2.1 MIMO System
This paper investigates a point-to-point MIMO system with N transmitter antennas and M receiver antennas, where N ≤ M. The structure of MIMO wireless system is presented as in Fig. 1.
As shown in Fig. 1, the input-output relationship of MIMO system can be expressed in vector form as follows:
where \( \varvec{y}{\mathbf{ \;=\; }}\left[ {\varvec{y}_{{\mathbf{1}}} {\mathbf{,}}\varvec{y}_{{\mathbf{2}}} {\mathbf{,}}\,{\mathbf{ \ldots ,}}\varvec{y}_{\varvec{M}} } \right]^{{\mathbf{T}}} \) is received signal vector, the corresponding transmitted vector \( \varvec{x}{\mathbf{ \;=\; }}\left[ {\varvec{x}_{{\mathbf{1}}} {\mathbf{,}}\varvec{x}_{{\mathbf{2}}} {\mathbf{,}}\,{\mathbf{ \ldots ,}}\varvec{x}_{\varvec{N}} } \right]^{{\mathbf{T}}} \), H s denotes a M × N channel state matrix and n is the additive white Gaussian noise with zero mean and variance of σ 2 [11].
In MIMO system the received symbol on every receiving antenna contains all information from transmitted symbols [12], as presented in Eq. (2).
where h k,i denotes the channel gain from the i th transmitting antenna to the k th receiving antenna, n k represents the white Gaussian additive noise on k th receiving antenna.
Signal detection of MIMO system is to obtain the optimal solution ẍ which has the minimum difference compares to the source signal x from the received symbols y.
The maximum likelihood (ML) detection is an optimal detection algorithm; it searches the entire solution space Ф to find the optimal solution ẍ which can minimize the cost function
On account of the high complexity, suboptimal detection scheme is required. Thus linear detection methods (ZF and MMSE) and nonlinear detection methods (QGA-RBF, QACA etc.) are designed based on optimal detection theory [7].
2.2 Original Extreme Learning Machine
In order to solve the single hidden layer neural network (SLFNs), Huang [8, 13] has proposed a novel fast learning algorithm called extreme learning machine (ELM). The original ELM is shown in Fig. 2.
The first layer’s parameters of ELM are randomly generated and does not need to be fine turned and the output weights are obtained by Eq. (8). Huang has proved that ELM has the same solution formula in binary classification case, multi-classification case and regression case [14], thus ELM has the generic form in the application of classification and regression as presents in Fig. 2. Original ELM model is described as follows:
-
(1)
Define parameters of ELM. Suppose { x k , t k | x k ∈ R d , t k ∈ R m , k = 1, …, N } is the training set, where x k is the k th training vector, t k represents the k th target output (label) of corresponding training sample, and d, m denote the dimension of training samples and labels respectively, N is the number of training samples. w ij is defined as the random connection weight between i th input neuron and j th hidden neuron, b j is the bias of j th hidden neuron, they are randomly generated based on Gaussian distribution, and g is a activation function of hidden layer, typically is given by users.
-
(2)
Calculate the output feature representation matrix H of hidden layer
$$ H = \left[ {\begin{array}{*{20}c} {h_{1} \left( x \right)} \\ \vdots \\ {h_{L} \left( x \right)} \\ \end{array} } \right]^{T} = \left[ {\begin{array}{*{20}c} {h_{1} \left( {x_{1} } \right)} & \cdots & {h_{1} \left( {x_{N} } \right)} \\ \vdots & \ddots & \vdots \\ {h_{L} \left( {x_{1} } \right)} & \cdots & {h_{L} \left( {x_{N} } \right)} \\ \end{array} } \right]. $$(4)$$ {\text{Where}}\,\,\,\,h_{j} \left( {x_{n} } \right) = g\left( {\sum\limits_{i} {x_{n} \left( i \right) \cdot w_{i,j} } + b_{j} } \right). $$(5)and (i = 1, …, d, j = 1, …, L, L is the number of hidden neurons) [15].
-
(3)
Calculate the output weights. The target of our training is to obtain a weight matrix β which satisfies the equation:
$$ T = \beta \cdot \, H $$(6)where
$$ T = \left[ {\begin{array}{*{20}c} {t_{1} } \\ \vdots \\ {t_{N} } \\ \end{array} } \right]^{T} = \left[ {\begin{array}{*{20}c} {t_{11} } & \cdots & {t_{1N} } \\ \vdots & \ddots & \vdots \\ {t_{m1} } & \cdots & {t_{mN} } \\ \end{array} } \right] $$(7)is the target matrix (labels). Then we can obtain the output weight matrix
$$ \beta = T\cdot{\text{ H}}^{\dag } . $$(8)where T = [t 1 , …, t N ], H † is the Moore-Penrose generalized inverse of matrix H. Typically we can calculate the MP inverse efficiently with the orthogonal projection method [15]: H † = (H T H)−1 H T, for nonsingular case of H T H; if H T H is singular, H † = H T (HH T)−1. According to [14], if a positive value C is added to the diagonal of H T H or HH T, the solution could be more stable and has better generalization performance based on the ridge regression theory [16]. Thus the modified β is
$$ \beta = T \cdot H^{T} \left( {\frac{I}{C} + HH^{T} } \right)^{ - 1} . $$(9)$$ {\text{or}}\,\,\,\beta = T \cdot \left( {\frac{I}{C} + H^{T} H} \right)^{ - 1} H^{T} . $$(10) -
(4)
Trained ELM for classification and regression etc. The trained output weights β and random connection weights w and biases b is our target parameters for application. If testing data set is \( \{ S_{\text{i}} \,|\,S_{\text{i}} \in R^{d} ,i = 1, \ldots ,N_{s} \} \), then the corresponding output of ELM is as follows while H T H is nonsingular.
$$ f\left( S \right) = h\left( S \right) \cdot \beta = g\left( {w \cdot S + b} \right) \cdot TH^{T} \left( {\frac{I}{C} + HH^{T} } \right)^{ - 1} . $$(11)
2.3 ELM Auto-Encoder
Auto encoder (AE) is a representative unsupervised deep learning method, typically AE is used for feature extraction from unlabeled input data [17], and it can reduce the redundancies of input data. In addition a multilayer or a deep hierarchy’s structure can be built by stacking AEs on top of each other [18].
The ELM auto-encoder (ELM-AE) is a kind of auto encoder which is established based on the random projection and fast learning speed of ELM, it could be seen as a special case of ELM where the input is the target output meanwhile [10], the ELM-AE is consist of three layers as shown in Fig. 3.
The working process of ELM-AE is as the same as ELM as show in Fig. 2. There are connection weights w, a ′ between adjacent layers and bias b in hidden layer.
The input weights w and bias b of ELM-AE are randomly generated as the same as ELM in this paper, thus as show in Fig. 3 the input data x is mapped to L-dimensional ELM random feature space first, then transformed to a more stable and generalized feature space by activation function g. The output weights a ′ form a more stable and generalized projection of the input data than w via the unsupervised learning of ELM, therefore (a ′)T is used as the input weights instead of (w)T in ELM-AE as show in Fig. 4.
3 ELM Auto-Encoder for MIMO Signal Detection
The signal detection problem of MIMO system is regarded as a classification or pattern recognition problem in this paper, thus it is reasonable to resolve the problem with machine learning algorithms. In this paper, a novel detector for MIMO system based on unsupervised feature learning and classification via ELM based auto-encoder and ELM classifier is proposed, the schematic diagram is presented in Fig. 4. The detection algorithms is designed as follows:
-
(1)
Train ELM based auto encoder. ELM based auto-encoder is a special case of ELM, thus the training processing of ELM-AE is the same to ELM: First the input weights and biases of hidden layer [W, b] are randomly initialized based on Gaussian distribution, then the codes and the weights which are utilized to reconstruct input data can be obtained:
$$ {\text{Random projection code}}: \boldsymbol{h} = g\left( \boldsymbol{W} \cdot \boldsymbol{x} + \boldsymbol{b} \right). $$(12)$$ {\text{The output weights}}:\, a' = \boldsymbol{x \cdot h^{T}} \left( {{\mathbf{I}}/{\text{C }} + \boldsymbol{h \cdot h}^{T} } \right)^{ - 1} . $$(13) -
(2)
Feature projection. The output weight matrix a ′ of ELM-AE is served as the output of ELM-AE, and its transposed matrix a is set as the input weight of feature projection layer of the proposed detector:
$$ \boldsymbol{a} = \, \left( {\boldsymbol{a'}} \right)^{T} . $$(14)The output H of feature layer are the feature representations of input data, they are more stable and generalized than the random projection code of ELM-AE:
$$ \boldsymbol{H} = g\left( {\boldsymbol{a}} \cdot \boldsymbol{x} + \boldsymbol{b} \right). $$(15) -
(3)
Train ELM classifier. The next step as the dotted arrow shown in Fig. 4 is to train the original ELM classifier after the representations of input data x are obtained. The training sample of the ELM is H. The training label of H is the same to x, it’s the target output of our detector. The class label of x could be the corresponding transmitted signal of the received signals x of MIMO system or the class number (corresponds to the transmitted signal) of x, in this paper the class label of x is the later one.
-
(4)
Testing the detector. Feature extractor and classifier are obtained from the training above. Then the proposed detector is ready to extract the features of input samples and classify the features to corresponding classes which can reconstruct the information of the transmitted symbols.
4 Simulation and Performance Evaluation
Computer simulations are conducted to investigate the performance of the proposed ELM-AE detector. Simulations are based on a simplified 4 × 4 point to point BPSK modulated MIMO system. Suppose the channel state H is known.
4.1 Parameter Selection
In this section, some simulations have been conducted to search for the best parameters of the proposed detector.
Figure 5(a) shows the testing result of positive value C and the number of hidden neurons L. In this simulation C and L are set as {10−9, 10−8, …, 100, …, 107, 108, 109}, {10, 20, …, 440, 450} respectively. The ‘TestingRate’ represents the difference value of bit error rate (BER) of ML detector and the proposed detector. The results indicate that the BER performance could be better when C ranges form 10−3 to 101 and L is more than 100, in the next simulations L is set as 120, and C is set as 1/snr in this paper where snr represents signal to noise ratio.
Figure 5(b) shows the BER performance of different number of training data. Based on the results the training samples is set as 2400 × 4 in the next experiments under the premise that the performance is guaranteed.
This paper selects and uses tanh as the activation function of ELM and ELM-AE, as show in Fig. 5(c), arc-tan function tanh outperforms sine function sin, hard limit function hardlim, and sigmoid function sig, while the hidden layer number is 1, hidden neurons L = 120, the positive value C = 1/snr.
4.2 Comparisons Between ELM-AE Detector and Other Detection Methods
In order to verify the good performance of the proposed ELM-AE detector, this section carried out the traditional detection algorithms ML, ZF, MMSE in Fig. 6(a), and several state-of-the-art detection algorithms such as ZF-SIC, MMSE-SIC, QGA-RBF for comparison in Fig. 6(b).
Figure 6 and Table 1 illustrate the bit error rate, the mean detection time and the mean error rates compares to ML detector of these algorithms. It is event that the proposed detector outperforms the detector based on ZF, MMSE, and reaches a similar performance to the optimal detector ML from Fig. 6(a). Figure 6(b) indicates that the performance of the proposed detector outperforms ZF-SIC, MMSE-SIC, and exceeds QGA-RBF detection when SNR is more than 9 dB. The detection time of the proposed algorithm is 0.9402 s which is 20-times the detection speed of ML while detecting the symbols one by one, but the proposed algorithm supports the way of block detection and detection efficiency increase 4.4 times.
In addition, channel estimation is not required in the proposed algorithm while the channel sate is contained in the output connection weights learnt from the unsupervised learning of ELM-AE.
5 Conclusion
In this paper, the Extreme Learning Machine based auto encoder is applied to MIMO system for signal detection. The signal detection scheme for MIMO system based on ELM-AE is proposed, channel estimation is embedded in learnt connection weights of ELM-AE, and the simulations of detection performance are made and analyzed, the simulation results show that the proposed scheme not only has better performance than many linear and nonlinear detection schemes, but also reaches much lower complexity than these methods, and its performance is very close to optimal detection algorithm. Massive MIMO system is a wide concerned MIMO system, its channel capacity is much higher than normal MIMO system, while the detection complexity is much higher at the same time, which is a big obstacle for its application. In the future works we will going to apply this detection scheme to massive MIMO system, and searching the possibility of reducing the detection complexity.
References
Telatar, E.: Capacity of multi-antenna Gaussian channels. Trans. Emerg. Telecommun. Technol. 10(6), 585–595 (1999)
Zhu, X., Murch, R.D.: Performance analysis of maximum likelihood detection in a MIMO antenna system. IEEE Trans. Commun. 50(2), 187–191 (2002)
Liu, D.N., Fitz, M.P.: Low complexity affine MMSE detector for iterative detection-decoding MIMO OFDM systems. IEEE Trans. Commun. 56(1), 150–158 (2008)
Xu, J., Tao, X., Zhang, P.: Analytical SER performance bound of M-QAM MIMO system with ZF-SIC receiver. In: IEEE International Conference on Communications, pp. 5103–5107 (2008)
Sarkar, S.: An advanced detection technique in MIMO-PSK wireless communication systems using MMSE-SIC detection over a Rayleigh fading channel. CSI Trans. ICT 3(10), 1–7 (2016)
Li, F., Zhou, M., Li, H.: A novel neural network optimized by quantum genetic algorithm for signal detection in MIMO-OFDM systems. In: IEEE Symposium on Computational Intelligence in Control & Automation, CICA, pp. 170–177. IEEE (2011)
Yang, Y., Hu, F., Jiang, Z.: Signal detection of MIMO system based on quantum ant colony algorithm. In: IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC, pp. 1–5. IEEE (2016)
Huang, G.B., Li, M.B., Chen, L., Siew, C.K.: Incremental extreme learning machine with fully complex hidden nodes. Neurocomputing 71(4–6), 576–583 (2008)
Huang, G.B.: An insight into extreme learning machines: random neurons, random features and kernels. Cogn. Comput. 6(3), 376–390 (2014)
Kasun, L.L.C., Zhou, H., Huang, G.B., Chi, M.V.: Representational learning with ELMs for Big Data. IEEE Intell. Syst. 28(6), 31–34 (2013)
Liu, L., Lofgren, J., Nilsson, P.: Low-complexity likelihood information generation for spatial-multiplexing MIMO signal detection. IEEE Trans. Veh. Technol. 61(2), 607–617 (2012)
Rusek, F., Persson, D., Lau, B.K., Larsson, E.G., Marzetta, T.L., Edfors, O., Tufvesson, F.: Scaling up MIMO: opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 30(1), 40–60 (2012)
Huang, G.B., Chen, L., Siew, C.K.: Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans. Neural Networks 17(4), 879–892 (2006)
Huang, G.B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B 42(2), 513–529 (2012)
Tang, J., Deng, C., Huang, G.B.: Extreme learning machine for multilayer perceptron. IEEE Trans. Neural Netw. Learn. Syst. 27(4), 809 (2016)
Hoerl, A.E., Kennard, R.W.: Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12(1), 55–67 (1970)
Fei, W., Ye, X., Sun, Z., Huang, Y., Zhang, X., Shang, S.: Research on speech emotion recognition based on deep auto-encoder. In: IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems, CYBER (2016)
Masci, J., Meier, U., Cireşan, D., Schmidhuber, J.: Stacked convolutional auto-encoders for hierarchical feature extraction. In: Honkela, T., Duch, W., Girolami, M., Kaski, S. (eds.) ICANN 2011. LNCS, vol. 6791, pp. 52–59. Springer, Heidelberg (2011). doi:10.1007/978-3-642-21735-7_7
Acknowledgments
The authors would like to thank the anonymous reviewers for their constructive and insightful comments for further improving the quality of this work. The research work was partially supported by the National Natural Science Foundation of China under Grant (61263005, 61563009), New Century Talents Project of the Ministry of Education under Grant No. NCET-12-0657.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Long, F., Yan, X. (2017). ELM-Based Signal Detection Scheme of MIMO System Using Auto Encoder. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science(), vol 10639. Springer, Cham. https://doi.org/10.1007/978-3-319-70136-3_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-70136-3_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70135-6
Online ISBN: 978-3-319-70136-3
eBook Packages: Computer ScienceComputer Science (R0)