Keywords

1 Introduction

The development of future wireless communication systems, e.g., the fifth generation (5G) or beyond, continuously demands higher data rates and larger user capacities, which faces significant challenges. It requires not only wideband transceiver architecture, but also higher-order modulation schemes. The signals of these systems characterized by non-constant envelopes and high peak-to-average power ratio (PAPR), leading to stringent linearity requirements for signal amplification. In the meantime, the power dissipation of the future communication systems must be remained as low as possible [1]. To cope with these challenges, high efficiency and linear radio frequency (RF) power amplifiers (PAs) are indispensable components. Unfortunately, due to the inherent nonlinear behavior of PAs, efficiency and linearity requirements often conflict each other. In order to provide highly-efficient power conversion, PAs should be driven into the saturation region. However, the saturated PAs produce not only in-band distortion but also result in spectral regrowth that interferes the adjacent frequency band channels. Consequently, the spectra utilization efficiency is reduced. In contrast, the nonlinear distortion can be mitigated by a traditional back-off approach, but this generates low power efficiency due to high PAPR of the transmitted signals. In order to maintain a low level of distortion without sacrificing the system energy efficiency requirement, PA linearization techniques are often used [2]. Thanks to its flexibility and excellent linearization performance, baseband digital predistortion (DPD) has been recognized as one of the most cost-effective linearization techniques [3,4,5,6,7,8,9], and it also tends to be popularly and widely used in wireless transmitters for the next generation wireless communication systems. In this scheme, a predistorter (PD) block is placed in front of a PA. The PA input signal is pre-distorted by the PD whose transfer function is the inverse of that of the PA. Ideally, the cascade of the PD and PA behaves as a linear amplification system and the original input is amplified by a constant gain.

In practice, the PA characteristics change with time due to process, supply voltage, and temperature (PVT) variations. In order to track time-varying change in the PA characteristics, an adaptive DPD using cost-effective learning architectures has become one of the most preferred choices. There are two commonly and widely used learning architectures for PD parameter identification: indirect learning architecture (ILA) [10,11,12] and direct learning architecture (DLA) [8, 9, 13, 14]. Although DLA is more robust than ILA in terms of noise at the PA output and can provide unbiased parameter estimates, it is more complex identification process since the adaptive algorithms used in DLA require many iterations to find a set of parameters that minimizes the optimization criterion [3]. For these reasons, the adaptive ILA is most often used for identifying the PD parameters in RF PAs [3]. The adaptive ILA using least mean squares (LMS) for linearizing PAs was developed in [15]. The advantage of LMS is its simple implementation. However, it provides inaccurate estimation and has slow convergence since increasing the step size parameter leads instability problems. Moreover, it is also sensitive to the scaling of the input signal, making it very hard to choose a proper step size [15]. In order to obtain faster convergence of the adaptation, authors in [10, 12] proposed the adaptive ILA using recursive least squares (RLS). It is worth noting that the choice of forgetting factor \(\lambda \) is often essential to make a good trade-off between the convergence and accuracy. For RLS, a decrease in the forgetting factor \(\lambda \) leads to its sensitivity to noise and a larger fluctuation of parameter estimates [16], resulting in inefficiency linearization performance.

In this paper, we propose an adaptive ILA using recursive prediction error minimization (RPEM) algorithm to linearize PAs, which allows time-varying forgetting factor \(\lambda \). Thus, the RPEM algorithm reduces the fluctuation of the PD parameter estimates, speeds up the convergence, mitigates the steady-state mean square error and hence minimizes the total nonlinear distortion at the PA output. As a result, the adaptive ILA with RPEM effectively compensate the nonlinear distortion of the PA even if the PA characteristics changes due to PVT drift and other factors such as type of signals, high-order modulation schemes, input power levels etc. The rest of the paper is organized as follows. Section 2 proposes the one using RPEM. Simulation results are presented in Sect. 3. Conclusions are finally drawn in Sect. 4.

Fig. 1.
figure 1

Block diagram of Indirect learning architecture (ILA) using the proposed RPEM adaptive algorithm.

2 Proposed Adaptive ILA Using RPEM for Linearizing RF Power Amplifiers

Figure 1 shows the block diagram of the ILA-based DPD technique, where a post-distorter (or training) block is used to identify the postinverse of the PA. The baseband signal u(n) is fed to the predistorter, which generates a signal x(n) that is a PA input. The PA output signal is normalized by a linear gain \(G_0\), producing the normalized output z(n), i.e., \(z(n)=\frac{y(n)}{G_0}\). The postdistorter model has the input z(n) and the output \(z_\mathrm{p}(n)\). Its parameters are identified by minimizing the error signal \(e(n) = x(n) - z_\mathrm{p}(n)\) using the adaptive algorithms. Note that both the PD and postdistorter models are identical. Thus, when the coefficients of the postdistorter are identified, they are directly copied to the PD model. This process is repeated iteratively until the ILA linearization has converged. At convergence, the cascaded PD and PA system behaves linearly. Since the MP models have owned low computational cost, satisfactory accuracy, and easy hardware implementation, they have become promising choices and been widely applied for behavioral modeling and predistortion of PAs exhibiting nonlinear memory effects [2, 4, 12, 17]. Therefore, both the PD and postdistorter are modeled by the same MP model that has Q as the nonlinearity order and P as memory depth, and \({\omega _{km}}\) as coefficients. The input and output relation of the PD model is given by

$$\begin{aligned} x(n) = \sum \limits _{k = 1}^Q {\sum \limits _{m = 0}^P {{\omega _{km}}u(n - m){{\left| {u(n - m)} \right| }^{k - 1}} = {\omega ^\mathrm{{T}}}\phi } } (n), \end{aligned}$$
(1)

where

$$\begin{aligned} \omega = {\left[ {{\omega _{10}}, \ldots ,{\omega _{Q0}}, \ldots ,{\omega _{1P}}, \ldots ,{\omega _{QP}}} \right] ^\mathrm{{T}}}, \end{aligned}$$
(2)

and

$$\begin{aligned} \phi (n) = {\left[ {{\phi _{10}}(n), \ldots ,{\phi _{Q0}}(n), \ldots ,{\phi _{1P}}(n), \ldots ,{\phi _{QP}}(n)} \right] ^\mathrm{{T}}} \end{aligned}$$
(3)

with

$$\begin{aligned} {\phi _{km}}(n) = u(n - m){\left| {u(n - m)} \right| ^{k - 1}}. \end{aligned}$$
(4)

The symbol \(\mathrm {T}\) indicates the matrix transpose.

The input and output of the postdistorter model can be expressed by

$$\begin{aligned} z_\mathrm{p}(n) = \sum \limits _{k = 1}^Q {\sum \limits _{m = 0}^P {{\omega _{km}}z(n - m){{\left| {z(n - m)} \right| }^{k - 1}}} } = {\omega ^\mathrm{{T}}}{} \mathbf{{z}}(n), \end{aligned}$$
(5)

where \(\omega \) is defined as in (2) and

$$\begin{aligned} \mathbf{{z}}(n) = {\left[ {{z_{10}}(n), \ldots ,{z_{Q0}}(n), \ldots ,{z_{1P}}(n), \ldots ,{z_{QP}}(n)} \right] ^\mathrm{{T}}} \end{aligned}$$
(6)

with

$$\begin{aligned} {z_{km}}(n) = z(n - m){\left| {z(n - m)} \right| ^{k - 1}}. \end{aligned}$$
(7)

The prediction error \(e\left( {n,\omega }\right) \) is defined by

$$\begin{aligned} e\left( {n,\omega } \right) = x(n) - z_\mathrm{p}(n) = x(n) - {\omega ^\mathrm{{T}}}{} \mathbf{{z}}(n). \end{aligned}$$
(8)

The adaptive algorithms are derived by minimizing corresponding lost functions that refer to scalar-valued functions of all the prediction errors \(e\left( {n,\omega } \right) \).

The coefficient vector \({{\omega }}\) of the predistorter is estimated by using the Gauss-Newton RPEM algorithm in [16] that minimizes the following cost function.

$$\begin{aligned} {f_L}\left( \omega \right) = \mathop {\lim }\limits _{L \rightarrow \infty } \frac{1}{L}\sum \limits _{l = 1}^L {\mathrm{{E}}\left\{ {{e^2}\left( {l,\omega } \right) } \right\} }, \end{aligned}$$
(9)

where \(e(l,\omega )\) is given as in (8).

The formulation of the RPEM algorithm is derived in [16], which requires the negative gradient of \(e\left( {l,\omega } \right) \) with respect to \(\omega \). From (8), the negative gradient is given by

$$\begin{aligned} -\frac{{\partial e\left( {n,\omega } \right) }}{{\partial \omega }} = {\mathbf{{z}}^\mathrm{{T}}}(n). \end{aligned}$$
(10)

When applying the RPEM algorithm [16] for PA linearization, the adaptive ILA-based DPD using RPEM algorithm is described in Algorithm 1, where \(\rho \) also is a positive constant and \(\lambda (n)\) is a forgetting factor that tends exponentially to 1 as \(n \rightarrow \infty \). \({\lambda _0}\), \({\lambda (0)}\) and \(\mathbf{{P}}(0)\) are initial variables designed by users.

figure a

Typically chosen values for \({\lambda _0}\) and \({\lambda (0)}\) are \({\lambda _0=0.99}\) and \({\lambda (0)=0.95}\) [16].

It is crucial that the evaluation criteria should be adopted to clearly validate the performance of PA behavioral modeling and DPDs. Therefore, this part defines the figures of merit for performance evaluation. The most commonly used criteria are normalized mean square error (NMSE) in time domain, adjacent channel power ratio (ACPR) in frequency domain, and error vector magnitude (EVM) that are defined as in [3, 18].

Firstly, NMSE is an estimator of the overall difference between the predicted and measured signals in time domain. It is often defined in decibels as

$$\begin{aligned} \mathrm{{NMSE\, = \,}}10{\log _{10}}\left( {\frac{{\sum \limits _{n = 1}^N {{{\left( {\left| {y[n] - x[n]} \right| } \right) }^2}} }}{{\sum \limits _{n = 1}^N {{{\left( {\left| {x[n]} \right| } \right) }^2}} }}} \right) , \end{aligned}$$
(11)

where x(n) is the experimental output (or desired output) of the DUT, and y(n) is the output obtained from the model.

Moreover, ACPR is the ratio between the total adjacent channels’ powers to the main channel signal power. It describes the degree of the signal regrowth into neighbouring channels. Since the ACPR characterizes the maximum power allowed to be radiated outside the allocated band, it plays a very important role in wireless radio standards. The ACPR is often expressed in decibels as

$$\begin{aligned} \mathrm{{ACPR\, =\, }}10{\log _{10}}\left( {\frac{{\int _{{\mathrm{{B}}_{\mathrm{{adj}}}}} {{{\left| {Y(f)} \right| }^2}} }}{{\int _{{\mathrm{{B}}_{\mathrm{{ch}}}}} {{{\left| {Y(f)} \right| }^2}} }}} \right) \end{aligned}$$
(12)

where \({\left| {Y(f)} \right| }\) denotes the power spectrum of the measured output signal y(n), \({{\mathrm{{B}}_{\mathrm{{adj}}}}}\) and \({{\mathrm{{B}}_{\mathrm{{ch}}}}}\) refer to the bandwidth of the adjacent and main channels, respectively.

The EVM is a measure criterion that quantifies the imperfection to the output signal when compared to the input one. It describes the in-band distortion of the PA and is defined as

$$\begin{aligned} \mathrm{{EVM = }}\sqrt{\frac{{\sum \limits _{j = 0}^L {\left[ {{{\left( {{I_j} - {{\hat{I}}_j}} \right) }^2} + {{\left( {{Q_j} - {{\hat{Q}}_j}} \right) }^2}} \right] } }}{{\sum \limits _{j = 0}^L {\left[ {I_j^2 + Q_j^2} \right] } }}} \end{aligned}$$
(13)

where \(I_j\) and \(Q_j\) are the ideal output signal in-phase and quadrature components, and \({\hat{I}}_j\) and \({\hat{Q}}_j\) are their output measured counterparts, respectively.

3 Simulation Results

In order to demonstrate the proposed DPD linearization method, we tested a simulated PA that is modeled by a Wiener model consisting of a FIR filter followed by memoryless nonlinearity model. The coefficients of the FIR filter are as in [19,20,21]

$$\begin{aligned} {h_0} = 0.7692,{h_1} = 0.1538,{h_2} = 0.0769. \end{aligned}$$
(14)

For the memoryless nonlinearity model, we use Saleh’s model [22], which is defined by

$$\begin{aligned} y(n) = \frac{{{\alpha _a}\left| {v(n)} \right| }}{{1 + {\beta _a}{{\left| {v(n)} \right| }^2}}}{e^{j\angle \left[ {v(n) + \frac{{{\alpha _\varphi }{{\left| {v(n)} \right| }^2}}}{{1 + {\beta _\varphi }{{\left| {v(n)} \right| }^2}}}} \right] }}, \end{aligned}$$
(15)

with

$$\begin{aligned} v(n) = {h_0}x(n) + {h_1}x(n - 1) + {h_2}x(n - 2), \end{aligned}$$
(16)

where x(n) and y(n) are the input and output of the simulated PA, respectively, and v(n) is the input of Saleh model. The parameters of Saleh model are as in [19]

$$\begin{aligned} {\alpha _a} = 20,{\beta _a} = 2.2,{\alpha _\varphi } = 2,{\beta _\varphi } = 1. \end{aligned}$$
(17)

The transmitted symbols are modulated by 16-QAM with \(3.84~\mathrm{MHz}\) bandwidth. The input modulated signal is filtered by a raised cosine pulse shaping filter with the roll-off factor of 0.22.

Fig. 2.
figure 2

The PA characteristics. (a) AM/AM. (b) AM/PM.

Fig. 3.
figure 3

Gain versus average input power of simulated PA.

Fig. 4.
figure 4

NMSE versus N and M.

Fig. 5.
figure 5

Power spectral density (PSD) of the PA output before and after DPD using various adaptive algorithms with the input power of \(-4\,\mathrm{dBm}\).

Fig. 6.
figure 6

Learning curves for adaptive RPEM algorithms.

Fig. 7.
figure 7

ACPR and EVM performance after the proposed DPD using adaptive RPEM algorithm. (a) ACPR. (b) EVM.

The AM/AM and AM/PM characteristics computed at the instantaneous samples of the PA input and output, are shown in Fig. 2. It is clear that the simulated PA suffers from the nonlinearity and memory effects. Figure 3 shows the gain performance of the simulated PA with the average input power. One can observe that the gain in linear region is about \(26\,\mathrm{dB}\). The average input power at \(1\,\mathrm{dB}\) compression point and at \(3\,\mathrm{~dB}\) are around \(-1\,\mathrm{dBm}\) and \(4.3\,\mathrm{dBm}\), respectively.

The MP model is used to model nonlinear behavior of the PA. In order to reduce the computational complexity, the orders (N and M) of the MP model are optimized by using a performance-based sweeping method [17]. Figure 4 shows the NMSE performance versus the orders of the PA model. From this figure, we can see that the optimal values of N and M are \(N=5\) and \(M=2\), respectively, in order to achieve a good trade-off between the best \(\mathrm NMSE\) and computational complexity.

In order to validate the proposed DPD, the RPEM algorithm is initialized when \({\lambda _0=0.99}\), \({\lambda (0)=0.95}\) and the initial weight vectors \(\omega (0)\) have a first element as 1 and the others as 0. In this simulation, the ACPR values are measured at the upper adjacent channels, corresponding to frequency offsets of \(5\,\mathrm{MHz}\).

Figure 5 shows effectiveness in canceling the spectral regrowth of the proposed approach. for the input power of \(-4\,\mathrm{dBm}\). It can be seen that there is a significant spectral regrowth reduction after DPD. The adaptive RPEM algorithm converges after 10-K samples, as shown in Fig. 6.

Figures 7(a) and (b) respectively show the ACPR and EVM performance of the proposed DPD for various input power levels. From these figures, one can observe that the proposed DPD technique shows a significant performance improvement in terms of ACPR and EVM. It obtains the ACPR values almost equal to those of input. Furthermore, after applying RPEM-ILA, the EVM values are significantly reduced and less than \(0.26\%\), which shows excellent performance in in-band distortion mitigation. This is because the RPEM algorithm makes the PD coefficient estimates more consistent and precise in steady state. The simulation results have clarified the improvements of the proposed technique compared with LMS [15] and RLS-based [10, 12] ILA methods.

4 Conclusions

In this paper, an adaptive ILA linearization using the RPEM algorithm has been proposed. Thanks to the time-varying forgetting factor, the PD coefficient estimates are consistent and accurate in steady state, leading to speed up the convergence, reduce the NMSE, and minimize the total nonlinear distortion at the PA output. The simulation results confirm that the nonlinear distortion of the PA operated under different conditions (for example, the different input powers), can be almost fully compensated by employing the adaptive ILA with RPEM. In other words, the proposed DPD technique effectively linearizes the PA even if its characteristics change. So, this approach provides a very promising solution for future wireless communication system where the PA characteristics change due to the type of signal, high-order modulation, working condition, etc. The future works will target the optimization, hardware implementation and more detail analysis results of the proposed linearization technique.