## 17.5 An Intrinsically Linear Wideband Digital Polar PA Featuring AM-AM and AM-PM Corrections Through Nonlinear Sizing, Overdrive-Voltage Control, and Multiphase RF Clocking

Mohsen Hashemi<sup>1</sup>, Yiyu Shen<sup>1</sup>, Mohammadreza Mehrpoo<sup>1</sup>, Mustafa Acar<sup>2</sup>, René van Leuken<sup>1</sup>, Morteza S. Alavi<sup>1</sup>, Leonardus de Vreede<sup>1</sup>

<sup>1</sup>Delft University of Technology, Delft, The Netherlands <sup>2</sup>Ampleon, Nijmegen, The Netherlands

To fully benefit from the progress of CMOS technologies, it is desirable to completely digitize the TX, replacing its final stage with a digitally controlled PA (DPA). The DPA consists of arrays of small sub-PAs that are digitally controlled to modulate the output amplitude, thus operating as an RF-DAC [1-6]. DPAs are normally designed in a switched mode (Classes E/D/D<sup>-1</sup>, etc.) to achieve high efficiency while using high sampling rate to attenuate and push the spectral images to higher frequencies. However, they suffer from high nonlinearity in their AM-code-word (ACW) to AM and ACW-to-PM conversion. To correct for such nonlinearities, digital pre-distortion (DPD) of the input signal is often used [1-3], typically implemented by look-up tables (LUT). Unfortunately, DPD approaches suffer from large signal-BW expansion due to their inherently nonlinear characteristics. This, combined with the already present BW regrowth in a polar TX in the AM and PM paths, yields significant hardware-speed/power constraints when the signal BW becomes large. For a Cartesian TX, the use of LUT-DPD is even more complicated since a full 2D LUT is typically required [2]. To relax the overall system complexity, it is highly desirable to have a PA with a maximum inherent linearity without compromising its power or efficiency. In this work, an ACW-AM correction based on nonlinear sizing along with controlling the peak voltage of RF clocks (overdrive voltage tuning) and a ACW-PM correction based on multiphase RF clocking are introduced to linearize the characteristic curves of a Class-E polar DPA with intent to avoid any kind of pre-distortion.

Figure 17.5.1 depicts this concept featuring 9b amplitude modulation. The DPA consists of 8 differential sub-PA segments. These segments are nonlinearly sized to linearize the ACW-AM. The PVT/load impedance variations are compensated by digitally tuning the supply voltage of PA buffers with an on-chip programmable LDO. The input PM RF clock is amplified and fed to the multiphase RF-clocking circuit, which generates 5 differential RF clocks with different phase offsets. The output clocks are applied simultaneously to different sub-PA segments to flatten the ACW-PM curve. Explicit capacitive tuning between the DPA differential outputs is used to enhance the efficiency at full power against clock skewing and duty-cycle variation. A passive mesh structure equalizes the delays for all RF clock lines from the multiphase RF-clocking circuit to the DPA.  $P_{OuT}$ -aware clock gating is used to reduce driver power in back-off. Coarse time alignment between the amplitude and phase information is performed in the digital domain; fine adjustments are handled by a 4b delay line in the path of the ACW sampling clock.

Figure 17.5.2 shows the ACW-AM linearization concept (top) and its simplified circuit implementation (bottom). In a Class-E (or D<sup>-1</sup>) DPA, which uses transistor on-resistance ( $R_{ON}$ ) for AM control [1], the ACW-AM curve is a strongly nonlinear function of  $R_{ON}$ -to- $R_{Load}$  ratio. Conventionally, the total size of the switched-on devices is a linear function of ACW [1-6]. By nonlinearly sizing the sub-PA segments based on the inverse of the ACW-AM curve, the overall ACW-AM curve of the DPA is piecewise linearized (Fig. 17.5.2 top). Since the transistor  $R_{ON}$  is a function of the overdrive voltage ( $V_{OD}$ ) too, it is used for compensating the PVT/load variations with insignificant impact on the peak drain efficiency (DE) and  $P_{SAT}$ . A 6b programmable on-chip LDO controls the  $V_{OD}$  for the whole DPA by controlling the  $V_{DD}$  of RF-clock buffers. It has a resolution of 9 to ~10mV with an output range of 0.6 to 1.2V. Once programmed for optimum linearity, it is fixed during the normal DPA operation.

Figure 17.5.3 shows the ACW-PM correction based on multiphase RF clocking. It consists of a bank of 5 static phase shifters implemented by programmable delay lines. For a nonlinearly sized Class-E DPA with the conventional single-phase RF clocking, the output phase decreases with increasing ACW (Fig. 17.5.3 top). In time domain, this translates to an increment in the delay at the output. In order to reduce the phase error, the RF clocks of smaller sub-PA segments are applied

with larger fixed delays (larger phase offsets) and the larger segments are fed with smaller fixed delays (smaller phase offsets). In this technique, in contrast to a conventional DPA with a DPD approach, no dynamic phase correction is needed for each ACW code. For example, at 6dB back-off power, the sub-PA segments 1-2, 3-4 are fed simultaneously with phase offsets  $\Delta \varphi_1 > \Delta \varphi_2$ , respectively, while other segments are turned off and at full power, the sub-PA segments 1-2, 3-4, 5-6, 7, 8 are fed simultaneously with phase offsets  $\Delta \varphi_1 > \Delta \varphi_2 > \Delta \varphi_3 > \Delta \varphi_4 > \Delta \varphi_5$ , respectively. The output currents of these individual sub-PA segments are summed, and the overall output phase is inherently averaged, allowing a controllable and considerable reduction of phase error in the ACW-PM curve. The phase-offset of each clock is digitally controllable over a range of 80° with a resolution of 5°, which is more than enough to cover errors caused by PVT/load changes.

The circuit is prototyped in 40nm bulk CMOS. The core area is 1mm×0.45mm (Fig. 17.5.7). The raw ACW is applied to the DPA using an on-chip 4K SRAM running at 625MHz. The input RF signal is phase modulated off-chip. All measurements are done without applying any kind of DPD. Measured ACW-AM and ACW-PM for a digital input ramp are shown in Fig. 17.5.4(top). The effectiveness of PVT/load compensation for ACW-AM is measured and plotted for different control-bit settings. The ACW-PM curves before averaging (for one ramp period) and after averaging (over 256 ramp periods) are shown. The two-tone measurements (Fig. 17.5.4 bottom) with a 1.2MHz bandwidth show an IM3 and IM5 of -60dBc and -50dBc, respectively, at 2 to 2.1 GHz. Peak P<sub>OUT</sub>, DE and PAE are measured and plotted vs. frequency for different PA V<sub>DD</sub> (Fig. 17.5.4 bottom). It can be seen that by increasing the drain voltage, the peak P<sub>OUT</sub> and PAE increase, while the peak DE does not change significantly. However, for reliability concerns, the drain voltage is kept less than 0.6V during the normal operation. With a  $V_{DD}$ =0.6V, the peak P<sub>OUT</sub>, DE and PAE are 16.1dBm, 43.7% and 32%, respectively.

Figure 17.5.5 shows the dynamic performance for QAM modulated signals measured at  $f_c$ =2GHz. ACPR1 is as low as -49 to -40dBc for modulation bandwidths of 1.2 to 40MHz. The measured EVM is -33dB for 16-QAM and -31dB for 64-QAM signals with a 40MHz BW.

Figure 17.5.6 summarizes and compares this work with the state of the art. The nonlinear sizing with  $V_{0D}$  tuning and multiphase RF clocking provide a very high DPD-less RF-DAC linearity (EVM/ACPR) for wideband signals (>=20MHz) without sacrificing output power or efficiency. In fact, its linearity competes with state-of-the-art DPD-less Cartesian DPA [4] while the output power and efficiency performance are superior by 5 to ~6dB and 10 to 12%, respectively.

## Acknowledgment:

The authors acknowledge Atef Akhnoukh from TU Delft and the imec/Europractice IC service team for their unlimited and high quality support, the people of Ampleon and NXP for their encouragement and advices, the projects SEEDCOM (STW) and EAST (Catrene) for the financial support and Masoud Babaie and Earl McCune for their useful suggestions.

## References:

[1] D. Chowdhury, et al., "An Efficient Mixed-Signal 2.4-GHz Polar Power Amplifier in 65-nm CMOS Technology", *IEEE JSSC*, vol. 46, no. 8, pp. 1796-1809, Aug. 2011.

[2] W. Yuan, et al., "A Quadrature Switched Capacitor Power Amplifier", *IEEE JSSC*, vol. 51, no. 5, pp. 1200-1209, May 2016.

[3] J. Park, et al., "A Highly Linear Dual-Band Mixed-Mode Polar Power Amplifier in CMOS with An Ultra-Compact Output Network", *IEEE JSSC*, vol. 51, no. 8, pp. 1756-1170, Aug. 2016.

[4] P. Paro Filho, et al., "A Transmitter with 10b 128MS/S Incremental-Charge-Based DAC Achieving –155dBc/Hz Out-of-Band Noise", *ISSCC*, pp. 164–165, Feb. 2015.

[5] S. Zheng, et al., "A WCDMA/WLAN Digital Polar Transmitter with Low-Noise ADPLL, Wideband PM/AM Modulator, and Linearized PA", *IEEE JSSC*, vol. 50, no. 7, pp. 1645-1656, July 2015.

[6] A. Ba, et al., "A 1.3nJ/b IEEE 802.11ah Fully Digital Polar Transmitter for IOE Applications", *ISSCC*, pp. 440-441, Feb. 2016.



| Figure 17.5.7: Die micrograph. |  |
|--------------------------------|--|
|                                |  |
|                                |  |