

Received 21 December 2022; revised 22 February 2023; accepted 8 March 2023. Date of publication 14 March 2023; date of current version 24 March 2023. Digital Object Identifier 10.1109/OJCAS.2023.3256368

# An Ultra-Low-Voltage Single-Crystal Oscillator-Timer (XO-Timer) Delivering 16-MHz and 32.258-kHz Clocks for Sub-0.5-V Energy-Harvesting BLE Radios in 28-nm CMOS

LIWEN LIN<sup>1,2</sup> (Graduate Student Member, IEEE), KA-MENG LEI<sup>(1),2</sup> (Senior Member, IEEE), PUI-IN MAK<sup>(1),2</sup> (Fellow, IEEE), AND RUI P. MARTINS<sup>(1),2,3</sup> (Fellow, IEEE)

<sup>1</sup> State Key Laboratory of Analog and Mixed-Signal VLSI/Institute of Microelectronics, University of Macau, Macau, China

<sup>2</sup>Department of ECE/ Faculty of Science and Technology, University of Macau, Macau, China

<sup>3</sup>Instituto Superior Técnico, Universidade de Lisboa, 1649-004 Lisbon, Portugal

This article was recommended by Guest Editor M. Johnston.

CORRESPONDING AUTHORS: K.-M. LEI AND P.-I. MAK (e-mail: kamenglei@um.edu.mo; pimak@um.edu.mo)

This work was supported in part by the Science and Technology Development Fund, Macau, SAR, under Grant 0043/2020/A1 and Grant SKL-AMSV(UM)-2023-2025, and in part by the University of Macau under Grant MYRG2022-00034-IME.

**ABSTRACT** This paper reports an ultra-low-voltage (ULV) single-crystal oscillator-timer (XO-Timer) for sub-0.5 V Bluetooth low-energy (BLE) radios that aims for self-powering by harvesting the ambient energies. Specifically, we tailor an on-chip micropower manager ( $\mu$ PM) to customize the voltage and current budgets for each sub-function of the XO-Timer. Such  $\mu$ PM shows a high power efficiency by introducing a 3-stage cascaded structure and a single voltage-regulation loop; they together uphold the performance of the XO-Timer amid supply-voltage and temperature variations. The core amplifier of the XO-Timer is ULV-enabled, and is reconfigurable (i.e., 1-stage and 3-stage g<sub>m</sub>) to balance between the power budget and performance under the high-performance mode (HPM) and low-power mode (LPM). Fabricated in 28-nm CMOS, the XO-Timer in HPM generates a 16-MHz clock with a power of 24.3  $\mu$ W, and a phase noise of -133.8 dBc/Hz at 1-kHz offset, resulting in a Figure-of-Merit (FoM<sub>1</sub>) of -236 dBc/Hz. In the LPM, the XO-Timer delivers a 32.258-kHz clock while consuming 11.4  $\mu$ W. The sleep-timer FoM<sub>2</sub> is 14.8  $\mu$ W and the Allan deviation is 35.1 ppb, achieving the lowest supply voltage (0.25 V) not only for a dual-mode XO-Timer but also for a MHz-range XO.

**INDEX TERMS** Bluetooth low-energy (BLE), clock, CMOS, crystal oscillator-timer (XO-Timer), energy harvesting, Internet-of-Things (IoT), micropower manager ( $\mu$ PM), ultra-low-power (ULP), ultra-low-voltage (ULV).

### I. INTRODUCTION

INTERNET-OF-THINGS (IoT) has the potential to connect the world in a pervasive way and improve different industrial sectors such as logistics, healthcare, and smart cities. Still, the underlying challenges hindering its wide dissemination are the cost, power, and size limitations of the hardware. Batteries dominate the size and operating cost of the IoT networks as their replacement is highly labor-intensive, and their disposal incurs a high environmental cost [1]. An energy harvester (EH) that extracts the ambient energy is promising to prolong the lifetime of the batteries, or even replace them completely. Yet, the EH output voltage is deeply low (~50 to 300 mV) and inexact [2], being highly difficult to comply with the standard analog and RF circuitries. The micropower manager ( $\mu$ PM) is a viable solution to customize and stabilize the EH output voltage ( $V_{\rm EH}$ ) [3], [4]. It tailors different voltages and loading currents using separated charge pumps (CPs) and multiple voltage-regulation loops for the circuitries. Nevertheless, their power and area overheads are still high.

Besides the battery, another factor limiting the footprint of the IoT nodes is the crystal; typically there are two. The first generates a MHz-range reference clock for the RF phase-locked loop (PLL), where the aimed precision is  $\pm 50$  ppm for the Bluetooth-Low-Energy (BLE) standard. The second is a kHz-range timer for synchronizing the IoT node in sleep mode. In light of the rapid downscaling of the CMOS technologies, the crystal's size (e.g.,  $3.2 \times 2.5 \text{ mm}^2$ ) is comparable to the system chip, inevitably increasing the device's footprint and bill of material (BoM) cost. Although the integrated RC timers [5], [6], [7], [8] potentially eliminate the off-chip crystals, a µW-level timer usually has frequency stability >500 ppm. For instance, the frequency variations of the RC timer in [7] against voltage and temperature variation are 2.3% and 2.2%, which are much poorer than that of the crystal oscillators (XOs). Thus, there was ambition to implement a dual-mode XO that offers two clocks with one MHz-range crystal [9], [10], [11], [12]. Yet, their supply voltages are >0.9 V, requiring DC-DC converters to boost the voltages from the EH's output and penalizing the overall efficiency (e.g., 40 to 75% in [13]). In [9], the programmable loading capacitors  $(C_L)$ and frequency divider (DIV) enable the generation of a kHz-range timer by frequency division. Due to the fixed Pierce oscillator in both modes, the power is still sub-optimal (37  $\mu$ W), limiting the effectiveness of duty-cycling the IoT radio. Recently, a single-XO-based clock-management IC was proposed [11]. It utilized a 76.8-MHz XO to provide the MHz-range signal and calibrate the machine-learning-based RC-oscillator to provide the 32.768-kHz clock. Yet, the RCoscillator's high power consumption (48.2  $\mu$ W) still renders it ill-suitable as a sleep timer. Hence, this project aims to develop a dual-mode reconfigurable single-crystal oscillatortimer (XO-Timer) directly powered by  $V_{\rm EH}$  (0.25 to 0.3 V, 20% variation); it avoids the need for extra regulators that otherwise worsen the power efficiency, and balances the performance and power consumption for different scenarios.

This paper reports an ultra-low-voltage (ULV) XO-Timer suitable for sub-0.5 V BLE radios that target batteryless operation [14]. To supply the modules that require a higher voltage budget but with low current consumption, such as the DIV for the timer, bias circuit for the core amplifier  $(A_{\rm XT})$ , and the mode control logic, we utilize a  $\mu PM$  to boost the voltage from the EH (0.25 V), and distribute multiple voltages to the modules accordingly. Unlike the traditional  $\mu$ PM [3], [4] that equip with separated CPs and multiple regulation loops, we co-design the XO-Timer and  $\mu$ PM with a cascaded 3-stage charge pump (CP), where its stage-to-stage voltages ( $V_{\rm EH} < V_{\rm O,1} < V_{\rm O,2} < V_{\rm O,3}$ ) and currents ( $I_{\rm EH} > I_{\rm O,1} > I_{\rm O,2} > I_{\rm O,3}$ ) are customizable for each module. As a result, it is more power-efficient to support different sub-functions according to their desired voltage-current budgets, avoiding excessive voltage headroom. Additionally, with a single voltage-regulation loop, the proposed  $\mu$ PM shields all internal voltages against the supply voltage  $V_{\rm EH}$  (0.25 to 0.3 V) and temperature (-20 to 70 °C) variation with a targeted output ripple (5 mV at  $V_{\rm O,1}$ ), while the output ripple is scaled with  $V_{\rm EH}$  in [3], [4]. Sharing a single 16-MHz crystal, our XO-Timer is reconfigurable to generate a 16-MHz reference clock for the radio's PLL by using a 1-stage  $g_{\rm m}$ , and a 32.258-kHz clock as the sleep-timer that governs the radio's wake-up accuracy by a power-efficient 3-stage  $g_{\rm m}$ . It attains a lower power consumption (24.3  $\mu$ W for the 16-MHz clock and 11.4  $\mu$ W for the timer) than the prior works with a state-of-the-art performance under a 0.25-V supply.

This paper has the following organization. Section II gives a system overview of the XO-Timer. Section III details the  $\mu$ PM, and its advantages over the previous topology. Section IV focuses on the design and analysis of the core amplifier of the XO-Timer. Section V discusses the measurement results, and finally, Section VI concludes the article and presents its key contributions.

#### **II. SYSTEM OVERVIEW**

Fig. 1 depicts the block schematic of our ULV XO-Timer. It features a fully on-chip  $\mu$ PM to interface with the EH. The  $\mu$ PM has two key functions: customize the internal voltage and current budgets for each sub-function, and regulate the internal voltages ( $V_{0,1}$ ,  $V_{0,2}$ ,  $V_{0,3}$ ) and bias ( $V_{\text{bias}}$ ) against the inexact  $V_{\text{EH}}$  and temperature.

The stage voltages (V<sub>0,1</sub>, V<sub>0,2</sub>, V<sub>0,3</sub>) serve different purposes:  $V_{0,1}$  powers up the DIV and part of the reconfigurable  $A_{\rm XT}$ . The 2<sup>nd</sup> stage of the CP,  $V_{\rm O,2}$ , powers the static subfunctions (e.g., reference generator and bias circuit) that demand less current but higher voltage headroom. The final stage,  $V_{0,3}$ , powers up the error amplifier ( $A_E$ ) and the modecontrol logic. The  $A_{\rm E}$  and the reference generator form a global voltage-regulation loop to regulate the output voltages of the  $\mu$ PM against voltage and temperature variations. The most power-hungry sub-functions are the bootstrapped ring oscillator (BTRO) and reconfigurable core amplifier  $(A_{XT})$ .  $V_{\rm EH}$  directly powers them to alleviate the power loss and cost of any interim converters or regulators. The 6-phase nonoverlap clocks produced by the BTRO benefit the cascaded 3-stage CP to have smaller output ripples and eliminate the short-circuit current loss.

Associated with the  $\mu$ PM, the proposed reconfigurable  $A_{\rm XT}$  is ULV-designed to operate with  $V_{\rm DD}$  down to 0.25 V. The XO-Timer generates a 16-MHz clock in the highperformance mode (HPM) for the circuits that require better spectral purity, such as the PLL and analog-to-digital converter (ADC). The 1-stage  $g_{\rm m}$  dominates the power in the HPM and allows a clock signal with low phase noise (PN). The always-on timer is available in both HPM and low-power mode (LPM) for timekeeping. The DIV divides the 16-MHz clock by 496 to obtain a 32.258-kHz clock. Further, in the LPM, we adopt a 3-stage  $g_{\rm m}$ , plus  $C_{\rm L}$  reduction to powerefficiently sustain the oscillation on the crystal. Hence, our



FIGURE 1. Proposed ULV XO-Timer delivering a 16-MHz clock and a 32.258-kHz timer from a single 16-MHz crystal. The on-chip  $\mu$ PM aids in tolerating  $V_{EH}$  variation and customizing the internal voltages and currents. An ULV-enabled reconfigurable  $A_{XT}$  balances the power and performance in the HPM and LPM.

reconfigurable  $A_{\rm XT}$  can provide the clock for the PLL and the always-on timer from a single 16-MHz crystal under an inexact ULV  $V_{\rm EH}$  and temperature, while balancing the power budget with the performance.

#### III. MICROPOWER MANAGER (µPM)

The  $\mu$ PM (Fig. 2) boosts the low-and-unstable  $V_{\rm EH}$  to highand-regulated voltage levels for different sub-functions. It aims to cover  $V_{\rm EH}$  of 0.25 to 0.3 V (20% variation) and a temperature range of -20 to 70 °C. The nominal output voltages are  $V_{\rm O,1} = 0.37$  V,  $V_{\rm O,2} = 0.7$  V, and  $V_{\rm O,3} = 1$  V, under  $V_{\rm EH} = 0.25$  V.

# A. CASCADED 3-STAGE CP

The µPM dominates the total power budget of the XO-Timer. Thus, we contrive a  $\mu$ PM topology that exhibits a high power efficiency. The conventional µPM [3], [4] employs separated CPs and multiple voltage-regulation loops to generate different output voltages, which requires individual switching clocks. Herein, we propose a µPM with a cascaded 3-stage CP and a single voltage-regulation loop to reduce the power and area overheads, described as follows: A switchedcapacitor voltage adder (SCVA) heads our cascaded 3-stage CP [Fig. 2(b)]. The complementary non-overlap clocks  $\Phi_{1,2}$ control the switches for each unit cell. When  $\Phi_1$  is low,  $V_{\text{REG}}$  (generated by the regulator to stabilize  $V_{0,1}$ , detailed in Section III-C) charges the flying capacitor  $C_1$  (25 pF). In the next phase, when  $\Phi_2$  is low, the bottom plate of  $C_1$  is boosted to  $V_{\rm EH}$ . Together with the charges stored in  $C_1$  in the previous phase, the output voltage  $V_{out}$  equals the sum of  $V_{\text{REG}}$  and  $V_{\text{EH}}$ . The simulated nominal  $V_{0,1}$  is pitched to 0.37 V by the regulation loop. This  $V_{0,1}$  is offered to

the DIV, part of the reconfigurable  $A_{\rm XT}$ , and the subsequent stage of the CP. They together draw 9.46  $\mu$ A from  $V_{\rm O,1}$ .

The 2<sup>nd</sup> stage of the CP is a switched-capacitor voltage doubler (SCVD), which is similar to the SCVA with its unit cell shown in Fig. 2 (b) and  $C_2$  is 6.16 pF. Theoretically, its output voltage  $V_{0,2}$  is  $2 \times V_{0,1}$ . The simulated  $V_{0,2}$  is 0.7 V. This  $V_{0,2}$  powers up the reference generator for the regulation loop, the bias circuit for the reconfigurable  $A_{XT}$ , and the 3<sup>rd</sup> stage of CP, which altogether draw 2.23  $\mu$ A of current.

The final stage of the CP is a single cross-coupled rectifier ( $C_3$ : 2.8 pF), as shown in Fig. 2(b). The BTRO generates its switching clocks with  $V_{O,1}$  regulating the swing. Hence, the output voltage of this rectifier ( $V_{O,3}$ ) is  $V_{O,2} + V_{O,1}$ . With a simulated  $V_{O,3}$  of 1 V, this stage supplies a differential amplifier  $A_E$  with an active current-mirror load to secure the robustness of the regulation loop. Further, it also powers the mode control logic for  $\mu$ PM and  $A_{XT}$ . Its current budget is 0.2  $\mu$ A, which has a negligible impact on the system's overall efficiency.

# **B.** NON-OVERLAP BOOTSTRAPPED RING OSCILLATOR We can upsize the SCVA and SCVD to raise their outputcurrent capacity and reduce the output ripples by parallelizing 6-unit cells with 6 clock phases [15], [16], [17]. A 3-stage differential BTRO [18], directly powered by $V_{\rm EH}$ for self-startup, generates their 6-phase non-overlap clock (Fig. 3). The swing-boosted clock reduces the loss of the CP and avoids extra clock buffers by using the bootstrapped capacitor $C_{\rm b}$ . The differential bootstrapped inverter generates complementary clocks $Out_{+/-}$ with swing from $-V_{\rm EH}$ to $2V_{\rm EH}$ . Each of these six outputs drives the non-overlap clock generator to deliver non-overlap clocks to reduce the



FIGURE 2. (a) Proposed  $\mu$ PM featuring a cascaded 3-stage CP and a single voltage-regulation loop. Simulated output voltages and currents also shown. Its stage-to-stage voltages ( $V_{0,1} < V_{0,2} < V_{0,3}$ ) and currents ( $l_{0,1} > l_{0,2} > l_{0,3}$ ) are customizable for each sub-function of the XO-Timer. (b) Implementation of the unit cell of SCVA, SCVD, and rectifier.

short-circuit current within the SCVA/SCVD. The NOR and NOT gates within the non-overlap clock generator are also bootstrapped. We also utilize a pair of regular inverters D powered by  $V_{O,1}$  to drive the rectifier. From the simulation, the 6-phase operation of the SCVA and SCVD allows a 48.6% reduction of the total smoothing capacitors ( $C_{O1-2}$ ) under the same output ripple target (i.e., 5 mV at  $V_{O,1}$ ).

# C. VOLTAGE REGULATION

Our voltage-regulation loop uses the  $A_{\rm E}$  and a reference generator to globally regulate all internal voltages. During the startup, the voltage reference and error amplifier ( $A_{\rm E}$ ) take time to enter steady-state and their output may not be accurate. Hence, a kickstart switch aids in securing a robust startup by forcing  $V_{\rm REG} = V_{\rm EH}$  (Fig. 2). After voltage reference and  $A_{\rm E}$  entering the steady-state (1 ms in our design to cope with PVT variations), the kickstart switch is disabled. Then  $A_{\rm E}$  compares  $V_{\rm SEN}$  (scaled from  $V_{\rm O,1}$  by  $R_1$  and  $R_2$ ) with the reference voltage  $V_{\rm REF}$ . Together with  $M_{\rm VR}$ , they can control  $V_{\rm REG}$  and thus regulate  $V_{\rm O,1}$  as well as  $V_{\rm O,2}$  and  $V_{\rm O,3}$ . Fig. 2(a) shows the timing diagram of the kickstart control,  $V_{\rm REF}$ ,  $V_{\rm O,1}$   $V_{\rm O,2}$  and  $V_{\rm O,3}$  during the startup. Our reference generator operates under a wide supply voltage range of 0.5 to 0.9 V [19]. The transistors work in the subthreshold region with low gate-to-source voltages (~0.2 to 0.3 V). The simulated power consumption of the voltage reference is 181 nW at  $V_{0,2} = 0.7$  V.  $V_{\text{REF}}$  is 0.19 V and its simulated line sensitivity is 0.24%/V.

Fig. 4 (a) shows the simulated  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$  against supply-voltage variation at room temperature with and without the regulation loop. Without the regulation loop, the simulated line regulations ( $\Delta V_{OUT}/\Delta V_{IN}$ ) are 1.9 V/V, 3.1 V/V, and 4.8 V/V, for  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$ , respectively. They improve to 0.2 V/V, 0.97 V/V, and 1.46 V/V with the regulation loop in-place. Similarly, Fig. 4(b) plots the simulated  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$  against temperature variation at  $V_{\rm EH} = 0.25$  V with and without the regulation loop. Without the regulation loop,  $V_{0,3}$  significantly exceeds the device voltage limit (>1.18 V). In contrast, with the regulation loop,  $V_{0,3}$  is limited to <1.06 V. Besides, as shown in Fig. 4(c), the regulation loop also reduces the power of the  $\mu$ PM by 1.72× at  $V_{\rm EH} = 0.25$  V. The overall power efficiency of cascaded 3-stage CP is 47.8%. The  $M_{\rm VR}$  [Fig. 2(a)] consumes 25.5% of power to safeguard the operation of the µPM against supply-voltage and temperature variation. The



FIGURE 3. The schematics of the BTRO to achieve a swing of  $3 \times V_{EH}$ . We utilize non-overlap clock generators to obviate the short-circuit current loss within the SCVA and SCVD. The logic elements within the non-overlap clock generators are also bootstrapped logic gates powered by  $V_{EH}$ , except D which generate  $\Phi'_1$  and  $\Phi'_2$  are regular inverters powered by  $V_{0,1}$ .



**FIGURE 4.** Output voltages of each stage with and without the voltage-regulation loop versus (a)  $V_{EH}$  and (b) temperature. (c) Simulated power consumption of the  $\mu$ PM with and without the voltage-regulation loop versus  $V_{EH}$ . (d) Simulated output ripple at  $V_{0,1}$  of traditional regulation loop and proposed regulation loop.

power efficiencies of the 1<sup>st</sup>, 2<sup>nd</sup>, and 3<sup>rd</sup> stages of the CP are 93.3%, 60.3%, and 34.8%, respectively. The power efficiency decreases at the subsequent stages as the shunt loss is proportional to  $V_o^2 C_p$ , where  $C_P$  is the parasitic capacitance [17]. In our design, the parasitic capacitances from the flying capacitors dominate  $C_P$ . Thus, we downsize the flying capacitors for the subsequent stages as  $C_1>C_2>C_3$  to reduce the  $C_P$  and the power loss. This reasoning justifies



**FIGURE 5.** One-stage Pierce Oscillator where  $L_M$ ,  $R_M$  and  $C_M$  are the motional inductance, motional resistance, and motional capacitance, respectively, and  $C_S$  is the parasitic capacitance of the crystal package.

the design viewpoint that  $I_{O,1} > I_{O,2} > I_{O,3}$  since the subsequent stage cannot supply heavy load due to the small flying capacitor.

The traditional µPM utilizes separate regulation loops for each output and alters the BTRO's frequency to regulate the output voltages [3], [4]. This architecture leads to two critical drawbacks. First, when  $V_{\rm EH}$  increases, the loop reduces the switching frequency of the BTRO to maintain the output voltages. Hence, the output ripple deteriorates, aggravating the reference spur to the XO-Timer. Fig. 4(d) depicts the output ripple of  $V_{0,1}$  against  $V_{EH}$  for both the traditional and proposed regulation loops. The output ripple of the proposed structure improves as VEH increases, along with the frequency of the BTRO, in contrast to the traditional approach. The output ripple minimization of  $V_{0,2}$  and  $V_{0,3}$ is consistent with  $V_{0,1}$ ; the output ripples on  $V_{0,2}$  and  $V_{0,3}$ are 4.6 and 2.4 mV, respectively. Additionally, most blocks powered by  $V_{0,2}$  and  $V_{0,3}$  are for control logic, which are insensitive to the ripple, except for the biasing circuit of  $A_{\rm XT}$ , whose PSRR at 16 MHz is -10 dB. Second, in the traditional µPM, multiple BTROs are compulsory for different CPs to regulate their output voltages, while only one BTRO is necessary for our architecture. From the simulation, with identical output voltages and currents, the power consumption of the proposed  $\mu$ PM is 8.7  $\mu$ W, while that of the traditional  $\mu$ PM is 12  $\mu$ W, attributable to the extra two BTROs.

### **IV. ULV RECONFIGURABLE CORE AMPLIFIER AXT**

The Pierce Oscillator (Fig. 5) with an optimum  $C_L$  is well-suited as the XO due to its simplicity and low PN profile [20]. The motional inductance  $L_M$ , motional resistance  $R_M$ , motional capacitance  $C_M$  and the parasitic capacitance of the crystal package  $C_S$  form the equivalent circuit of the crystal unit. We can determine the characteristics and performance of the XO with the small-signal impedance  $Z_C$ sensed from the motional branch. The negative resistance, defined as

$$R_N \equiv -Re(Z_C) \tag{1}$$



FIGURE 6. Reconfigurable core amplifier (A<sub>XT</sub>) in the (a) HPM and (b) LPM.

should be larger than  $R_{\rm M}$  to compensate for the loss of the crystal and sustain the oscillation. With a MHz-range 1-stage Pierce Oscillator, the power consumption is at tens of  $\mu$ W, rendering it sub-optimal as the sleep-timer [21], [22].

Compared with the 1-stage  $g_m$ , a multi-stage  $g_m$  [20], [23] can provide a higher transconductance-over-current ratio with the effective  $g_{\rm m}$  of the amplifier boosted by the gains of the preceding stages. Usually, we choose  $C_{\rm L}$  as specified by the manufacturer to obtain an accurate frequency. For the sleep-timer with the frequency accuracy relaxed, keeping such a specified  $C_{\rm L}$  is not mandatory. Since  $R_{\rm N}$  is inversely proportional to  $C_{\rm L}^2$ , we can set a lower  $g_{\rm m}$  with a reduced  $C_{\rm L}$ while maintaining the  $R_{\rm N}$ , resulting in a lower power budget for  $A_{\rm XT}$ . To this end, we propose a reconfigurable  $A_{\rm XT}$  featuring different  $g_m$  in the HPM and LPM [Fig. 6(a) and (b)]. A 3-stage  $g_{\rm m}$  with  $C_{\rm L}$  reduction secures  $R_{\rm N}$  and sustains the oscillation of the XO with reduced power consumption in the LPM. In the HPM, we juxtapose a conventional 1-stage  $g_{\rm m}$  with the 3-stage  $g_{\rm m}$  to provide an oscillation signal with a premium PN.

Fig. 6(b) presents the reconfigurable  $A_{\rm XT}$  in the LPM, which is a 3-stage cascaded common-source (CS) amplifier. The first two stages provide gain to the third stage, thus boosting the effective  $g_{\rm m}$  of the  $A_{\rm XT}$  to:

$$g_{m,tot-3} = g_{m1}R_{out1}g_{m2}R_{out2}g_{m3} \tag{2}$$

where  $R_{out1}$  ( $R_{out2}$ ) stands for the parallel resistance of  $R_{L1}$ and  $R_X$  ( $R_{L2}$  and  $R_Y$ ). Also, we remove  $C_L$  from the 3-stage  $g_m$ . Still, there are residue capacitances from the I/O pads and the PCB traces (~4 pF on each node). Fig. 7(a) plots the  $R_N$ as a function of the total bias current at room temperature, manifesting the improvement brought by the 3-stage  $g_m$  plus





**FIGURE 7.** (a)  $R_N$  of 1-stage  $g_m$  with  $C_L$  of 6 pF and 2 pF, and 3-stage  $g_m$  with  $C_L$  of 2 pF versus  $I_0$  at room temperature. (b)  $R_N$  of the 3-stage  $g_m$  and 5-stage  $g_m$  versus  $I_0$  at room temperature. For (a) and (b),  $R_N = 100 \ \Omega$  are depicted with the black dashed line.

 $C_{\rm L}$  reduction.  $R_{\rm N}$  should be >100  $\Omega$  to cover the  $R_{\rm M}$  variation of different crystals (measured  $R_{\rm M}$ : 42.5  $\Omega$ ). Yet, for the 1stage  $g_{\rm m}$  without  $C_{\rm L}$  reduction,  $R_{\rm N}$  is <30  $\Omega$  for  $I_{\rm D}$  < 5.5  $\mu$ A. If we downsize  $C_{\rm L}$  to 2 pF,  $R_{\rm N}$  of the 1-stage  $g_{\rm m}$ can increase. Then, it is still <100  $\Omega$  with  $I_{\rm D}$  < 4  $\mu$ A. To further  $R_{\rm N}$ , we reconfigure  $A_{\rm XT}$  as a 3-stage  $g_{\rm m}$ . With both techniques applied, we obtain  $R_{\rm N}$  > 100  $\Omega$  over a wide current range (3 to 5.5  $\mu$ A). For instance, when  $I_{\rm D}$  = 5.5  $\mu$ A,  $R_{\rm N}$  of the 3-stage- $g_{\rm m}$  is >574  $\Omega$ , which is ~4.2× larger than that only with  $C_{\rm L}$  reduction.

Theoretically, we can add the number of stages to achieve a higher  $R_N$ . Yet, 3-stage  $g_m$  is adequate here for two reasons. First, under the same current budget, increasing the number of stages results in a lower current for each stage. Consequently, the preceding stages take up most of the current to secure the bandwidth, leaving a tiny budget for the final stage. Verified from simulation,  $R_N$  of 3-stage  $g_m$  is larger than that of 5-stage  $g_m$  when  $I_D < 4.7 \ \mu A$  [Fig. 7(b)]. Second, the  $R_N$  of a 3-stage  $g_m$  is less sensitive to PVT



FIGURE 8. 200-run Monte-Carlo  $R_N$  simulation: (a) 3-stage  $g_m$  and (b) 5-stage  $g_m$ .

variations than that of a 5-stage  $g_m$ . Considering a 5-stage  $g_m$ , we can express the effective  $g_m$  as:

$$g_{m,tot-5} = g_{m1}R_{out1}g_{m2}R_{out2}$$
$$g_{m3}R_{out3}g_{m4}R_{out4}g_{m5}$$
(3)

To simplify the discussion, we omit  $C_S$  here. Also, we assume an equal distribution of the total current (thus  $g_m$ ) to each stage for both 3-stage and 5-stage  $g_m$ . The impedance  $Z_{amp}$  across the amplifier (Fig. 5) is:

$$Z_{amp} = \frac{-g_m}{4\omega_0^2 C_L^2} + \frac{1}{j\omega_0 C_L}$$
(4)

Substituting (2) and (3) into (4) with  $R_{out1}$ ,  $R_{out2}$ ,  $R_{out3}$ , and  $R_{out4}$  regarded as R,  $g_{m1}$ ,  $g_{m2}$ ,  $g_{m3}$ ,  $g_{m4}$ , and  $g_{m5}$ regarded  $g_m/N$  where N is the number of the stages, the  $R_N$  of the 3-stage and 5-stage- $g_m$  are:

$$R_{N,3} = \frac{-\left(\frac{g_m}{3}\right)^3 R^2}{4\omega_0^2 C_L^2}$$
(5)

$$R_{N,5} = \frac{-\left(\frac{g_m}{5}\right)^5 R^4}{4\omega_0^2 C_L^2} \tag{6}$$

We can express the relative sensitivity of  $R_N$  to  $g_m$  as:

$$S = \frac{\frac{\partial R_N}{R_N}}{\frac{\partial g_m}{g_{m,tot}}} \tag{7}$$

Hence, the relative sensitivity of  $R_N$  to  $g_m$  for the 3-stage  $g_m$  and 5-stage  $g_m$  are  $S_3 = 3$  and  $S_5 = 5$ , respectively, signifying that the sensitivity of  $R_N$  to  $g_m$  is commensurate with *N*. From the Monte-Carlo simulation, the average  $R_N$  of 3-stage  $g_m$  is 659  $\Omega$  (minimum: 165  $\Omega$ ) with  $I_D = 5.5 \ \mu A$  (Fig. 8). Under the same current budget, the standard deviation of  $R_N$  of 5-stage  $g_m$  is 20.9× larger than that of 3-stage  $g_m$ , and 20% of  $R_N$  is <100  $\Omega$ .

Tuning the  $C_{\rm L}$  results in a frequency deviation from the nominal frequency. The frequency deviation between HPM and LPM can be formulated as:

$$\frac{\Delta f_{H-L}}{f_{\rm S}} = \frac{C_M}{2} \left( \frac{1}{C_L + C_T' + C_{\rm S}} - \frac{1}{C_T' + C_{\rm S}} \right). \tag{8}$$

For  $C_{\rm M} = 1.95$  fF,  $C_{\rm S} = 1$  pF,  $C_{\rm L} = 6$  pF,  $C_{\rm T}$ ' (dominated by I/O pads and PCB traces) = 2 pF, and  $f_{\rm s}$  (series resonance frequency of the crystal) = 16 MHz, the frequency deviation is 216 ppm, conforming with the BLE standard ( $\leq \pm 500$  ppm for the sleep-timer [24]).

In the HPM, we use the 16-MHz clock as the frequency reference for the PLL. As the frequency reference requires a superior PN, we employ the Pierce oscillator as the 1-stage  $g_{\rm m}$  [20]. Its PN due to the flicker noise is given as [25]

$$S_{\varphi_n^2} = \frac{\overline{\Gamma_i}^2 K_{fi}}{(2C_L | V_{XO+}|)^2 \Delta \omega^3} \left(\frac{C_m}{2C_L}\right)^2 \tag{9}$$

where  $\overline{\Gamma_i}$  is the effective impulse sensitivity function,  $K_{fi}$  is the flicker noise current source,  $V_{XO+}$  is the output swing, and  $\Delta \omega$  is the offset frequency. From (9), the PN is proportional to  $K_{fi}$  and inversely proportional to  $C_L^3$ , which explains the PN degradation in the LPM as we only put the noisy 3-stage  $g_m$  in place and remove the 6-pF  $C_L$  from  $A_{XT}$ . CMOS transistors inserted as switches control the state of  $A_{XT}$ .  $V_{O,3}$  buffers the control signals to improve the onresistances of the switches. In LPM,  $S_{1-3}$  isolate  $C_L$  and  $R_L$ from the 3-stage  $g_m$ . Also,  $S_{1-2}$  cut down the power of the 1-stage  $g_m$  in LPM. They should be optimized not only to minimize their on-resistances in the on-state, but also to limit the leakage currents in the off-state. In LPM, the leakage power for the 1-stage  $g_m$  is 2.52 nW at room temperature, which is negligible to the sleep-timer's power.

 $V_{\rm EH}$  directly powers most of  $A_{\rm XT}$  to improve power efficiency. Hence, it is essential to design  $A_{\rm XT}$  capable of functioning with  $V_{DD}$  down to 0.25 V. Current-source load prevails as the core amplifier of XO [9], [10]. Yet, it demands a higher voltage headroom to bias the active load, rendering it unsuitable for ULV operation. Further, we cannot self-bias the reconfigurable  $A_{\rm XT}$  by the feedback resistor with  $V_{DD}$ of 0.25 V. Thus, we employ resistive loads for both 1-stage  $g_{\rm m}$  and 3-stage  $g_{\rm m}$ , which offer higher effective  $g_{\rm m}$  (>4×) and are less prone to VT variation than the current-source load. Although the resistive loads are prone to power-supply noise, for energy harvesting applications powered by the solar cell with typical noise spectral density, e.g., < 45  $nV/\sqrt{Hz}$  with 400  $\mu$ W power at 1-kHz in [26], it degrades the PN by < 0.98 dBc/Hz at 1-kHz offset in the HPM. Herein, a voltage-and-temperature-tracking  $V_{\text{bias}}$  from  $\mu$ PM secures the robustness of  $A_{\rm XT}$  in both HPM and LPM.  $V_{\rm O,2}$ powers this bias circuit, which is a typical beta-multiplier. Its power consumption is 1.23  $\mu$ W. The bias-purposed  $R_X$  and  $R_{\rm Y}$  are pseudo-resistors to save the chip area. Their resistances are >15.3 M $\Omega$  across -20 to 70 °C, and >18.3 M $\Omega$ over a  $V_{\rm EH}$  variation of 0.25 to 0.3 V. The simulated PN only degrades by 0.3 dBc/Hz at 1-kHz offset with pseudo-resistors compared with regular polysilicon resistors.

We build the 3-stage  $g_{\rm m}$  (Fig. 6) with cascaded CS amplifiers.  $V_{\rm EH}$  drives the first and third branches to maximize the power efficiency, while  $V_{\rm O,1}$  drives the second stage to enhance the swing of  $V_{\rm Tim}$ , the output to the DIV. After securing the gain and bandwidth of the first two stages, the current is allocated to the third stage to maximize the



FIGURE 9. The output swing of  $V_{\text{Tim}}$  as a function of the DC voltage of  $V_{\text{Tim}}$ .



FIGURE 10. DIV implementation, with the signals S<4:0> provided off-chip for design flexibility.

effective  $g_{\rm m}$ . In the conventional amplifier design where linearity is critical,  $R_{\rm L2}$  should be designed such that  $V_{\rm Tim}$ is  $\sim 0.5 \times V_{\rm O,1}$  and the swing headroom and linearity are maximized. Yet, for the timer where linearity is relatively unimportant,  $R_{\rm L2}$  can be further to enhance the amplifier gain and thus the signal's amplitude and safeguard the operation of the DIV. Fig. 9 shows the signal swing at  $V_{\rm Tim}$  as a function of the DC value of  $V_{\rm Tim}$  by changing  $R_{\rm L2}$  and preserving the current budget. The swing at  $V_{\rm Tim}$  reaches a maximum when  $V_{\rm Tim}$  is  $\sim 100$  mV, then decreases when  $V_{\rm Tim}$  rises, attributable to the drops in  $R_{\rm L2}$  and the gain.

DIV is powered by  $V_{0,1}$  to save power (simulated: 139.4 nW). Four asynchronous dividers implemented with low- $V_t$  transistors head the DIV, thus enabling fast switching and dividing the frequency of  $V_{\text{Tim}}$  by  $16 \times$  (Fig. 10). This 1-MHz signal is then delivered to a synchronous divider (divby-31). The divider is based on T flip-flops and logic gates with standard- $V_t$  transistors to generate the final 32.258-kHz clock. This scheme saves 39.4% (simulation) power due to the inevitable leakage current in the employed 28-nm CMOS process compared with the scheme that has all logic gates implemented with low- $V_t$  transistors while safeguarding the functionality of the DIV.

#### **V. MEASUREMENT RESULTS**

We prototyped the ULV XO-Timer with fully on-chip  $C_L$  in 28-nm CMOS (Fig. 11). We selected the 28-nm CMOS



FIGURE 11. Chip micrograph of the fabricated ULV XO-Timer.



FIGURE 12. Power breakdown of ULV XO-Timer at (a) HPM and (b) LPM.



FIGURE 13. Measured  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$  against (a) temperature and (b)  $V_{EH}$ .

process to streamline the integration of this XO-timer with the mainstream process for IoT transceivers [3], [4]. The active area is 0.127 mm<sup>2</sup>, dominated by the  $\mu$ PM (0.121 mm<sup>2</sup>). We employed 16-MHz quartz crystal 7V-16.000MAAE-T from the TXC Corporation. Its inherent frequency stability is ±30 ppm over -20 to 70 °C from the datasheet [27]. Fig. 12 summarizes the simulated power consumption of the XO-Timer in HPM and LPM. The  $\mu$ PM takes up 89.5% of the power at LPM and 92% of the area, which can be partially absorbed at the system level by sharing it with other circuitries [3], [4].

We first characterize the  $\mu$ PM. Fig. 13 plots the  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$  variations against the temperature and supply voltage. Benefitting from the voltage-regulation loop, the variation of  $V_{0,1}$ ,  $V_{0,2}$ , and  $V_{0,3}$  is 9.6%, 14.4%, and 20.6% over -20 to 70 °C, whereas the variation of  $V_{0,1}$ ,  $V_{0,2}$  and  $V_{0,3}$  is 2.2%, 5.2% and 6.4% over  $V_{\text{EH}}$  from 0.25 to



FIGURE 14. Measured frequency deviations of 6 samples against temperature and supply voltage variations at the HPM: (a) and (b), and LPM: (c) and (d).

0.3 V (20% variation), respectively. The BTRO frequency is 2.7 MHz at 0.25 V and at room temperature, which is  $1.52 \times$  slower than the simulation result in SS corner. This deviation indeed limits our minimum  $V_{\rm EH}$  as the low BTRO frequency paralyzes the operation of the  $\mu$ PM. When the frequency is tallied with the simulation result, we could decrease  $V_{\rm EH}$  to <0.2 V. On the other hand, the breakdown voltage of the transistor in this process (1 V) limits the maximum  $V_{\rm EH}$ .

We characterize the performance of the XO-Timer with 6 available samples (different dies and crystals). Fig. 14 (a)-(d) shows the measured frequency variation of the XO-Timer over the temperature (-20 to 70 °C) and voltage (0.25 to 0.3 V) variations. In the HPM, the worst frequency variation is <13.2 ppm and <0.37 ppm against temperature and voltage variation, respectively. As the intrinsic frequency variation of the crystal is  $<\pm 30$  ppm across -20to 70 °C, and the measured frequency variation matches with the temperature profile of the AT-cut crystal, the frequency variation in the HPM is attributable to the crystal itself. In the LPM, the worst frequency variation is < 86.7 ppm <42 ppm over temperature and voltage variation. Both are considerably larger than the HPM as the 3-stage  $g_{\rm m}$ 's equivalent impedance in the LPM is sensitive to temperature and voltage variation. Still, they conform to the BLE standard  $(<\pm 500 \text{ ppm})$  with an adequate margin for the timer and aging.

Fig. 15 illustrates the measured power consumption of the XO-Timer across the temperature and voltage in both HPM and LPM. The average power of the XO-Timer in the HPM and LPM is 24.3 and 11.4  $\mu$ W at room temperature, respectively. Herein, we benchmark the sleep-timer using the figure of merit proposed in [9]:

$$FoM_2 = P_S + P_A V_S, \tag{10}$$



FIGURE 15. Measured power consumption of 6 samples against temperature and supply voltage variations in the HPM (a) and (b), and LPM (c) and (d).



FIGURE 16. (a) Measured Allan deviation of the 32.258-kHz timer in LPM. (b) PN of the 16-MHz XO in HPM.

where  $P_{\rm S}$  is the power in the sleep mode,  $P_{\rm A}$  is the transceiver's active power (assume 10 mW), and  $V_{\rm S}$  is the sleep-timer's frequency error across the temperature range and  $\Delta V_{\rm DD}$  of 0.3 V. The FoM<sub>2</sub> of the proposed timer is 14.8  $\mu$ W, outperforming the prior art [9], [11], [28], [29].

Next, we measured the Allan deviation of the 32.258-kHz output of the XO-Timer to determine the long-term frequency stability [Fig. 16(a)]. The Allan deviation floor is 35.1 ppb with an average time of 10 s, comparable to other state-ofthe-art sleep-timers [9], [30]. Fig. 16(b) shows the PN of the 16-MHz signal in the HPM. It scores a PN of -133.8 dBc/Hz at 1-kHz offset. Its FoM<sub>1</sub>, only including the power of XO as in [20] for a fair comparison, is -236 dBc/Hz at 1-kHz offset, comparable with the prior art [20], [31], [32]. There are spurs in Fig. 16(b) due to the crosstalk from the 32.258-kHz output (<-82 dBc) and µPM (<-86.5 dBc). These reference spurs after passing through the PLL are upconverted by  $20 \log (N)$ , where N is 150 for 2.4-GHz PLL output. Hence, these spurs become <-38.5 and <-43 dBc, which meets the BLE standard (i.e., <-30 dBc [33]). There are two spurs at 50 Hz and 150 Hz due to the electromagnetic interference

|                                              | This Work                           |                    | ISSCC'22 [11]            |            | JSSC'18 [20] | ISSCC'15 [9]          |           |
|----------------------------------------------|-------------------------------------|--------------------|--------------------------|------------|--------------|-----------------------|-----------|
| Key Architecture                             | Dual-Mode XO-Timer<br>+ On-Chip μPM |                    | Dual-Mode XO-Timer       |            | XO           | Dual-Mode XO-Timer    |           |
| Technology [nm]                              | 28                                  |                    | 28                       |            | 65           | 65                    |           |
| Area [mm <sup>2</sup> ]                      | 0.127 #                             |                    | 2.51 (inc. LDOs)         |            | 0.023        | 0.13                  |           |
| Supply Voltage [V]                           | 0.25 to 0.3                         |                    | 1.5 to 1.7               |            | 0.35         | N/A                   |           |
| Temperature, T <sub>range</sub> [°C]         | -20 to 70                           |                    | -40 to 90                |            | -40 to 90    | -40 to 90             |           |
| Frequency                                    | 16 MHz                              | 32.258 kHz         | 76.8 MHz                 | 32.768 kHz | 24 MHz       | 24 MHz                | 31.25 kHz |
| Load Capacitance [pF]                        | 6                                   | 2 <sup>&amp;</sup> | N/A                      |            | 6            | N/A                   |           |
| Power, <i>P</i> [µW]                         | 24.3                                | 11.4               | 1080                     | 48.2       | 31.8         | 445                   | 37        |
| PN @ 1kHz [dBc/Hz]                           | -133.8                              | N/A                | -134.03                  | N/A        | -134         |                       | N/A       |
| FoM₁ @ 1kHz [dBc/Hz]                         | <b>-</b> 236 *                      | N/A                | -231.4                   | N/A        | -236.6*      |                       | N/A       |
| $\Delta$ f/f versus T <sub>range</sub> [ppm] | 13.2 <sup>†</sup>                   | 86.7 †             |                          | 88.4       | 14.1         | 16                    | 26        |
| ∆f/f versus Supply<br>Voltage [ppm]          | 0.37 †                              | <b>42</b> †        |                          |            | 17.9         | 2.07                  | 2.7       |
|                                              | 0.25 to 0.3 V                       |                    | -                        |            | 0.3 to 0.5 V | ∆V = 0.3V             |           |
| FoM₂ [ <i>µ</i> W]                           | N/A                                 | 14.8               | N/A                      | 49.084     | N/A          | 37.3                  |           |
| Allan Deviation [ppb]                        | 35.1 (10s averaging time)           |                    | 2500 (1s averaging time) |            | N/A          | 8 (2s averaging time) |           |

TABLE 1. Performance summary and comparison with recent art.

FoM<sub>1</sub> for XO: PN – 20log( $flf_{offset}$ ) + 10log(P/1mW). FoM<sub>2</sub> for Timer:  $P_S + P_A V_S$ , where  $P_A=10$ mW &  $V_S$  is from [9]. #µPM is included. & Parasitic capacitance dominated by I/O pads and PCB traces. \*Only include the power of the amplifiers. † Worst values of 6 samples.  $\blacktriangle$  Voltage variation data unavailable, excluded from FoM<sub>2</sub>.



FIGURE 17. Measured (a) transient waveform and (b) frequency variation of the XO-Timer under transition between HPM and LPM.



As there is oscillation on the crystal in the LPM, it can improve the startup time of the XO in HPM from 12 to 2.8 ms without any additional quick-startup technique [Fig. 17 (a)]. The energy consumed in this transition time is 68.0 nJ. As mentioned in Section IV, the  $C_{\rm L}$ -reduction technique results in frequency variation between the LPM and HPM. Fig. 17(b) displays such frequency variation of the 16-MHz clock in the time domain. The frequency deviation  $\frac{\Delta f_{H-L}}{f_c}$  between two modes is 290 ppm (room temperature,  $V_{\rm EH}^{55} = 0.25$  V). It is larger than the calculated value in Section IV, attributable to the uncertainties in the parasitic capacitances from the I/O pads and PCB traces. Further, Fig. 18 exhibits the frequency variations between HPM and LPM for different voltages and temperatures. As the frequency error induced by  $C_{\rm L}$  reduction is predictable, we can account for it in the digital domain for calibration. Even



FIGURE 18. Measured frequency variation between HPM and LPM of 6 samples against (a) temperature and (b) supply voltage.

without calibration, they are all compatible with the BLE standard (<±500 ppm).

There are missing cycles within the DIV when switching from LPM to HPM. This phenomenon derives from the reduction in oscillation amplitude over the crystal after an abrupt increase in  $C_{\rm L}$  (6 pF). Consequently, the DIV may miss some cycles from  $A_{\rm XT}$ . Thus, the measured first-cycle period for the sleep-timer is 56  $\mu$ s. Even so, we can solve this issue by adopting progressive  $C_{\rm L}$  switching as in [9]. From simulation, if we add up a small  $C_{\rm L}$  (0.5 pF) to 6 pF by every 6  $\mu$ s, which we can achieve with simple digital counters, we can maintain  $V_{\rm Tim}$  above 77.4% of its full swing, and no missing cycles will happen.

We also powered the XO-Timer with a solar cell  $(1.2 \times 1.6 \text{ cm}^2)$  in an indoor irradiance, resulting in an output voltage  $V_{\text{EH}}$  ranging from 0.25 to 0.3 V [Fig. 19(a)]. To mimic the power profile of a practical IoT node, we assume a BLE advertising packet of 128 µs [34], wherein  $V_{\text{EH}}$  draws a



**FIGURE 19.** (a) Experimental setup for mimicking the operation of the solar-cell-powered XO-Timer together with a duty-cycled TX. The TX drains 20-mA current in a 128- $\mu$ s interval as BLE advertising packet. (b) Measured amplitude degradation against the reservoir capacitor size. (c) The measured transient plot of XO-Timer when  $C_R = 67 \ \mu$ F.

20-mA current to imitate the operation of the BLE transmitter [4]. Meanwhile, we parallel a reservoir capacitor  $C_R$  to the solar cell to buffer the current drain from the highcurrent load. Fig. 19(b) plots the amplitude degradation of the XO amplitude among various reservoir capacitors during the current drained. The 16 MHz XO can maintain >90% of its amplitude even with a reservoir capacitor of 67  $\mu$ F. Fig. 19(c) shows the transient of  $V_{\rm EH}$  and 16-MHz XO.

Benchmarking with the recent art [9], [11] and [20], this work is the first sub-0.5V single-crystal XO-Timer with an embedded  $\mu$ PM to manage the internal voltages for better power efficiency (Table 1). The MHz-clock's FoM<sub>1</sub> is -236 dBc/Hz, comparable to [20], yet it generates both MHz-clock and sleep timer from one crystal. The sleep timer's FoM<sub>2</sub> is 14.8  $\mu$ W, outscoring [9] and [11] by 2.5× and 3.3×, respectively. We acknowledge that the power and area overheads of the  $\mu$ PM can be partially absorbed at the system level by sharing its functions with other circuitries.

#### **VI. CONCLUSION**

This paper reported an ULV XO-Timer able to generate both a 16-MHz clock reference and a 32.258-kHz timer with a single crystal for ultra-low-power IoT nodes. It features a power-efficient  $\mu$ PM consisting of a cascaded 3-stage CP and a single voltage-regulation loop to tailor the internal voltage and current budgets for each sub-function while regulating them against supply voltage and temperature variations. The core amplifier  $A_{\rm XT}$  is ULV-enabled. It is also made reconfigurable to improve the current-to- $g_{\rm m}$  efficiency in the LPM by using a 3-stage  $g_{\rm m}$  plus  $C_{\rm L}$ -reduction, while safeguarding the PN in HPM by using a 1-stage  $g_{\rm m}$ . The measured power consumption of the XO-Timer in 28-nm CMOS is 24.3  $\mu$ W in the HPM, and is reduced to 11.4  $\mu$ W in the LPM. Furthermore, it also has a sleep-timer  $FoM_2$  of 14.8  $\mu$ W despite a limited voltage headroom, rendering this dual-mode XO-Timer with a single crystal a preeminent solution for the ULV energy-harvesting IoT nodes.

#### REFERENCES

- D. B. Ma and Y. Lu, "Power management circuit design for IoT nodes," in *Enabling the Internet of Things*, 1st ed., M. Alioto, Ed. New York, NY, USA: Springer Int., 2017, ch. 10, sec. 10.1, pp. 287–289.
- [2] S. Bandyopadhyay and A. P. Chandrakasan, "Platform architecture for solar, thermal, and vibration energy combining with MPPT and single inductor," *IEEE J. Solid-State Circuits*, vol. 47, no. 9, pp. 2199–2215, Sep. 2012.
- [3] H. Yi, W.-H. Yu, P.-I. Mak, J. Yin, and R. P. Martins, "A 0.18-V 382μW Bluetooth low-energy (BLE) receiver front-end with 1.33-nW sleep power for energy-harvesting applications in 28-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 53, pp. 1618–1627, Jun. 2018.
- [4] S. Yang, J. Yin, H. Yi, W.-H. Yu, P.-I. Mak, and R. P. Martins, "A 0.2-V energy-harvesting BLE transmitter with a micropower manager achieving % system efficiency at 0-dBm output and 5.2-nW sleep power in 28-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 54, no. 5, pp. 1351–1362, May 2019.
- [5] Z. Xu, W. Wang, N. Ning, W. M. Lim, Y. Liu, and Q. Yu, "A supply voltage and temperature variation-tolerant relaxation oscillator for biomedical systems based on dynamic threshold and switched resistors," *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.*, vol. 23, no. 4, pp. 786–790, Apr. 2015.
- [6] A. Kobayashi and K. Niitsu, "Low-voltage gate-leakage-based timer using an amplifier-less replica-bias switching technique in 55-nm DDC CMOS," *IEEE Open J. Circuits Syst.*, vol. 1, pp. 107–114, 2020.
- [7] K.-M. Lei, P.-I. Mak, and R. P. Martins, "A 0.35-V 5,200-μm<sup>2</sup> 2.1-MHz temperature-resilient relaxation oscillator with 667 fJ/cycle energy efficiency using an asymmetric swing-boosted RC network and a dual-path comparator," *IEEE J. Solid-State Circuits*, vol. 56, no. 9, pp. 2701–2710, Sep. 2021.
- [8] M. H.-W. Loo et al., "Fully-integrated timers for ultra-low-power Internet-of-Things nodes—Fundamentals and design techniques," *IEEE Access*, vol. 10, pp. 65936–65950, 2022.
- [9] D. Griffith, J. Murdock, P. Røine, and T. Murphy, "A 37μW dual-mode crystal oscillator for single-crystal radios," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2015, pp. 104–105.
- [10] D. Griffith, F. Dülger, G. Feygin, A. N. Mohieldin, and P. Vallur, "A 65nm CMOS DCXO system for generating 38.4MHz and a real time clock from a single crystal in 0.09mm<sup>2</sup>," in *IEEE Radio Freq. Integr. Circuits Symp. Dig.*, May 2010, pp. 321–324.
- [11] J. Jung, et al., "A single-crystal-oscillator-based clock-management IC with 18× start-up time reduction and 0.68ppm/°C duty-cycled machine-learning-based RCO calibration," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2022, pp. 58–60.
- [12] GreenCLK. Accessed: Mar. 20, 2023. [Online]. Available: https://www.mouser.com/datasheet/2/665/silego\_04142016\_SLG3 NB3331V-1216911.pdf
- [13] J. Kim, P. K. T. Mok, and C. Kim, "A 0.15 V input energy harvesting charge pump with dynamic body biasing and adaptive dead-time for efficiency improvement," *IEEE J. Solid-State Circuits*, vol. 50, no. 2,pp. 414–425, Feb. 2015.
- [14] L. Lin, K.-M. Lei, P.-I. Mak, and R. P. Martins, "A sub-0.5V crystal oscillator-timer (XO-timer) combining 16MHz reference and 32kHz sleep timer with a single crystal for energy-harvesting radios in 28nm CMOS" in *Proc. IEEE Asian Solid-State Circuits Conf. (A-SSCC)*, 2002, pp. 1–3.
- [15] D. Ma and F. Luo, "Robust multiple-phase switched-capacitor DC-DC power converter with digital interleaving regulation scheme," *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.*, vol. 16, no. 6, pp. 611–619, Jun. 2008.
- [16] P. Zhou, A. Paul, C. H. Kim, and S. S. Sapatnekar, "Distributed on-chip switched-capacitor DC–DC converters supporting DVFS in multicore systems," *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.*, vol. 22, no. 9, pp. 1954–1967, Sep. 2014.
- [17] H.-P. Le, S. R. Sanders, and E. Alon, "Design techniques for fully integrated switched-capacitor DC–DC converters," *IEEE J. Solid-State Circuits*, vol. 46, no. 9, pp. 2120–2131, Sep. 2011.

- [18] H. Yi, J. Yin, P.-I. Mak, and R. P. Martins, "A 0.032-mm<sup>2</sup> 0.15-V three-stage charge-pump scheme using a differential bootstrapped ring-VCO for energy-harvesting applications," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 65, no. 2, pp. 146–150, Feb. 2018.
- [19] J. Jiang, W. Shu, J. Chang, and J. Liu, "A novel subthreshold voltage reference featuring 17ppm/°C TC within -40°C to 125°C and 75dB PSRR," in *Proc. IEEE Int. Symp. Circuits Syst. (ISCAS)*, May 2015, pp. 501-504.
- [20] K.-M. Lei, P.-I. Mak, M.-K. Law, and R. P. Martins, "A regulation free sub-0.5-V 16-/24-MHz crystal oscillator with 14.2-nJ startup energy and 31.8-μW steady-state power," *IEEE J. Solid-State Circuits*, vol. 53, no. 9, pp. 2624–2635, Sep. 2018.
- [21] M. Ding, Y.-H. Liu, P. Harpe, C. Bachmann, K. Philips and A. Van Roermund, "A low-power fast start-up crystal oscillator with an autonomous dynamically adjusted load," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 66, no. 4, pp. 1382–1392, Apr. 2019.
- [22] A. Karimi-Bidhendi and P. Heydari, "A study of multi-phase injection on accelerating crystal oscillator start-up," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 67, no. 12, pp. 2868–2872, Dec. 2020.
- [23] S. Iguchi et al., "% power reduction by automatic self-power gating (ASPG) and multistage inverter for negative resistance (MINR) in 0.7V, 9.2µW, 39MHz crystal oscillator," in *IEEE Symp. VLSI Circuits Dig. Tech. Papers*, Jun. 2013, pp. C142–C143.
- [24] "Bluetooth Core Specification V5.2." Accessed: Mar. 20, 2023. [Online]. Available: https://www.bluetooth.com/specifications/ bluetooth-core-specification/
- [25] E. Vittoz, "General theory of high-Q oscillator" in *Low-Power Crystal and MEMS Oscillators*, 1st ed. Berlin, Germany: Springer, 2010, ch. 3, sec. 3.72, pp. 33–36.
- [26] Z. Chobola et al., "Measurement of low frequency noise of monocrystalline silicon solar cells," in *Proc. XVII IMEKO World Congr.*, Jun. 2003, pp. 1–4.
- [27] "16-MHz Quartz Crystal 7V-16.000MAAE-T From the TXC Corporation." Accessed: Mar. 20, 2023. [Online]. Available: https:// www.txccrystal.com/images/pdf/7v.pdf
- [28] D. Griffith et al., "A 190nW 33kHz RC oscillator with ±0.% temperature stability and 4ppm long term stability," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2014, pp. 300–301.
- [29] A. Paidimarri et al., "A 120nW 18.5kHz RC oscillator with comparator offset cancellation for ±0.% temperature stability," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2013, pp. 184–185.
- [30] A. Shrivastava, D. A. Kamakshi, and B. H. Calhoun, "A 1.5 nW, 32.768 kHz XTAL oscillator operational from a 0.3 V supply," *IEEE J. Solid-State Circuits*, vol. 51, no. 3, pp. 686–696, Mar. 2016.
- [31] J. B. Lechevallier, R. A. R. Van Der Zee, and B. Nauta, "Fast & energy efficient start-up of crystal oscillators by self-timed energy injection," *IEEE J. Solid-State Circuits*, vol. 54, no. 11, pp. 3107–3117, Nov. 2019.
- [32] K. M. Megawer et al., "A fast startup CMOS crystal oscillator using two-step injection," *IEEE J. Solid-State Circuits*, vol. 54, no. 12, pp. 3257–3268, Dec. 2019.
- [33] V. K. Chillara et al., "9.8 An 860μW 2.1-to-2.7GHz all-digital PLL-based frequency modulator with a DTC-assisted snapshot TDC for WPAN (Bluetooth Smart and ZigBee) applications," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Paper*, Feb. 2014, pp. 172–173.
- [34] D. Griffith, J. Murdock, and P. T. Røine, "A 24 MHz crystal oscillator with robust fast start-up using dithered injection," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2016, pp. 104–105.



LIWEN LIN (Graduate Student Member, IEEE) received the B.Sc. degree in electrical and computer engineering from the University of Macau, Macau, China, in 2019, where she is currently pursuing the Ph.D. degree from the State-Key Laboratory of Analog and Mixed-Signal VLSI, Faculty of Science and Technology. Her research interests include ultralow-voltage low-power analog and radio-frequency circuit techniques.



**KA-MENG LEI** (Senior Member, IEEE) received the B.Sc. degree in electrical and electronic engineering and the Ph.D. degree in electrical and computer engineering from the University of Macau, Macau, in 2012 and 2016, respectively.

He has been serving as an Assistant professor with the University of Macau since 2019. He was a Postdoctoral Fellow with Harvard University from 2017 to 2019, where he was involved in developing the high-resolution portable nuclear magnetic resonance (NMR) spectrometer. He has published

29 refereed papers. He coauthored one book *Handheld Total Chemical* and *Biological Analysis Systems: Bridging NMR, Digital Microfluidics, and* Semiconductors (Springer 2018), and two book chapter *Micro-NMR on* CMOS for Biomolecular Sensing (Springer 2018) and Ultra-Low-Voltage Clock References (Springer 2023). His current research interests include ultralow voltage analog circuit techniques, sensors and analog front-end interfaces, and high-resolution portable NMR platform.

Dr. Lei was a (co-)recipient of the Chipidea Microelectronics Prize 2017 (postgraduate), the Best Paper Award in ASQED 2013, the Distinguished Design Award from the IEEE Asian Solid-State Circuits Conference in 2015, the Silkroad Award from the ISSCC 2016, the FDCT Macao Science and Technology Award for Postgraduates in 2016, the 2016 Young Researcher Award from the Instituto International de Macau, and the IEEE SSCS Predoctoral Achievement Award in 2017. He was on the Organizing Committee of the IEEE A-SSCC 2019. He is an Associate Editor of the IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS. He is serving as a Committee Member for the IEEE SSCS Young Professionals and coordinating the Webinars for Young Excellence Program. He was a TPC Member of the ICTA in 2021 and 2022.



**PUI-IN MAK** (Fellow, IEEE) received the Ph.D. degree from the University of Macau (UM), Macau, China, in 2006.

He is currently a Full Professor with the UM Faculty of Science and Technology–ECE Department, and an Associate Director (Research) with the UM Institute of Microelectronics and State Key Laboratory of Analog and Mixed-Signal VLSI. His research interests are on analog and radio-frequency circuits and systems for wireless and multidisciplinary innovations.

Prof. Mak (co)-received the 2005 DAC/ISSCC Student Paper Award, the 2010 CASS Outstanding Young Author Award; National Scientific and the 2011 Technological Progress Award; the Best Associate Editor of IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-II: EXPRESS BRIEFS from 2012 to 2013, the 2015 A-SSCC Distinguished Design Award, and the ISSCC Silkroad Award. In 2005, he was decorated with the Honorary Title of Value for scientific merits by the Macau Government. His involvements with IEEE are: an Editorial Board Member of IEEE Press from 2014 to 2016; a Member of Board-of-Governors of IEEE Circuits and Systems Society from 2009 to 2011: a Senior Editor of IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS from 2014 to 2015; an Associate Editor of IEEE JOURNAL OF SOLID-STATE CIRCUITS in 2018, IEEE SOLID-STATE CIRCUITS LETTERS in 2017, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—PART I: REGULAR PAPERS from 2010 to 2011 and 2014 and 2015, and IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-PART II: EXPRESS BRIEFS from 2010 to 2013. He is/was the TPC Vice Co-Chair of ASP-DAC in 2016, a TPC Member of A-SSCC from 2013 to 2016, ESSCIRC from 2016 to 2017, and ISSCC from 2017 to 2019. He is/was a Distinguished Lecturer of IEEE Circuits and Systems Society from 2014 to 2015 and IEEE Solid-State Circuits Society from 2017 to 2018. He was the Chairman of the Distinguished Lecturer Program of IEEE Circuits and Systems Society from 2018 to 2019. He has been inducted as an Overseas Expert of the Chinese Academy of Sciences since 2018; has been a Fellow of the U.K. Institution of Engineering and Technology for contributions to engineering research, education and services since 2018; and has been a Fellow of the IEEE for contributions to radiofrequency and analog circuits since 2019, and has been a Fellow of the U.K. Royal Society of Chemistry since 2020.



**RUI P. MARTINS** (Fellow, IEEE) was born in April 1957. He received the bachelor's, master's, and Ph.D. degrees in electrical engineering and computers from the Department of Electrical and Computer Engineering (DECE), Instituto Superior Técnico (IST), University of Lisbon, Portugal, in 1980, 1985, and 1992, respectively. He received the Habilitation for Full Professor of electrical engineering and computers with the DECE, IST, University of Lisbon, in 2001.

He has been with the DECE/IST, University of Lisbon since October 1980. Since October 1992, he has been on leave from the University of Lisbon and with the DECE, Faculty of Science and Technology (FST), University of Macau (UM), Macau, where he has been the Chair Professor since August 2013. At FST, he was the Dean, from 1994 to 1997, and has been the UM's Vice-Rector, since September 1997. From September 2008 to August 2018, he was the Vice-Rector (Research) and from September 2018 to August 2023, he was the Vice-Rector (Global Affairs). In 2003, he created the Analog and Mixed Signal VLSI Research Laboratory, UM, elevated in January 2011 to the State Key Laboratory (SKLAB), China (the 1st in Engineering in Macau), being its Founding Director. He was the Founding Chair of UMTEC (UM company) from January 2009 to March 2019, supporting the incubation and creation of Digifluidic, in 2018, the first UM Spin-Off, whose CEO is a SKLAB Ph.D. graduate. He was also the Co-Founder of Chipidea Microelectronics (Macau) (later Synopsys-Macau, and currently Akrostar, where the CEO is one of his Ph.D. graduates), from 2001 to 2002. Within the scope of his teaching and research activities, he has taught 21 bachelor's and master's courses. In UM, he has supervised (or co-supervised) 47 theses and 26 Ph.D. and 21 master's students. He has authored or coauthored nine books and 12 book chapters; 49 patents (39 USA, three Taiwan, and seven China); 675 papers (289 articles in scientific journals and 386 papers in conference proceedings); and other 70 academic works, in a total of 815 publications.

Prof. Martins was the General Chair of the ACM/IEEE Asia South Pacific Design Automation Conference (ASP-DAC 2016), receiving the IEEE Council on Electronic Design Automation Outstanding Service Award in 2016, and also the General Chair of the IEEE Asian Solid-State Circuits Conference in 2019. From 2005 to 2014, he was the Vice-President of the Association of Portuguese Speaking Universities, where he was the President from 2014 to 2017, and the Vice-President from 2021 to 2024; and has received three Macau Government Decorations-the Medal of Professional Merit (Portuguese-1999); the Honorary Title of Value (Chinese-2001), and the Medal of Merit in Education (Chinese-2021). He was the Founding Chair of IEEE Macau Section, from 2003 to 2005, the IEEE Macau Joint-Chapter on Circuits and Systems/Communications, from 2005 to 2008, and the 2009 World Chapter of the Year of IEEE CAS Society (CASS); the General Chair of the IEEE Asia Pacific Conference on CAS in 2008; the Vice-President Region 10 (Asia, Australia, and Pacific), from 2009 to 2011; and the VP-World Regional Activities and Membership of IEEE CASS, from 2012 to 2013, nominated the Best Associate Editor from 2012 to 2013. He has been a member of the Advisory Board of the JOURNAL OF SEMICONDUCTORS of the Chinese Institute of Electronics, Institute of Semiconductors, Chinese Academy of Sciences, since January 2021, and a Fellow of the Asia-Pacific Artificial Intelligent Association since October 2021. He was also a member of the IEEE CASS Fellow Evaluation Committee (the Chair in 2013, 2014, and 2018 and the Vice-Chair 2019, 2021, and 2022). He was the IEEE Nominating Committee of Division I Director (CASS/EDS/SSCS) in 2014, and the IEEE CASS Nominations Committee from 2016 to 2017. Since July 2010 was elected, unanimously, to the Lisbon Academy of Sciences, as Corresponding from 2010 to 2022 and has been an Effective Member since 2022, being the only Portuguese Academician working and living in Asia.