

Received November 30, 2021, accepted December 12, 2021, date of publication December 23, 2021, date of current version December 30, 2021.

*Digital Object Identifier 10.1109/ACCESS.2021.3137851*

# A Nanosized-Metal-Grain Pattern-Dependent Threshold Voltage Model for the Work Function Fluctuation of GAA Si NW MOSFETs

# WEN-LI SUNG<sup>[1](https://orcid.org/0000-0001-7374-0964),2</sup> AND YIMING LI<sup>ID1,2,3,4,5</sup>, (Member, IEEE)<br><sup>1</sup>Parallel and Scientific Computing Laboratory, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan

<sup>2</sup> Institute of Communications Engineering, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan

<sup>3</sup>Department of Electrical Engineering and Computer Engineering, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan

4 Institute of Biomedical Engineering, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan

<sup>5</sup>Center for mmWave Smart Radar System and Technologies, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan

Corresponding author: Yiming Li (ymli@faculty.nctu.edu.tw)

This work was supported in part by the Ministry of Science and Technology (MOST), Taiwan, under Grant MOST 110-2221-E-A49-139, Grant MOST 109-2221-E-009-033, and Grant MOST 109-2634-F-009-030; and in part by the ''Center for mm Wave Smart Radar Systems and Technologies'' under the Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education in Taiwan.

**ABSTRACT** To estimate characteristic fluctuation of emerging devices, three-dimensional device simulation has been performed intensively for various random cases; however, it strongly relies on huge computational resources. In this paper, we report a nanosized-metal-grain pattern-dependent model by using the method of multi-variable non-linear regression (MVNLR) for the threshold voltage fluctuation ( $\sigma V_{th}$ ) of gate-allaround silicon nanowire metal-oxide-semiconductor field-effect transistors. The model is developed by perturbing the local metal grains of specific patterns and summing up each metal grain of various grain patterns. The method of MVNLR is applied to build equations for nominal devices, and devices with high work-function (WK) and low WK with respect to process parameters (radius of channel, gate length, channel doping, and oxide thickness), which are generated by the design of experiments. The proposed model can estimate the magnitude of  $\sigma V_{th}$  accurately (error rate  $< 1\%$ ) for all parameters, where an error-correction process is further adopted by using a simplified one-perceptron method.

**INDEX TERMS** Work function fluctuation, gate-all-around, nanowire, MOSFETs, threshold voltage, variability, metal grain, multi-variable non-linear regression, one-perceptron.

# **I. INTRODUCTION**

Three-dimensional (3D) gate-all-around (GAA) silicon (Si) nanowire (NW) metal-oxide-semiconductor field-effect transistors (MOSFETs) with high-κ/metal-gate technology are potential devices to replace the fin-type-field effect transistor (FinFET) due to improved electrostatic performance and transistor scaling in sub-5-nm nodes [1]–[5]. However, in nanoscaled devices, process variations will impact the yield of chips due to fluctuations of key device parameters, such as threshold voltage  $(V<sub>th</sub>)$ . The fluctuation sources include the random dopant fluctuation, work function fluctuation (WKF), random interface trap, random telegraph noise and process variation effect (PVE) [6]–[13]. Notably, the WKF has been considered to be a key fluctuation source due to large grain size or small metal grain number (MGN) by a 3D device simulation (3D-DS) for the GAA Si NW MOSFETs [13]–[15]. Moreover, the PVE of in-line critical-dimension (CD) variation affects the device characteristics significantly in the nanometer regime, for example, radius (*R*) of a cylindrical channel [13]. Different CDs may influence the results of the WKF-induced threshold-voltage variability  $(\sigma V_{th})$  because the area of metal grains is different in the assumption of the same MGN. Moreover, the GAA devices with multi-channels have been considered to increase the drive current [5]. Estimating the variability of the GAA devices requires huge computational resources based on the 3D-DS [14].

Previously, different models for the WKF-induced  $\sigma V_{th}$  had been proposed by various research groups.

The associate editor coordinating the review of this manuscript and approving it for publication was Yilun Shang.



**FIGURE 1.** Schematic of the nominal GAA Si NW MOSFET, where the Lg is 10 nm, the R is 5 nm, the L<sub>sd</sub> is optimized at 5 nm, and the T<sub>ox</sub> is<br>0.6 nm, respectively. The C1 is the cutting plane.

**TABLE 1.** The process parameters for the nominal GAA Si NW MOSFET.

| Parameter                                            | Value              |
|------------------------------------------------------|--------------------|
| $L_{\rm r}$ (nm)                                     | 10                 |
| $T_{ox}$ (nm)                                        | 0.6                |
| $R$ (nm)                                             | 5                  |
| $L_{sd}$ (nm)                                        | 5                  |
| Channel Doping (Ch dop) $(cm3)$                      | $5 \times 10^{17}$ |
| $S/D$ Doping (SD dop) (cm <sup>-3</sup> )            | $1 \times 10^{20}$ |
| $S/D$ Extension Doping (SDE dop) (cm <sup>-3</sup> ) | $5 \times 10^{18}$ |
| $LWK$ (eV)                                           | 4.432              |
| HWK (eV)                                             | 4.632              |
| Nominal EWK (eV)                                     | 4.552              |

In 2008 and 2010, H. F. Dadgour *et al.* had proposed an effective work-function (EWK) model to calculate the magnitude of  $\sigma V_{th}$  immediately and circuit performance for various MGN (MGN =  $(L_g/G_L) \times (W_{eff}/G_W)$ , where  $L_g$  is the gate length, *G<sup>L</sup>* and *G<sup>W</sup>* are the length and width of grain size, and  $W_{\text{eff}}$  is the effective width of the metal gate) [16]–[18]. However, the EWK model cannot create accurate results for emerging devices due to the location effect of a large metal grain size or few MGN (for example,  $MGN < 20$ ) in the nano-scaled gate area [19]. Thus, in 2011, S. H. Rasouli *et al.* had proposed an accurately physical-based model for planar-MOSFET [19]. This model is modified from the net-work transistors model [20], which considers the location effect of the work-function (WK) of adjacent metal grains. In 2017, P. H. Vardhan *et al.* had commented that the modified network transistors model [19] only works well for large-scale devices with relatively small grain sizes. Thus, by that time, they had proposed a new model from Laplace's potential solution and percolation resistance net-work models to estimate the WKF-induced  $\sigma V_{th}$  for FinFET devices [21]. In the same year, G. Indalecio *et al.* had proposed a novel fluctuation sensitivity map technique to capture fluctuation sensitivity and estimate the WKF-induced  $\sigma V_{th}$  for FinFET devices, which considers the location of fluctuations [22]. However, it still has a high error rate (ER). In 2018, P. H. Vardhan *et al.* extended the FinFET-based model to GAA NW MOSFETs with cylindrical geometry and constraint boundary conditions [23]. Moreover, in 2019, a Fourier-Bessel model had been proposed due to more accurate expression for GAA

NW MOSFETs [24], but, the Fourier-Bessel model also has a large ER for large grain size. In the same year, K. Ko *et al.* proposed a compact model strategy to reduce the ER by adding a percentage to capture the interactions between the grains for the case of large grain size [15]. Nevertheless, they have highlighted their approach still has a high ER due to many preconditions. Thus, they had proposed an artificialneural-network-based machine learning (ML) approach with more accurate results for process variations [25]. Recently, various deep learning (DL) techniques had been studied for the WKF on the GAA Si NW devices with high efficiency and accuracy [26]. The advantage of these ML/DL models is data-oriented, not model-oriented. They do not need a fixed relationship or physical model between the input and output parameters. However, one of disadvantages of these ML/DL models is hard to decide the best-hidden layers to ensure the global convergence for the WKF-induced variability of the GAA devices with multi-channels with respect to various process conditions and MGNs.

In this work, we propose the nanosized-metal-grain pattern-dependent model by using the method of multivariable non-linear regression (MVNLR) to overcome the aforementioned issues. The advantages of the proposed model are fast, simple and easy to calculate the WKF-induced  $\sigma V_{th}$  for various process conditions. To minimize the ER, an one-perceptron method [27] is simplified and applied to conduct the error correction (EC).

This paper is structured as follows: Section II presents the device structure, statistical device simulation and model for the GAA Si NW MOSFETs. Section III reports the results and the detailed discussion for different MGNs, process conditions and the GAA devices with multi-channels. In addition, we also discuss the main error source and their physical mechanisms. Finally, we conclude and offer directions for future research.

#### **II. DEVICE, WKF, AND STATISTICAL MODEL**

# A. THE SIMULATED DEVICE

Fig. 1 shows the schematic of the nominal GAA Si NW MOSFET, where the  $L_g$  is 10 nm, the *R* of a cylindrical channel is 5 nm, the S/D extension  $(L_{sd})$  is optimized at 5 nm, and the oxide thickness  $(T_{ox})$  is 0.6 nm, respectively. To simulate the low-power devices with higher  $V_{th}$ , the nominal EWK of the TiN metal is set to 4.552 eV [13], [17] for the nominal device, where the low WK (LWK) is 4.432 eV, the high WK (HWK) is 4.632 eV, and the nominal EWK is calculated by LWK  $\times$  0.4 + HWK  $\times$  0.6. Table 1 summarizes the process parameters for the nominal GAA Si NW MOSFET. Notably, a 3D quantum-mechanically corrected transport model was calibrated against the experimental  $I_D-V_G$  curve for the nominal GAA Si NW MOSFET [8].

# B. THE WKF SIMULATION

According to grain orientations of TiN [28]–[31], we set TiN<111> (probability (p) = 0.4, LWK = 4.432 eV),



**FIGURE 2.** The cuboid method for the WKF simulations for three different MGNs and the transformation of 3D metal gate to 2D plane, where the metal gate is partitioned into three grain sizes and 2000 different grain patterns are generated by the MC program. (a) MGN = 16. (b) MGN = 80. (c) MGN = 320. (d) The distributions of the ratio of HWK are normal distributions for different MGNs, where the ratio of HWK is the number of HWK divided by MGN.

TiN<200> (p = 0.6, HWK = 4.632 eV), MGNs and 2000 simulated samples in Monte Carlo (MC) program, respectively. To label the index number of metal grains in different locations, we transfer the 3D metal gate to 2D plane. As shown in Figs. 2(a)-(c), the 2000 random patterns (samples) with the label number can be generated by our MC program for the WKF simulations. The probabilistic distribution of the ratio of HWK (i.e., the number of HWK divided by MGN) follow the ''binomial distribution'' and will be approached to the normal distribution in large enough MGN and samples by the central limit theory [17], as shown in Fig. 2(d). Because the probability density function of the ''binomial distribution'' is related to the MGN [17], we can obviously observe that the distributions become widen as the MGN decreases due to the increase of the standard deviation. Additionally, the mean of the ratio of HWK is close to the probability of HWK. It means that the MC program generates the similar distributions to the experimental data [17]. Fig. 3 illustrates the schematics of the patterns of the devices affected by HWK or LWK from S to D for different metal ring sizes (RS) [11]: (a)  $L_g/2$ , (b)  $L_g/5$ , and (c) *Lg*/10 nm, where all simulated cases are in the inset of plots. The simulated results of the devices affected by HWK or LWK will be fed into the nanosized-metal-grain patterndependent model to calculate the perturbations of local metal grains for various patterns. The starting positions of Sh\_L are set to 0 for the devices affected by HWK or LWK, where the metal grain in golden color has a nominal EWK (4.552 eV). The other positions can be shifted by adding the equal RS sequentially for the Sh\_L. Then, the devices affected by HWK or LWK are generated in different locations.

#### C. THE PROPOSED MODEL

The proposed model contains the equations of the nominal devices and the devices affected by HWK and LWK with respect to various process parameters.

#### 1) THE NOMINAL AND PERTURBED EQUATIONS

To build the equations of the nominal devices and the devices affected by HWK and LWK, the design of experiments (DOEs) is created by process parameters of *R* (range: 4∼6 nm, step: 0.5 nm), *Lg* (range: 9∼11 nm, step: 0.5 nm), Norm\_Ch\_dop (range: 3∼7, step: 1), and *Tox* (range: 0.6∼0.8 nm, step: 0.1 nm), where the Norm\_Ch\_dop is given



**FIGURE 3.** Schematics of the patterns of the devices affected by HWK or LWK from S to D for different metal ring sizes: (a) Lg/2, (b) Lg/5, and (c)  $Lg/10$  nm. According to the different grain sizes of Figs.  $2(a)$ -(c), the  $Lg$  is divided by 2, 5, and 10. In addition, the number of location grains of the RS is set to W<sub>eff</sub>/G<sub>W</sub>. Our program will generate the devices affected by HWK or LWK in different locations. The total simulated cases with HWK and LWK are in the inset of Fig. 3. The simulated results of the devices affected by HWK or LWK will be fed into the nanosized-metal-grain pattern-dependent model (See [\(9\)](#page-5-0) and (10)) to calculate the perturbations of local metal grains for various patterns.



FIGURE 4. A flow to build the equations of nominal devices, and devices affected by HWK/LWK from the MVNLR, where  $V_{th}^{nom}$  is the V<sub>th</sub> of nominal device, VHWK is the V<sub>th</sub> of device affected by HWK and VLWK is the V<sub>th</sub> of device affected by LWK. The red dash lines are inputs of statistical calculation package.

by the Ch\_dop divided by  $10^{17}$  cm<sup>-3</sup>. Fig. 4 shows a flow to build the equations of the nominal devices and the devices affected by HWK and LWK. First, we input the DOEs into the nominal devices and the devices affected by HWK and LWK for the 3D-DS. Second, we input the DOEs and the V<sub>th</sub> of the nominal devices, and the devices affected by HWK and LWK into the statistics calculation package. By using the method of the MVNLR, the equations of the nominal devices and the devices affected by HWK and LWK can be obtained for various process conditions. As shown in Fig. 5, for the nominal devices and devices with HWK and LWK, their  $V_{th}$ depends on  $R$ ,  $L_g$ , Norm\_Ch\_dop, and  $T_{ox}$  nonlinearly. Thus, we can apply the MVNLR directly at all data points to obtain the equations of the devices affected by HWK and LWK and the nominal devices for each process parameter, where the values of the regression coefficient are over 99.5%. The equations of the devices affected by HWK and LWK and the nominal devices are listed as follows, where the process parameters, ''*R*, *Lg*, Norm\_Ch\_dop,*Tox* '', denoted as ''*pr*'', *Num* is the number of location grains of the RS and *k* is the index number of metal grain:

<span id="page-3-0"></span>
$$
V_{th}^{nom} (pr)
$$
  
= 675.0 - 89.66 × R-19.193×Lg-83.6×T<sub>ox</sub>  
+0.655 × Norm\_ch\_dop + 5.211 × R × Lg - 21.64

$$
\times R \times T_{ox}
$$
  
+0.6059 × R×Norm\_Ch\_dop + 12.6×Lg × T<sub>ox</sub>  
+0.0621 × Lg × Norm\_Ch\_dop (1)

We denote  $V_{th}^{nom}(pr) = V_{th}^{nom}$  hereafter.

<span id="page-3-1"></span>
$$
Sh_L(k) = G_L \times \text{Quotient}((k-1)/\text{Num})
$$
 (2)

For the case of  $MGN = 16$ , we have

<span id="page-3-2"></span>
$$
V_{th}^{HWK} (pr, Sh\_L(k))
$$
  
= 652.5 - 86.9 × R-11.48×Lg  
-18.31 × Sh\\_L(k) - 83.6 × T<sub>ox</sub> + 1.52  
×Norm\_Ch\_dop  
+ 4.2 × Sh\\_L(k)<sup>2</sup> + 4.643 × R × Lg + 0.4613  
×R × Sh\_L(k)  
-21.98 × R × T<sub>ox</sub> + 0.675 × R × Norm\_Ch\_dop  
+7.54 × Lg × T<sub>ox</sub>  
-0.092 × Lg × Norm\_Ch\_dop - 1.106 × Sh\_L(k)  
×T<sub>ox</sub>  
-0.0158 × Sh\_L(k) × Norm\_Ch\_dop - 0.347 × Sh\_L(k)<sup>3</sup> (3)



(a)  $MGN = 16$ . (b)  $MGN = 80$ . (c)  $MGN = 320$ . (d) Nominal devices.

and

$$
V_{th}^{LWK} (pr, Sh_L(k))
$$
  
= 632.3 - 87.84 × R - 20.44 × Lg  
+15.34 × Sh\_L(k) - 126.7 × T<sub>ox</sub> + 1.56  
×Norm\_Ch\_dop  
-2.32 × Sh\_L(k)<sup>2</sup> + 5.113 × R × Lg - 0.4964  
×R × Sh\_L(k)  
-24.75 × R × T<sub>ox</sub> + 0.677 × R × Norm\_Ch\_dop  
+15.97 × Lg × T<sub>ox</sub>  
-0.107 × Lg × Norm\_Ch\_dop+2.488 × Sh\_L(k) × T<sub>ox</sub>  
+0.0256 × Sh\_L(k) × Norm\_Ch\_dop+0.125 × Sh\_L(k)<sup>3</sup>.  
(4)

For the case of  $MGN = 80$ , we have

$$
V_{th}^{HWK} (pr, Sh_L(k))
$$
  
= 655.6 - 86.48 × R - 15.84 × Lg  
+10.996 × Sh\_L(k) - 71.1 × T<sub>ox</sub> + 1.56  
×Norm\_Ch\_dop  
-3.1031 × Sh\_L(k)<sup>2</sup> + 4.87 × R × Lg + 0.2163  
×R × Sh\_L(k)  
-23.62 × R × T<sub>ox</sub> + 0.6675 × R × Norm\_Ch\_dop  
+12.07 × Lg × T<sub>ox</sub> - 0.0933 × Lg × Norm\_Ch\_dop  
+0.065 × Sh\_L(k) × T<sub>ox</sub> - 0.0099 × Sh\_L(k)  
×Norm\_Ch\_dop + 0.18067 × Sh\_L(k)<sup>3</sup> (5)

and

$$
V_{th}^{LWK} (pr, Sh_L(k))
$$
  
= 645.3 - 87.94 × R - 19.61 × Lg



**FIGURE 6.** The computational flow of the nanosized-metal-grain pattern-dependent model for various MGNs and process parameters, where the W\_nom, W\_HWK and W\_LWK are the weights for the nominal device and devices affected by HWK and LWK, respectively. The k represents the index number of the metal grains.

$$
-17.773 \times Sh_{L}(k) - 58.2 \times T_{ox} + 1.6
$$
  
\n
$$
\times \text{Norm\_Ch\_dop}
$$
  
\n
$$
+4.887 \times Sh_{L}(k)^{2} + 5.234 \times R \times Lg - 0.1579
$$
  
\n
$$
\times R \times Sh_{L}(k)
$$
  
\n
$$
-25.46 \times R \times T_{ox} + 0.674 \times R \times \text{Norm\_Ch\_dop}
$$
  
\n
$$
+11.85 \times Lg \times T_{ox} - 0.107 \times Lg \times \text{Norm\_Ch\_dop}
$$
  
\n
$$
-0.353 \times Sh_{L}(k) \times T_{ox} + 0.0114 \times Sh_{L}(k)
$$
  
\n
$$
\times \text{Norm\_Ch\_dop} - 0.29459 \times Sh_{L}(k)^{3}.
$$
  
\n(6)

For the case of  $MGN = 320$ , we have

$$
V_{th}^{HWK} (pr, Sh\_L(k))
$$
  
= 662.11 – 87.5 × R – 17.921 × Lg  
+8.018 × Sh\\_L(k) – 72.43 × T<sub>ox</sub> + 1.621  
×Norm\_Ch\_dop  
-1.9136 × Sh\_L(k)<sup>2</sup> + 5.1035 × R × Lg + 0.1049  
×R × Sh\_L(k)  
-24.478 × R × T<sub>ox</sub> + 0.6663 × R × Norm\_Ch\_dop  
+12.533 × Lg × T<sub>ox</sub>  
-0.1002 × Lg × Norm\_Ch\_dop + 0.027 × Sh\_L(k) × T<sub>ox</sub>  
-0.0044 × Sh\_L(k) × Norm\_Ch\_dop + 0.1011  
×Sh\_L(k)<sup>3</sup> (7)

# and

$$
V_{th}^{LWK} (pr, Sh_L(k))
$$
  
= 643.7 - 87.54 × R - 18.35 × Lg  
-11.888 × Sh\_L(k) - 55.4 × T<sub>ox</sub> + 1.598  
×Norm\_Ch\_dop  
+2.8391 × Sh\_L(k)<sup>2</sup> + 5.131 × R × Lg - 0.0876  
×R × Sh\_L(k)

$$
-24.56 \times R \times T_{ox} + 0.6745 \times R \times \text{Norm\_Ch\_dop}
$$

$$
+11.44 \times Lg \times T_{ox} - 0.1048 \times Lg \times \text{Norm\_Ch\_dop}
$$

$$
-0.376 \times \text{Sh\_L}(k) \times T_{ox} + 0.0045 \times \text{Sh\_L}(k)
$$

$$
\times \text{Norm\_Ch\_dop} - 0.1529 \times \text{Sh\_L}(k)^3.
$$
(8)

We denote  $V_{th}^{HWK}(pr, Sh_L(k)) = V_{th}^{HWK}(k)$  and  $V_{th}^{LWK}(pr, Sh_L(k)) = V_{th}^{LWK}(k)$  hereafter.

# 2) THE OVERALL EQUATIONS AND THEIR COMPUTATIONAL FLOW

The nanosized-metal-grain pattern-dependent model is developed by perturbations of metal grains for various patterns with the EC process from evaluating the one-perceptron method. The equations of model are given by

<span id="page-5-0"></span>
$$
\delta V_{th}^{HWK}(k) = (V_{th}^{nom} - V_{th}^{HWK}(k)) / Num \qquad (9)
$$

$$
\delta V_{th}^{LWK}(k) = (V_{th}^{nom} - V_{th}^{LWK}(k))/Num \qquad (10)
$$

$$
\delta V_{th} = \sum_{k=1}^{MGN} (\delta V_{th}^{HWK}(k) \times W_{-HWK} + \delta V_{th}^{LWK}(k) \times W_{-LWK})
$$
\n(11)

and

<span id="page-5-1"></span>
$$
V_{th} = W_{nom} \times V_{th}^{nom} + \delta V_{th}, \qquad (12)
$$

where the  $V_{th}^{nom}$  is the V<sub>th</sub> of the nominal device, the  $V_{th}^{HWK}$ is the V<sub>th</sub> of the devices affected by HWK, the  $V_{th}^{LWK}$  is the V<sub>th</sub> of the devices affected by LWK, *Num* is the number of location grains of the RS, *k* is the index number of metal grain, the W\_nom, W\_HWK and W\_LWK are the weights for the nominal device and the devices affected by HWK and LWK, respectively. The perturbation of each metal grain is calculated by [\(9\)](#page-5-0) and (10) with respect to various grain patterns. Based on the assumption of the superposition principle, the total perturbation of all grains is calculated by [\(11\)](#page-5-0), where the weights of  $(11)$  are used to execute the EC process.



FIGURE 7. The fluctuated I<sub>D</sub>-V<sub>G</sub> of 2000 studied devices with random patterns induced by the WKF from the 3D-DS, where the red dot line is the nominal curve without the WKF. (a)  $\overline{MGN} = 16$ . (b)  $\overline{MGN} = 80$ . (c)  $\overline{MGN} = 320$ .

Without the EC process, the W\_nom, W\_HWK and W\_LWK are set to one. After calculating  $(11)$ , the V<sub>th</sub> can be obtained by [\(12\)](#page-5-1). Fig. 6 illustrates the computational flow of the nanosized-metal-grain pattern-dependent model for various MGNs and process parameters. First, we set various MGNs, sample size, HWK with  $p = 0.6$ , and LWK with  $p = 0.4$  as inputs of the MC program. The MC program will generate various patterns. Second, depending on the MGNs, the different computational flow will be selected. For example, if the case of MGN = 16 is selected, the nominal  $V_{th}$  will be calculated by  $(1)$ . Because  $(2)$  will be fed into  $(3)$  and  $(4)$ , we must calculate [\(2\)](#page-3-1) before the separation of HWK and LWK. After the separation, [\(3\)](#page-3-2) and (4) will be calculated. Then, the perturbation of each metal grain with HWK and LWK can be obtained by [\(9\)](#page-5-0) and (10), respectively. After executing the loop of *k* from 1 to MGN, the sum of all perturbations will be obtained by [\(11\)](#page-5-0). Finally, the program will output the value of  $V_{th}$  by [\(12\)](#page-5-1). After repeating various patterns, we can obtain each  $V_{th}$  for each pattern; and, then the WKF-induced  $\sigma V_{th}$ . For other cases of MGNs, the same flow of the case of  $MGN = 16$  can be followed. To minimize the computational errors, a simplified one-perceptron method is adopted to guarantee the high accuracy of the proposed model by adjusting the weights. For the simplified one-perceptron method, we do not consider the activation function as used in the conventional methods [27]. Moreover, in the conventional one-perceptron method, the number of weights depends on different MGN and the initial weights are randomly generated by a random number generator. This may not ensure the global convergence in the learning process for various MGNs and process parameters. Instead, the simplified one-perceptron method only has the W\_LWK and W\_HWK for various MGNs and process parameters because of the separation of the HWK and LWK. The weight can vary from 0.5 to 1.5 to ensure the global convergence because the perturbations of metal grains are computed by [\(9\)](#page-5-0) and (10). In this work, we vary the W\_LWK to find an optimal W. LWK so that the computational errors can be minimized, where  $W_HWK = 1$  is assumed. 2000 samples are randomly

generated and simulated by the 3D-DS for the model validation, where the metric definitions of ER and relative ER (RER) are  $[(\sigma V_{th}^{Model} - \sigma V_{th}^{3D-DS})/Mean] \times 100\%$  and  $[(\sigma \text{V}^{\text{Model}}_{th} - \sigma \text{V}^{\text{3D-DS}}_{th})/\sigma \text{V}^{\text{3D-DS}}_{th}] \times 100\%$ , respectively.



**FIGURE 8.** The ER and RER of  $\sigma V_{\text{th}}$  versus sample size for various MGNs without the EC process. (a) ER. (b) RER.

### **III. RESULTS AND DISCUSSIONS**

To validate the model accuracy, 2000 samples are randomly generated and simulated by the 3D-DS for the GAA Si



**FIGURE 9.** The cases of different cluster metal grains with the same grains of LWK and the corresponding  $V_{th}$  from the 3D-DS and model for the GAA Si NW MOSFETs.

NW MOSFETs with respect to various MGNs, as shown in Fig. 7. The magnitude of  $V_{th}$  is extracted by a constant current method from the fluctuated  $I_D-V_G$  curves, where the current criterion of  $V_{th}$  is  $(W_{ch}/L_g) \times 10^{-8}$  A, and  $W_{ch}$  is the circumference of a cylindrical channel. Fig. 8 illustrates the ER and RER of  $\sigma V_{th}$  versus sample size for various MGNs without the EC process. As shown in Fig. 8(a), the ER of  $\sigma V_{th}$  is slightly affected by sample size for the case of  $MGN = 16$ . This is because the case of  $MGN = 16$  has a comparatively larger grain size, therefore, it has more location effect as compared to the case of  $MGN = 80$ , etc. As shown in Fig. 8(b), the RER becomes large than the ER because of the lower value of  $\sigma V_{th}$ , it is prominent especially for the case of  $MGN = 320$ . The error sources may be attributed to the clusters of metal grains. It is because of the difference between HWK and LWK. Thus, the input values from [\(9\)](#page-5-0) and (10) will be biased. Then, the errors will be generated from the proposed model. To analyze the clusters of metal grains, as shown in Fig. 9, the  $V_{th}$  of Case A from the 3D-DS has a much lower value than that from the model because of the clusters of metal grains. It also can be seen that it causes a large error if the patterns have the clusters of metal grains. Fig. 10 shows the conduction band of Case A, Case B and Case C in the on-state ( $V_G = V_D = 0.6 V$ ) along the channel position from S to D, respectively. Fig.  $10(a)$ –(c) and  $10(a')-(c')$  show that the cluster of metal grains with the low WK in the channel of Case A alters the conduction band distributions of the 3D and 2D views in the channel. Moreover, Fig. 10(d) shows the plot of the 1D conduction band from S to D along the channel. This plot shows that the conduction band of Case A is relatively lower than that of Case B and Case C in the channel, thus generating a lower



**FIGURE 10.** The conduction band of Case A, Case B and Case C in the on-state ( $V_G = V_D = 0.6$  V) along the channel position from S to D, where the L1 cut-line is along the channel from S to D: (a)-(c) 3D view, (a')-(c') 2D view, and (d) 1D plot in the channel region.

 $V_{th}$  in Case A than that in Case B and Case C. To further reduce the computational errors, the EC process will be necessary for the proposed model. Using [\(11\)](#page-5-0), we can adjust the W\_LWK to fine tune the value of fluctuation and calculate the updated  $\sigma V_{th}$ . Therefore, by varying the W\_LWK (where  $W_HWK = 1$  is assumed), the RER plots are obtained, as shown in Fig. 11.

It can be observed that the plots are showing linear behavior. Thus, the optimized value of the W\_LWK can be obtained by the linear RER plot at  $RER = 0$ . By using the optimized value of the W\_LWK from [\(11\)](#page-5-0), the updated  $\sigma V_{th}$  can be obtained. Notably, the optimized value of the W\_LWK also can be achieved by minimizing the value of root-mean-square error. Fig. 12 illustrates the ER and RER versus sample size for various MGNs after the EC process. We find that the ER and RER have significant reductions after the EC process as compared to Fig. 8. In addition, both ER and RER are stable as the sample size increases. It indicates that the proposed model with the EC process can significantly reduce the errors and predict the accurate value of  $\sigma V_{th}$  in a large enough sample size. Table 2 summarizes the computational cost and error rate for various major methods [17], [25], [32]. To estimate the results of the WKF-induced  $\sigma V_{th}$  with respect



FIGURE 11. The RER versus W\_LWK of the V<sub>th</sub> on the GAA Si NW MOSFET for various MGNs, where 100 samples are used for the EC.



**FIGURE 12.** The (a) ER and (b) RER versus sample size for various MGNs after the EC process.

to various process parameters, we proceed with the 3D-DS simulation for the DOEs of the concerned process parameters  $(R, L_g, T_{ox}, \text{ and } Ch\_dop)$ . As shown in Fig. 13(a), in the same range of the process parameters, the variation of *R* plays a key factor to affect the variation of  $V_{th}$ . Thus, we vary different *R* to estimate the WKF-induced  $\sigma V_{th}$ . For different MGNs, as shown in Fig. 13(b), the variations of  $\sigma V_{th}$  are

insignificant (< 2%). However, the improvement of  $\sigma V_{th}$ is large for relatively larger MGN. It can be observed that the grain size is a key factor affecting the results of the WKF. Additionally, the proposed model can be applied not only with the results of the 3D-DS, but also directly from measured data. By applying a similar procedure, we can model and analyze the WKF-induced  $\sigma V_{th}$  of 3D vertically stacked GAA NW MOSFETs with multi-channels. As shown in Fig. 14, the WKF-induced  $\sigma V_{th}$  decreases exponentially by increasing the number of channels (or MGN) for the 3D-DS and the proposed model.

**TABLE 2.** List of the computational cost and error rate for various major methods.

| Method         | ER $(%)$<br>Computational Cost |            | Reference |
|----------------|--------------------------------|------------|-----------|
| $3D-DS$        |                                | High       | This work |
| EWK Model      | $< 30\%$ *                     | Low        | 171. [32] |
| ML.            | $< 1\%$                        | Medium     | [25]      |
| Proposed Model | $< 1\%$                        | $\Delta W$ | This work |

\* The value of ER was depending on different grain sizes and devices.



**FIGURE 13.** (a) The Pareto chart of process variations for different process parameters. (b) A plot of the  $\sigma V_{\text{th}}$  versus MGN for various R.

Furthermore, the absolute value of ER is lower than 0.4%. Thus, we can fit the data by using (13) for the GAA devices



**FIGURE 14.** Plot of the  $\sigma V_{th}$  versus MGN for the 3D-DS and model.

**TABLE 3.** The extracted parameters of the statistical model [14] for the 3d-ds and the proposed model.

| Method         | α (mV |       | ٨ı    | $\mathbf{R}^2$ |
|----------------|-------|-------|-------|----------------|
| 3D-DS          | 325   | 15.58 | 70 25 | 0.998          |
| Proposed Model | 3.306 | 23.93 | 0.09  | 0 980          |

with multi-channels [14].

$$
\sigma V_{th}(MGN) = \alpha \times exp[\beta/(MGN + \gamma)], \qquad (13)
$$

where the best regression coefficients:  $\alpha$  (mV),  $\beta$  (unitless), and  $\gamma$  (unit-less) are listed in Table 3. The value of the  $R^2$  is close to 1. Therefore, based on the low ER and high efficiency, we believe that the proposed model with the EC process is a good predictive approach for the WKF-induced  $\sigma V_{th}$ . Furthermore, in the process viewpoint, if the distributions of grain sizes can be measured by transmission electron microscopy from the foundries in advance, the proposed model can estimate the  $\sigma V_{th}$  affected by the WKF immediately. And, it can be plugged into circuit simulation for estimating the performance of CMOS circuits by circuit simulator and feedback the results for the yield improvement before the test results of chip-probe yield.

#### **IV. CONCLUSION**

In this paper, we have presented the nanosized-metal-grain pattern-dependent model to calculate the WKF-induced  $\sigma V_{th}$ . Moreover, the MVNLR has been explored with and without the EC process on the GAA Si NW MOSFETs for various process conditions (*R* (range: 4∼6 nm), *Lg* (range: 9∼11 nm), Ch\_dop (range: 3∼7 × 10<sup>17</sup> cm−<sup>3</sup> ), and *Tox* (range: 0.6∼0.8 nm) and MGNs (16∼320)). It has been proved that by using the proposed model, the high computational cost can be reduced significantly and the explored model outperforms in terms of accuracy using the EC process having fewer simulated sample sizes. Notably, the proposed model is only suitable for the GAA Si NW MOSFETs with

cylindrical channels. Currently, we are investigating the variability of vertically stacked GAA nanosheet. The approach reported in this work can also be advanced to model the variability of complementary FETs (CFETs) for circuit applications of CFETs.

# **REFERENCES**

- [1] R. Chau, ''Process and packaging innovations for Moore's law continuation and beyond,'' in *IEDM Tech. Dig.*, Dec. 2019, pp. 1–6.
- [2] J. Appenzeller, J. Knoch, M. T. Björk, H. Riel, H. Schmid, and W. Riess, ''Toward nanowire electronics,'' *IEEE Trans. Electron Devices*, vol. 55, no. 11, pp. 2827–2845, Nov. 2008, doi: [10.1109/TED.2008.2008011.](http://dx.doi.org/10.1109/TED.2008.2008011)
- [3] H. Mertens *et al.*, "Gate-all-around MOSFETs based on vertically stacked horizontal Si nanowires in a replacement metal gate process on bulk Si substrates,'' in *Proc. IEEE Symp. VLSI Technol.*, Jun. 2016, pp. 158–159.
- [4] S. Barraud, V. Lapras, B. Previtali, M. P. Samson, J. Lacord, S. Martinie, M.-A. Jaud, S. Athanasiou, F. Triozon, O. Rozeau, J. M. Hartmann, C. Vizioz, C. Comboroure, F. Andrieu, J. C. Barbe, M. Vinet, and T. Ernst, ''Performance and design considerations for gate-all-around stacked-NanoWires FETs,'' in *IEDM Tech. Dig.*, Dec. 2017, pp. 677–680.
- [5] T. Al-Ameri, V. P. Georgiev, F. Adamu-Lema, and A. Asenov, ''Simulation study of vertically stacked lateral Si nanowires transistors for 5-nm CMOS applications,'' *IEEE J. Electron Devices Soc.*, vol. 5, no. 6, pp. 466–472, Nov. 2017, doi: [10.1109/JEDS.2017.2752465.](http://dx.doi.org/10.1109/JEDS.2017.2752465)
- [6] G. Espineira, D. Nagy, G. Indalecio, A. J. Garcia-Loureiro, K. Kalna, and N. Seoane, ''Impact of gate edge roughness variability on FinFET and gate-all-around nanowire FET,'' *IEEE Electron Device Lett.*, vol. 40, no. 4, pp. 510–513, Apr. 2019, doi: [10.1109/LED.2019.2900494.](http://dx.doi.org/10.1109/LED.2019.2900494)
- [7] C. Millar, D. Reid, G. Roy, S. Roy, and A. Asenov, ''Accurate statistical description of random dopant-induced threshold voltage variability,'' *IEEE Electron Device Lett.*, vol. 29, no. 8, pp. 946–948, Aug. 2008, doi: [10.1109/LED.2008.2001030.](http://dx.doi.org/10.1109/LED.2008.2001030)
- [8] W.-L. Sung and Y. Li, ''DC/AC/RF characteristic fluctuations induced by various random discrete dopants of gate-all-around silicon nanowire n-MOSFETs,'' *IEEE Trans. Electron Devices*, vol. 65, no. 6, pp. 2638–2646, Jun. 2018, doi: [10.1109/TED.2018.2822484.](http://dx.doi.org/10.1109/TED.2018.2822484)
- [9] M.-L. Fan, V. P.-H. Hu, Y.-N. Chen, P. Su, and C.-T. Chuang, ''Analysis of single-trap-induced random telegraph noise on FinFET devices, 6T SRAM cell, and logic circuits,'' *IEEE Trans. Electron Devices*, vol. 59, no. 8, pp. 2227–2234, Aug. 2012, doi: [10.1109/TED.2012.](http://dx.doi.org/10.1109/TED.2012.2200686) [2200686.](http://dx.doi.org/10.1109/TED.2012.2200686)
- [10] W.-L. Sung, Y.-S. Yang, and Y. Li, ''Work-function fluctuation of gateall-around silicon nanowire n-MOSFETs: A unified comparison between cuboid and Voronoi methods,'' *IEEE J. Electron Devices Soc.*, vol. 9, pp. 151–159, 2021, doi: [10.1109/JEDS.2020.3046608.](http://dx.doi.org/10.1109/JEDS.2020.3046608)
- [11] W.-L. Sung and Y. Li, "Effects of random number and location of the nanosized metal grains on the threshold voltage variability of silicon gateall-around nanowire n-type metal-oxide-semiconductor field-effect transistors,'' *J. Comput. Electron.*, vol. 19, no. 4, pp. 1478–1484, Dec. 2020, doi: [10.1007/s10825-020-01572-9.](http://dx.doi.org/10.1007/s10825-020-01572-9)
- [12] H. Nam, Y. Lee, J.-D. Park, and C. Shin, ''Study of work-function variation in high-*k*/metal-gate gate-all-around nanowire MOSFET,'' *IEEE Trans. Electron Devices*, vol. 63, no. 8, pp. 3338–3341, Aug. 2016.
- [13] Y. Li, H.-T. Chang, C.-N. Lai, P.-J. Chao, and C.-Y. Chen, "Process variation effect, metal-gate work-function fluctuation and random dopant fluctuation of 10-nm gate-all-around silicon nanowire MOSFET devices,'' in *IEDM Tech. Dig.*, Dec. 2015, pp. 887–890.
- [14] W.-L. Sung and Y. Li, "Statistical prediction of nanosizedmetal-grain-induced threshold-voltage variability for 3D vertically stacked silicon gate-all-around nanowire n-MOSFETs,'' *J. Electron. Mater.*, vol. 49, no. 11, pp. 6865–6871, Nov. 2020, doi: [10.1007/](http://dx.doi.org/10.1007/s11664-020-08332-2) [s11664-020-08332-2.](http://dx.doi.org/10.1007/s11664-020-08332-2)
- [15] K. Ko, M. Kang, J. Jeon, and H. Shin, "Compact model strategy of metalgate work-function variation for ultrascaled FinFET and vertical GAA FETs,'' *IEEE Trans. Electron Devices*, vol. 66, no. 3, pp. 1613–1616, Mar. 2019, doi: [10.1109/TED.2019.2891677.](http://dx.doi.org/10.1109/TED.2019.2891677)
- [16] H. Dadgour, V. De, and K. Banerjee, "Statistical modeling of metal-gate work-function variability in emerging device technologies and implications for circuit design,'' in *Proc. IEEE/ACM Int. Conf. Comput.-Aided Design*, Nov. 2008, pp. 270–277.
- [17] H. F. Dadgour, K. Endo, V. K. De, and K. Banerjee, "Grain-orientation induced work function variation in nanoscale metal-gate transistors— Part I: Modeling, analysis, and experimental validation,'' *IEEE Trans. Electron Devices*, vol. 57, no. 10, pp. 2504–2514, Oct. 2010, doi: [10.1109/ted.2010.2063191.](http://dx.doi.org/10.1109/ted.2010.2063191)
- [18] H. F. Dadgour, K. Endo, V. K. De, and K. Banerjee, "Grain-orientation induced work function variation in nanoscale metal-gate transistors— Part II: Implications for process, device, and circuit design,'' *IEEE Trans. Electron Devices*, vol. 57, no. 10, pp. 2515–2525, Oct. 2010, doi: [10.1109/TED.2010.2063270.](http://dx.doi.org/10.1109/TED.2010.2063270)
- [19] S. H. Rasouli, C. Xu, N. Singh, and K. Banerjee, ''A physical model for work-function variation in ultra-short channel metal-gate MOSFETs,'' *IEEE Electron Device Lett.*, vol. 32, no. 11, pp. 1507–1509, Nov. 2011, doi: [10.1109/LED.2011.2166531.](http://dx.doi.org/10.1109/LED.2011.2166531)
- [20] X. Zhang, J. Li, M. Grubbs, M. Deal, B. Magyari-Köpe, B. M. Clemens, and Y. Nishi, ''Physical model of the impact of metal grain work function variability on emerging dual metal gate MOSFETs and its implication for SRAM reliability,'' in *IEDM Tech. Dig.*, Dec. 2009, pp. 57–60.
- [21] P. H. Vardhan, S. Mittal, S. Ganguly, and U. Ganguly, ''Analytical estimation of threshold voltage variability by metal gate granularity in finfet,'' *IEEE Trans. Electron Devices*, vol. 64, no. 8, pp. 3071–3076, Aug. 2017, doi: [10.1109/TED.2017.2712763.](http://dx.doi.org/10.1109/TED.2017.2712763)
- [22] G. Indalecio, N. Seoane, K. Kalna, and A. J. García-Loureiro, ''Fluctuation sensitivity map: A novel technique to characterise and predict device behaviour under metal grain work-function variability effects,'' *IEEE Trans. Electron Devices*, vol. 64, no. 4, pp. 1695–1701, Apr. 2017, doi: [10.1109/TED.2017.2670060.](http://dx.doi.org/10.1109/TED.2017.2670060)
- [23] P. H. Vardhan, S. Mittal, S. Ganguly, and U. Ganguly, "Analytical modeling of metal gate granularity based threshold voltage variability in NWFET,'' *Solid-State Electron.*, vol. 147, pp. 26–34, Sep. 2018, doi: [10.1016/j.sse.2018.05.007.](http://dx.doi.org/10.1016/j.sse.2018.05.007)
- [24] P. H. Vardhan, S. Ganguly, and U. Ganguly, "An accurate expression to estimate the metal gate granularity induced threshold voltage variability in NWFETs,'' *Solid-State Electron.*, vol. 152, pp. 65–71, Feb. 2019, doi: [10.1016/j.sse.2018.12.003.](http://dx.doi.org/10.1016/j.sse.2018.12.003)
- [25] K. Ko, J. K. Lee, M. Kang, J. Jeon, and H. Shin, "Prediction of process variation effect for ultrascaled GAA vertical FET devices using a machine learning approach,'' *IEEE Trans. Electron Devices*, vol. 66, no. 10, pp. 4474–4477, Oct. 2019, doi: [10.1109/TED.2019.2937786.](http://dx.doi.org/10.1109/TED.2019.2937786)
- [26] C. Akbar, Y. Li, and W.-L. Sung, "Deep learning algorithms for the work function fluctuation of random nanosized metal grains on gate-all-around silicon nanowire MOSFETs,'' *IEEE Access*, vol. 9, pp. 73467–73481, 2021, doi: [10.1109/ACCESS.2021.3079981.](http://dx.doi.org/10.1109/ACCESS.2021.3079981)
- [27] F. Chollet, *Deep Learning With Python*. New York, NY, USA: Manning Publications Co., 2021.
- [28] M. M. Hussain, M. A. Quevedo-Lopez, H. N. Alshareef, H. C. Wen, D. Larison, B. Gnade, and M. El-Bouanani, ''Thermal annealing effects on a representative high-*k*/metal film stack,'' *Semicond. Sci. Technol.*, vol. 21, no. 10, pp. 1437–1440, Oct. 2006, doi: [10.1088/0268-1242/21/10/012.](http://dx.doi.org/10.1088/0268-1242/21/10/012)
- [29] A. Yagishita, T. Saito, K. Nakajima, S. Inumiya, K. Matsuo, T. Shibata, Y. Tsunashima, K. Suguro, and T. Arikado, ''Improvement of threshold voltage deviation in damascene metal gate transistors,'' *IEEE Trans. Electron Devices*, vol. 48, no. 8, pp. 1604–1611, Aug. 2001, doi: [10.1109/16.936569.](http://dx.doi.org/10.1109/16.936569)
- [30] G. Indalecio, A. J. García-Loureiro, N. S. Iglesias, and K. Kalna, "Study of metal-gate work-function variation using Voronoi cells: Comparison of Rayleigh and gamma distributions,'' *IEEE Trans. Electron Devices*, vol. 63, no. 6, pp. 2625–2628, Jun. 2016, doi: [10.1109/TED.2016.2556749.](http://dx.doi.org/10.1109/TED.2016.2556749)
- [31] H. Nam and C. Shina, "Comparative study in work-function variation: Gaussian vs. Rayleigh distribution for grain size,'' *IEICE Electron. Exp.*, vol. 10, no. 9, pp. 1–5, 2013, doi: [10.1587/elex.10.20130109.](http://dx.doi.org/10.1587/elex.10.20130109)
- [32] S.-H. Chou, M.-L. Fan, and P. Su, ''Investigation and comparison of work function variation for FinFET and UTB SOI devices using a Voronoi approach,'' *IEEE Trans. Electron Devices*, vol. 60, no. 4, pp. 1485–1489, Apr. 2013, doi: [10.1109/TED.2013.2248087.](http://dx.doi.org/10.1109/TED.2013.2248087)



WEN-LI SUNG received the B.S. degree in physics from National Tsing Hua University, in 1995, and the M.S. degree in electronics from National Yang Ming Chiao Tung University (NYCU), Hsinchu, Taiwan, in 2000, where he is currently pursuing the Ph.D. degree with the Parallel and Scientific Computing Laboratory and the Institute of Communications Engineering. His research interests include simulation and optimization of advanced nano-scaled MOSFETs and circuits.



YIMING LI (Member, IEEE) is currently a Full Professor of electrical and computer engineering with the National Yang Ming Chiao Tung University (NYCU), Taiwan. He has authored or coauthored over 200 research papers appearing in international book chapters, journals, and conferences. His current research interests include computational electronics, device physics, semiconductor nanostructures, modeling and parameter extraction, biomedical and energy harvesting

devices, and optimization techniques. He has been a Program Committee Member of IEDM, since 2011.