Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 17

Variability Issues in Three-Dimensional ICs*

Abstract

The effects of variations on the behavior of three-dimensional (3-D) circuits are reviewed in this chapter. Stochastic models that describe the performance variability of 3-D circuits are presented. Both interdie and intradie variations are considered. Detailed analytic models to describe the combined effects of die-to-die and within-die variations for clock paths that span more than one tier are included. The distribution of clock skew for different clock networks is evaluated based on this model, and enhanced topologies that reduce skew variability are presented. The skew model is extended to include power supply noise, and design guidelines for clock trees considering both power supply noise and process variations are presented.

Keywords

Die-to-die variations; within-die variations; process variability in 3-D circuits; clock skew variations; power supply noise in 3-D circuits; skitter

For several decades, the electrical behavior of both active devices and passive components in integrated circuits was primarily characterized with deterministic models. With feature size scaling well below micrometer dimensions, the increasing variability of the physical properties of these elements has rendered these models increasingly less accurate [619]. Consequently, since approximately the mid-1990s, a considerable body of research on statistical models has been developed that focus on the variability of on-chip transistors and interconnect [620–623].

Variability originates from both the manufacturing process and the environment of an integrated circuit. Environmental variability is typically attributed to fluctuations in the power supply of the circuits and temperature gradients within the ambient environment. Collectively, these variations are termed as process, voltage, and temperature or, succinctly, PVT variations.

Process variations are the result of a large and diverse number of imperfections in the manufacturing process. For example, aberrations in the stepper lens [624] or other imprecisions introduced from lithography [625] and/or during illumination [626] lead to slightly different physical properties of the transistors, thereby affecting the electrical behavior of the manufactured circuits. In addition to variations due to the optical process steps, fluctuations are also caused during fabrication of the interconnection layers. For example, the chemical mechanical polishing process utilized to smooth the interface of the deposited interconnect layers produces variations in the thickness of the metal layers [627].

Depending upon the underlying phenomena producing the manufacturing variations, these variations can be characterized as either systematic or random. For example, variations in the transistor channel length depend upon the orientation of the layout of the transistors and can be characterized in a systematic manner [628]. Alternatively, the number and distribution of the dopant atoms within the transistor channel, which determine the transistor voltage threshold, is a random phenomenon [622]. Whether a source of variation is treated as systematic or random also depends on the models that capture these variations. Therefore, if the model is of significant complexity, the assumption of a random process to model a source of physical variation may be a plausible approach from a design perspective [629].

The different scales at which process variations are manifested are illustrated in Fig. 17.1. Depending upon the stage of the fabrication process, these variations affect all of the transistors across a die in the same way but differently between dies (interdie variations), or cause the properties of each transistor within a die to differ from each other (intradie variations). Another nomenclature typically used for these types of variations that is also followed in this chapter is die-to-die (D2D) for interdie and within-die (WID) for intradie variations. The issue of variability in the manufacturing and design of integrated systems is a broad topic that by no means can be covered within this chapter (nor is this the intent). The interested reader is referred to many other excellent sources that consider this topic in much greater depth [629].

Figure 17.1 Classification of process variations and an illustration of the physical scale of the disparate sources of variations.

Historically, random or systematic variability was addressed by worst case design methodologies [630]; however, considering the increase in the magnitude of the variations, designing for the worst case can lead to a significant loss in performance at high overhead. Consequently, statistical models that reduce the pessimism of the design margins can recover some of this performance loss, yielding more competitive products. These models employ random variables to characterize WID or D2D variations, where these variables are usually assumed to be normally distributed. In this chapter, these process variations are assumed to follow a Gaussian distribution. The effects of variability on timing and, consequently, the parametric yield of 3-D circuits are discussed in the following section. An efficient model that characterizes the distribution of clock skew for 3-D clock distribution networks as compared to Monte Carlo simulations [631] is presented in Section 17.2. The combined effects of process and power noise fluctuations on the salient features of 3-D clock distribution networks are discussed in Section 17.3. Alternatively, temperature variations are not considered since thermal issues are extensively covered in previous chapters. The major concepts of this chapter are summarized in Section 17.4.

17.1 Process Variations in Data paths Within Three-Dimensional ICs

Interestingly, variability in 3-D circuits has to date not been adequately investigated. The multi-tier nature of these circuits requires different statistical models to account for process variations. To exemplify this requirement, consider the datapaths illustrated in Fig. 17.2. For the datapath shown in Fig. 17.2A, one random variable can be used for each gate to describe the effects of WID variations on the delay of this gate. Furthermore, one random variable, common for all of the gates along the path, is used to capture the effect of D2D variations. Thus, a statistical model that describes the delay distribution of this path requires seven random variables.

Figure 17.2 Example of intratier and intertier paths. (A) One random variable is required to model D2D variations, and (B) two random variables (one for each tier) are used to model D2D variations for the entire path.

Alternatively, for the path shown in Fig. 17.2B, which spans two physical tiers, the situation is somewhat different. The same number of random variables is required to model WID variations. Two variables are, however, needed to model D2D variations as some gates along the path are placed in another tier. Assuming that these tiers originate from different wafers (a reasonable assumption for the vast majority of systems), these variables should be modeled as independent. Furthermore, if all of the tiers are fabricated with the same process (e.g., a memory stack), these random variables are also considered to be identically distributed. Alternatively, this situation is not the same if a different process is used for all or some of the tiers. The former case is assumed in this chapter to model the multiple sources of D2D variations that can exist within a 3-D stack. The effects of multiple D2D variations on the speed of datapaths in a 3-D circuit are described in [632]. This work constitutes the basis of this section.

A statistical delay model for 3-D circuits is based on the critical path model described in [633]. This path model describes the performance of a datapath while hiding information from the lower abstraction levels. Although this approach adds some inaccuracy, a general path model is useful in adding insight into the effects of variability on the performance of 3-D circuits. Note that the general critical path model in [633] is for planar circuits. The model utilizes two primary parameters, the number of critical paths within a circuit N_cp and the number of stages within each critical path n_cp. Another assumption of the model is that all of the n_cp stages within a critical path are two input NAND gates.

To model D2D variations, a single random variable G is employed for all of the gates, while the effects of WID variations are modeled by a set of independent and identically distributed random variables notated as L_ij, where 1≤i≤N_cp and 1≤j≤n_cp. Consequently, L_ij describes the effects of WID variations on gate j of path i. Based on this notation, the maximum delay variation due to the combined effect of D2D and WID variations is [633]

$Δ T_{\max}^{2 - D} = \max_{1 \leq i \leq N_{cp}} (\sum_{j = 1}^{n_{cp}} a (G + L_{ij})) = \max_{1 \leq i \leq N_{cp}} (n_{cp} aG + \sum_{j = 1}^{n_{cp}} a L_{ij}),$ $Δ T_{\max}^{2 - D} = \max_{1 \leq i \leq N_{cp}} (\sum_{j = 1}^{n_{cp}} a (G + L_{ij})) = \max_{1 \leq i \leq N_{cp}} (n_{cp} aG + \sum_{j = 1}^{n_{cp}} a L_{ij}),$ (17.1)

(17.1)

where a is the sensitivity of the delay of the gates to process variations. Using the common assumption that both D2D and WID process variations follow a normal distribution, that is G~N(0, σ_G) and L~N(0, σ_L), the probability that the maximum delay variation of the critical paths is less than a specific delay τ is

$\Pr {Δ T_{\max}^{2 - D} \leq τ} = F_{{Δ T}_{\max}^{2 - D}} (τ) = f_{G} (\frac{τ}{α n_{cp}}) * ({F_{L} (\frac{τ}{α \sqrt{n_{cp}}})}^{N_{cp}}) .$ $\Pr {Δ T_{\max}^{2 - D} \leq τ} = F_{{Δ T}_{\max}^{2 - D}} (τ) = f_{G} (\frac{τ}{α n_{cp}}) * ({F_{L} (\frac{τ}{α \sqrt{n_{cp}}})}^{N_{cp}}) .$ (17.2)

(17.2)

The functions f_K() and F_K() are, respectively, the probability density function and the cumulative density function (cdf) of the random variable K and the sign * corresponds to the convolution operation.

The primary difference in the modeling procedure for delay variations in 3-D circuits is that a single variable G to describe D2D variations can no longer be used. This situation is due to the existence of intertier paths, as depicted in Fig. 17.2. Consequently, if a 3-D circuit includes m tiers, m random variables are required to model D2D variations. Based on the discussion of intra and intertier paths, the number of intertier and intratier critical paths within a 3-D circuit are, respectively, notated as $N_{cp}^{inter}$ $N_{cp}^{inter}$ and $N_{cp}^{intra}$ $N_{cp}^{intra}$ . If tier i contains $N_{cp}^{i}$ $N_{cp}^{i}$ intratier paths, the total number of intratier paths throughout a 3-D stack is

$N_{cp}^{intra} = \sum_{i = 1}^{m} N_{cp}^{i} .$ $N_{cp}^{intra} = \sum_{i = 1}^{m} N_{cp}^{i} .$ (17.3)

(17.3)

Accordingly, the within each intratier path within any tier of a 3-D circuit is assumed to comprise $n_{cp}^{intra}$ $n_{cp}^{intra}$ number of stages. Similarly, the number of stages of an intertier path is notated as $n_{cp}^{inter}$ $n_{cp}^{inter}$ , where the number of gates within this path placed in each tier i is denoted by $n_{cp}^{i}$ $n_{cp}^{i}$ . Based on this notation,

$n_{cp}^{inter} = \sum_{i = 1}^{m} n_{cp}^{i} .$ $n_{cp}^{inter} = \sum_{i = 1}^{m} n_{cp}^{i} .$ (17.4)

(17.4)

To better understand this notation, an example of a two-dimensional (2-D) and 3-D circuit is shown in Fig. 17.3. For the 2-D circuit depicted in Fig. 17.3A, $N_{cp} = 2$ $N_{cp} = 2$ and $n_{cp} = 3$ $n_{cp} = 3$ for both paths. Alternatively, in Fig. 17.3B, there is one intertier and two intratier paths, which, respectively, means that $N_{cp}^{intra} = 2$ $N_{cp}^{intra} = 2$ and $N_{cp}^{inter} = 1$ $N_{cp}^{inter} = 1$ . For the intratier paths, $n_{cp}^{intra} = 3$ $n_{cp}^{intra} = 3$ , while for the intertier path, $n_{cp}^{1} = 1$ $n_{cp}^{1} = 1$ and $n_{cp}^{2} = 2$ $n_{cp}^{2} = 2$ . Another distinction in the notation of WID random variables is shown in Fig. 17.3, where the variables for the WID variations of the intratier paths are notated as L_ij and the variables for the WID variations of the intertier paths are notated as L’_ij, capturing the delay variation of gate j in the (intra or intertier) critical path j. In a manner similar to (17.1), the delay variation of a critical path i within a multi-tier circuit for the intratier paths [632] is

$Δ T_{i}^{intra} = n_{cp}^{intra} a G_{g (i)} + (\sum_{j = 1}^{n_{cp}^{intra}} a L_{ij}), 1 \leq i \leq N_{cp}^{intra},$ $Δ T_{i}^{intra} = n_{cp}^{intra} a G_{g (i)} + (\sum_{j = 1}^{n_{cp}^{intra}} a L_{ij}), 1 \leq i \leq N_{cp}^{intra},$ (17.5)

(17.5)

and for the intertier paths [632] is

$Δ T_{i}^{inter} = (\sum_{j = 1}^{m} β n_{cp}^{j} G_{j}) + (\sum_{j = 1}^{n_{cp}^{inter}} β L_{ij}^{'}), 1 \leq i \leq N_{cp}^{inter},$ $Δ T_{i}^{inter} = (\sum_{j = 1}^{m} β n_{cp}^{j} G_{j}) + (\sum_{j = 1}^{n_{cp}^{inter}} β L_{ij}^{'}), 1 \leq i \leq N_{cp}^{inter},$ (17.6)

(17.6)

where g() is a mapping function that projects each intratier critical path onto one of the m tiers. The sensitivity of the gate delay to process variations for intra and intertier paths is, respectively, denoted by α and β. The maximum delay variation for the entire 3-D circuit is the maximum delay variation provided by (17.5) and (17.6),

$Δ T_{\max}^{3 - D} = \max_{1 \leq i \leq N_{cp}^{intra}, 1 \leq j \leq N_{cp}^{inter}} (Δ T_{i}^{intra}, Δ T_{j}^{inter}) .$ $Δ T_{\max}^{3 - D} = \max_{1 \leq i \leq N_{cp}^{intra}, 1 \leq j \leq N_{cp}^{inter}} (Δ T_{i}^{intra}, Δ T_{j}^{inter}) .$ (17.7)

(17.7)

Figure 17.3 Notation used in the delay variability model for 2-D and 3-D circuits. (A) 2-D circuit comprising two critical paths each with three logic gates, and (B) two-tier 3-D circuit contains three critical paths each with three stages, where two paths are intratier paths and one path is an intertier path. Two random variables are required in (B) to model the D2D variations of each tier [632].

Determining a closed-form expression for (17.7) is a complicated task. This intractability is due to the intertier paths [632]. Consequently, upper and lower bounds (LBs) are determined for (17.7), while for the delay variation of intratier paths, closed-form expressions are similar to (17.1). Note that in (17.5) a mapping function exists for each intratier path to another tier within the stack. Assuming that the intratier paths are divided equally among the m tiers of the stack, the cumulative distribution function (cdf) for the intratier paths in a 3-D circuit is

$F_{{Δ T}_{\max}^{3 - D}} (τ) = {[f_{G} (\frac{τ}{α n_{cp}^{intra}}) * ({F_{L} (\frac{τ}{α \sqrt{n_{cp}^{intra}}})}^{\frac{N_{cp}^{intra}}{m}})]}^{m},$ $F_{{Δ T}_{\max}^{3 - D}} (τ) = {[f_{G} (\frac{τ}{α n_{cp}^{intra}}) * ({F_{L} (\frac{τ}{α \sqrt{n_{cp}^{intra}}})}^{\frac{N_{cp}^{intra}}{m}})]}^{m},$ (17.8)

(17.8)

where * is the convolution operation. Different from a 2-D circuit the number of tiers affects the cdf, as described by (17.8). Furthermore, the critical paths are assumed to be equally split among the tiers of the stack. This mapping yields the worst case delay variability for the intratier paths [632]. These results suggest that to better manage variability in 3-D circuits, process aware physical design should carefully consider the distribution of the paths among the tiers within a stack.

Based on these analytic formulations, another conclusion is that a planar circuit has a higher likelihood to satisfy a timing constraint as compared to a 3-D circuit. The related assumptions are that both circuits contain the same number of critical paths, and that the 3-D circuit does not contain any intertier paths. Although this last assumption is rather restrictive, assuming that registers are present at the boundary of each tier for intertier nets, this argument demonstrates that process variability is an important issue for 3-D circuits, a situation that has to date been neglected. To exemplify these results, consider Fig. 17.4 where the cdf of different circuits with the same critical paths are considered. These curves demonstrate that a 2-D circuit exhibits a higher probability to satisfy a timing constraint (τ) as compared to a 3-D version of the same circuit. Moreover, a 3-D circuit with an even distribution of paths between the two tiers is the least likely circuit to satisfy a design constraint. Note that these results do not consider any changes that the third dimension brings into the physical design of the circuit.

Figure 17.4 Cdf of a 2-D circuit (dashed line), a 3-D circuit with uneven critical path distribution between the two tiers (dashed dotted line), and a 3-D circuit with the same number of critical paths in each tier (dotted lined) [632].

To provide a complete picture of the importance of process variations on datapaths within 3-D circuits an intertier path is considered. In this case, however, the delay variation cannot be described analytically and only bounds are provided [632]. The LB is set to the greatest delay variation of the intratier paths,

$Δ T_{\max}^{LB} = \max_{1 \leq i \leq N_{cp}^{intra}} (Δ T_{i}^{intra}) .$ $Δ T_{\max}^{LB} = \max_{1 \leq i \leq N_{cp}^{intra}} (Δ T_{i}^{intra}) .$ (17.9)

(17.9)

Although this approach appears as a crude simplification, the bound in (17.9) is not loose since the intertier path delay is determined by adding the random variables for the D2D variations. This summation decreases the standard deviation of an intertier path as compared to an intratier path. Assuming therefore that all of the traits of intra and intertier paths are the same (e.g., the number of gate stages and sensitivity of each gate delay to process variations), an intertier path always exhibits a lower likelihood to limit the performance of a 3-D circuit.

Alternatively, the upper bound (UB) can be described by [632]

$Δ T_{\max}^{UB} = \max (\max_{i} (Δ T_{i}^{intra}), \max_{i} (Δ T_{i}^{inter, UB})),$ $Δ T_{\max}^{UB} = \max (\max_{i} (Δ T_{i}^{intra}), \max_{i} (Δ T_{i}^{inter, UB})),$ (17.10)

(17.10)

where

$Δ T_{i}^{inter, UB} = \sum_{j = 1}^{m} β n_{cp}^{j} G_{j}^{'} + \sum_{j = 1}^{n_{cp}^{D 2 D}} β L_{ij}^{'} .$ $Δ T_{i}^{inter, UB} = \sum_{j = 1}^{m} β n_{cp}^{j} G_{j}^{'} + \sum_{j = 1}^{n_{cp}^{D 2 D}} β L_{ij}^{'} .$ (17.11)

(17.11)

In this expression, a set of new random variables $G_{j}^{'}$ $G_{j}^{'}$ is introduced which are assumed to be independent and identically distributed as G_j. Based on (17.10) and (17.11), the cdf of $Δ T_{\max}^{UB}$ $Δ T_{\max}^{UB}$ becomes [632]

$F_{{Δ T}_{\max}^{UB}} (τ) = [\prod_{i = 1}^{m} f_{G} (\frac{τ}{k_{1}}) * ({F_{L} (\frac{τ}{k_{2}})}^{N_{cp}^{i}})] [f_{G} (\frac{τ}{k_{3}}) * ({F_{L} (\frac{τ}{k_{4}})}^{N_{cp}^{inter}})],$ $F_{{Δ T}_{\max}^{UB}} (τ) = [\prod_{i = 1}^{m} f_{G} (\frac{τ}{k_{1}}) * ({F_{L} (\frac{τ}{k_{2}})}^{N_{cp}^{i}})] [f_{G} (\frac{τ}{k_{3}}) * ({F_{L} (\frac{τ}{k_{4}})}^{N_{cp}^{inter}})],$ (17.12)

(17.12)

where $k_{1} = a n_{cp}^{intra}$ $k_{1} = a n_{cp}^{intra}$ , $k_{2} = a \sqrt{n_{cp}^{intra}}$ $k_{2} = a \sqrt{n_{cp}^{intra}}$ , $k_{3} = β \sum_{i = 1}^{m} {(n_{cp}^{i})}^{2}$ $k_{3} = β \sum_{i = 1}^{m} {(n_{cp}^{i})}^{2}$ , $k_{4} = β \sqrt{n_{cp}^{inter}}$ $k_{4} = β \sqrt{n_{cp}^{inter}}$ .

The application of this model on synthetic critical paths for a 90 nm process node demonstrates interesting tradeoffs [632]. Assuming 1,000 critical paths, each consisting of six NAND gates, and introducing the parameter $γ = σ_{G}^{2} / σ_{tot}^{2}$ $γ = σ_{G}^{2} / σ_{tot}^{2}$ to describe the importance of D2D variations (modeled as a random variable G) over the total variations, the mean of the maximum critical path delay is determined for different placements of the critical paths among the tiers within a 3-D system. The parameter γ has the values, 0.25, 0.5, and 0.75, where a higher γ corresponds to a greater contribution of D2D variations. Furthermore, the number of tiers changes from one (i.e., a 2-D circuit) to six tiers.

These results indicate that if only intratier paths are assumed, the mean delay increases by 9.5% for γ=0.5, while for lower γ, the increase in the mean delay is lower. If only intratier paths are assumed, the D2D variations affect all of the gates along the critical paths within a tier in the same way, shifting the nominal mean delay. If the contribution of D2D variations is greater than the total variations, this shift is more pronounced. Alternatively, WID variations affecting each gate (or stage) of the critical paths have an averaging effect for longer (multistage) paths [634], leading to a smaller shift in the mean delay. Consequently, if the D2D variations are low, the increase in the mean delay is also low.

The placement of the critical paths among the tiers of the 3-D stack also affects the mean delay, where an uneven distribution of the critical paths among the tiers produces higher yield [632]. Although this situation suggests the placement of the critical paths in those tiers where D2D variations are lower, other design constraints may not permit such a placement. However, including variability in the physical design process for 3-D circuits can reduce the performance margins of the circuits; particularly since overall variability in 3-D systems is expected to worsen as compared to 2-D circuits [632].

Alternatively, if the critical paths span more than one tier, meaning that both intra and intertier critical paths exist in a 3-D circuit, the delay distribution of these paths decreases. This behavior is similar to the averaging effect of WID variations in intratier paths. As the gates within the intertier paths are spread over several tiers, the contribution of D2D variations from different tiers can cancel each other, yielding a narrower delay distribution of the critical paths as compared to those 3-D circuits that are comprised of only intratier critical paths. The interplay between the contribution of WID and D2D variations is applied in the following section to the design of more robust global clock distribution networks for 3-D circuits, where a highly accurate variation aware clock skew model is also described.

17.2 Effects of Process Variations on Clock Paths

Several global 3-D clock distribution networks are presented in Chapter 15, Synchronization in Three-Dimensional ICs. In this section, the effects of process variations on the principal traits of 3-D clock networks, such as clock skew and power, are discussed. To consider these effects, effective statistical delay models are required to describe the distribution of the clock skew for disparate 3-D clock network topologies. A model of the delay distribution of a clock buffer including both WID and D2D variations is described in Section 17.2.1. This model produces a delay distribution of clock paths, as described in Section 17.2.2. This distribution is, in turn, utilized to describe the distribution of clock skew in 3-D clock trees, as discussed in Section 17.2.3. Based on these models, the effects of process variations on clock skew distribution for different clock networks are reviewed, and a robust 3-D clock topology is presented in Section 17.2.4.

17.2.1 Statistical Delay Model of Clock Buffers

A typical buffered 3-D H-tree is illustrated in Fig. 17.5. The pairwise clock skew is defined as the skew between every pair of sinks in a 3-D clock distribution network, S_skew={s_i,j|s_i,j=D_i−D_j, 1≤i, j≤n_sink}. Sinks i and j can be located in any tier of the 3-D circuit. s_i,j denotes the skew between sinks i and j. The clock signal delay to sinks i and j is denoted, respectively, by D_i and D_j. The number of clock sinks is n_sink. To determine the distribution of skew S_skew for the clock sinks, a statistical delay model for a clock buffer is required.

Figure 17.5 3-D H-tree spanning four tiers. (A) Notation for all of the 64 sinks, and (B) certain sinks used to evaluate clock skew.

The overall variation in the delay of a clock buffer is dependent on the variations of the input capacitance and output resistance [635,636]. This approach considers the input slew rate to more accurately model the distribution of the buffer delay. The interconnects constituting a clock stage are modeled as distributed RC wires. The circuit illustrated in Fig. 17.6 is utilized to obtain the variation of the buffer delay for different input signal slew rates.

Figure 17.6 Elemental circuit to measure the distribution of delay due to variations in the buffer characteristics.

Let R_in denote the output resistance of a buffer driving a second buffer. The load capacitance of the buffer is denoted by C_l. Interconnects with diverse impedance characteristics are modeled by employing different R_int and C_int, where R_int and C_int denote, respectively, the resistance and capacitance of the interconnects. The interconnect R_int and C_int can also be adjusted to produce different slew rates for the input signal of the buffer shown in Fig. 17.6.

For a step input signal, the Elmore delay [414] from source S to nodes I and O (in Fig. 17.6) is, respectively,

$D_{SI} = 0.69 R_{in} C_{int} + 0.38 R_{int} C_{int} + 0.69 (R_{in} + R_{int}) C_{b},$ $D_{SI} = 0.69 R_{in} C_{int} + 0.38 R_{int} C_{int} + 0.69 (R_{in} + R_{int}) C_{b},$ (17.13)

(17.13)

${Δ D}_{SI} = 0.69 (R_{in} + R_{int}) {Δ C}_{b},$ ${Δ D}_{SI} = 0.69 (R_{in} + R_{int}) {Δ C}_{b},$ (17.14)

(17.14)

$D_{SO} = D_{SI} + D_{b} + 0.69 R_{b} C_{l},$ $D_{SO} = D_{SI} + D_{b} + 0.69 R_{b} C_{l},$ (17.15)

(17.15)

${Δ D}_{SO} = {Δ D}_{SI} + {Δ D}_{b} + 0.69 {Δ R}_{b} C_{l},$ ${Δ D}_{SO} = {Δ D}_{SI} + {Δ D}_{b} + 0.69 {Δ R}_{b} C_{l},$ (17.16)

(17.16)

where C_b, R_b, and D_b are, respectively, the input capacitance, output resistance, and intrinsic delay of the buffer. Variations of C_b, R_b, and D_b are, respectively, denoted by ΔC_b, ΔR_b, and ΔD_b. For the buffer shown in Fig. 17.6, R_in is considered constant (for the moment).

Through several Monte Carlo simulations, the delay variation at nodes I and O is measured by setting C_l to zero (corresponding to ${Δ D}_{{SO}_{0}}$ ${Δ D}_{{SO}_{0}}$ ) and a different value (e.g., 200 fF, corresponding to ${Δ D}_{{SO}_{1}}$ ${Δ D}_{{SO}_{1}}$ ). The mean value and standard deviation of ΔC_b, ΔR_b, and ΔD_b are obtained from (17.14) and (17.16) [636]. Assuming that process variations can be described by a Gaussian distribution, the electrical characteristics of a buffer can also be approximated by a Gaussian distribution [637],

${Δ C}_{b} ~ N (0, σ_{C_{b}}^{2}), {Δ R}_{b} ~ N (0, σ_{R_{b}}^{2}), {Δ D}_{b} ~ N (0, σ_{D_{b}}^{2}) .$ ${Δ C}_{b} ~ N (0, σ_{C_{b}}^{2}), {Δ R}_{b} ~ N (0, σ_{R_{b}}^{2}), {Δ D}_{b} ~ N (0, σ_{D_{b}}^{2}) .$ (17.17)

(17.17)

$σ_{C_{b}}$ $σ_{C_{b}}$ is characterized from (17.14). According to (17.14) and (17.16), $σ_{D_{SO}}$ $σ_{D_{SO}}$ is dependent on $σ_{C_{b}}$ $σ_{C_{b}}$ , $σ_{D_{b}}$ $σ_{D_{b}}$ , and $σ_{R_{b}}$ $σ_{R_{b}}$ , and the covariance $σ_{D_{SO}}^{2}$ $σ_{D_{SO}}^{2}$ among these variables is

$σ_{D_{SO}}^{2} = {(0.69 (R_{in} + R_{int}) σ_{C_{b}})}^{2} + σ_{D_{b}}^{2} + {(0.69 σ_{R_{b}} C_{l})}^{2} + 1.38 (R_{in} + R_{int}) cov (D_{b}, C_{b})$ $σ_{D_{SO}}^{2} = {(0.69 (R_{in} + R_{int}) σ_{C_{b}})}^{2} + σ_{D_{b}}^{2} + {(0.69 σ_{R_{b}} C_{l})}^{2} + 1.38 (R_{in} + R_{int}) cov (D_{b}, C_{b})$

$+ 1.38 C_{l} cov (D_{b}, R_{b}) + 0.952 C_{l} (R_{in} + R_{int}) cov (C_{b}, R_{b}),$ $+ 1.38 C_{l} cov (D_{b}, R_{b}) + 0.952 C_{l} (R_{in} + R_{int}) cov (C_{b}, R_{b}),$ (17.18)

(17.18)

$σ_{D_{b}}^{2} = σ_{{D_{SO}}_{0}}^{2} - σ_{D_{SI}}^{2} - 1.38 (R_{in} + R_{int}) cov (D_{b}, C_{b}),$ $σ_{D_{b}}^{2} = σ_{{D_{SO}}_{0}}^{2} - σ_{D_{SI}}^{2} - 1.38 (R_{in} + R_{int}) cov (D_{b}, C_{b}),$ (17.19)

(17.19)

$σ_{C_{b}}^{2} = \frac{σ_{D_{SI}}}{0.69 (R_{in} + R_{int})} .$ $σ_{C_{b}}^{2} = \frac{σ_{D_{SI}}}{0.69 (R_{in} + R_{int})} .$ (17.20)

(17.20)

$σ_{R_{b}}$ $σ_{R_{b}}$ is obtained from (17.18) by substituting $σ_{D_{b}}$ $σ_{D_{b}}$ and $σ_{C_{b}}$ $σ_{C_{b}}$ , respectively, into (17.19) and (17.20).

Consider that ΔC_b, ΔR_b, and ΔD_b are used to obtain the delay variation of each buffer stage Δd_i, which is similar to ΔD_SO. When calculating $σ_{d_{i}}$ $σ_{d_{i}}$ (similar to recalculating $σ_{D_{SO 1}}$ $σ_{D_{SO 1}}$ through (17.18)), $σ_{C_{b}}$ $σ_{C_{b}}$ , $σ_{R_{b}}$ $σ_{R_{b}}$ , and $σ_{D_{b}}$ $σ_{D_{b}}$ are again substituted into (17.18). In this procedure, the covariances, cov(D_b, R_b), cov(D_b, C_b) and cov(C_b, R_b), effectively cancel. Consequently, the correlation among ΔC_b, ΔR_b, and ΔD_b does not significantly affect Δd_i as long as Δd_i is based on this same correlation. Since ΔC_b, ΔR_b, and ΔD_b originate from the same source of process variations, these variables are assumed in this model to be fully correlated.

17.2.2 Delay Distribution of Clock Paths

The buffer delay model described in the previous subsection is used to evaluate the delay distribution of clock paths in 3-D circuits. An example of a 3-D clock path is illustrated in Fig. 17.7. The devices in different physical tiers are connected by TSVs [181], which, in turn, are modeled as RC wires of different resistance and capacitance as compared to the horizontal wires (e.g., R_TSV and C_TSV in Fig. 17.7). R_TSV and C_TSV are considered fixed.

Figure 17.7 Electrical model of a segment of an intertier clock path.

Consider the clock path consisting of buffers i−1, i, and i+1. From (17.14) and (17.16), the delay variation Δd_i attributed to the variation of buffer i along a target path is

${Δ d}_{i} = 0.69 (R_{in (i)}^{'} + {Δ R}_{b (i - 1)}) {Δ C}_{b (i)} + 0.69 {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)})$ ${Δ d}_{i} = 0.69 (R_{in (i)}^{'} + {Δ R}_{b (i - 1)}) {Δ C}_{b (i)} + 0.69 {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)})$

$+ 0.69 R_{b (i)}^{'} {Δ C}_{b (j)} + {Δ D}_{b (i)},$ $+ 0.69 R_{b (i)}^{'} {Δ C}_{b (j)} + {Δ D}_{b (i)},$ (17.21)

(17.21)

$R_{in (i)} = R_{b (i - 1)} + R_{TSV},$ $R_{in (i)} = R_{b (i - 1)} + R_{TSV},$ (17.22)

(17.22)

$C_{l (i)} = {2 C}_{int} + C_{b (i + 1)} + C_{b (j)},$ $C_{l (i)} = {2 C}_{int} + C_{b (i + 1)} + C_{b (j)},$ (17.23)

(17.23)

where the prime (′) denotes the nominal value. For buffer i, the ΔR_b(i−1) of the upstream buffer and ΔC_b(i+1) of the downstream buffer are both included in (17.21). To determine the delay of a clock path, Δd_i for all of the buffers along this path is summed. In this case, ΔR_b(i−1)ΔC_b(i) and ΔR_b(i)ΔC_b(i+1) are duplicated. One of these two terms therefore needs to be removed. Consequently, Δd_i is rewritten as

${Δ d}_{i} = 0.69 (R_{in (i)}^{'} {Δ C}_{b (i)} + {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)} + R_{b (i)}^{'} {Δ C}_{b (j)}) + {Δ D}_{b (i)}$ ${Δ d}_{i} = 0.69 (R_{in (i)}^{'} {Δ C}_{b (i)} + {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)} + R_{b (i)}^{'} {Δ C}_{b (j)}) + {Δ D}_{b (i)}$

$= 0.69 (R_{in (i)}^{'} {Δ C}_{b (i)} + R_{b (i)}^{'} {Δ C}_{b (j)}) + {Δ D}_{b (i)} + δ_{i},$ $= 0.69 (R_{in (i)}^{'} {Δ C}_{b (i)} + R_{b (i)}^{'} {Δ C}_{b (j)}) + {Δ D}_{b (i)} + δ_{i},$ (17.24)

(17.24)

where

$δ_{i} = 0.69 {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)}) .$ $δ_{i} = 0.69 {Δ R}_{b (i)} (C_{l (i)}^{'} + {Δ C}_{b (i + 1)} + {Δ C}_{b (j)}) .$ (17.25)

(17.25)

The variation of ΔC_b is relatively low as compared with the nominal C_b (σ/μ<3% for both D2D and WID variations, as reported in Table 17.1). The observed delay variation of the buffers is also typically much lower than the nominal value (e.g., σ/μ≤5% for both D2D and WID variations, as reported in [638]). δ_i can be approximated using a first order linear Taylor series expansion around zero [637],

$δ_{i} \approx {[\frac{ϑ δ_{i}}{ϑ {Δ R}_{b (i)}}]}_{0} {Δ R}_{b (i)} + {[\frac{ϑ δ_{i}}{ϑ {Δ C}_{b (i + 1)}}]}_{0} {Δ C}_{b (i + 1)} + {[\frac{ϑ δ_{i}}{ϑ {Δ C}_{b (j)}}]}_{0} {Δ C}_{b (j)} = 0.69 C_{l (i)}^{'} {Δ R}_{b (i)} .$ $δ_{i} \approx {[\frac{ϑ δ_{i}}{ϑ {Δ R}_{b (i)}}]}_{0} {Δ R}_{b (i)} + {[\frac{ϑ δ_{i}}{ϑ {Δ C}_{b (i + 1)}}]}_{0} {Δ C}_{b (i + 1)} + {[\frac{ϑ δ_{i}}{ϑ {Δ C}_{b (j)}}]}_{0} {Δ C}_{b (j)} = 0.69 C_{l (i)}^{'} {Δ R}_{b (i)} .$ (17.26)

(17.26)

Table 17.1

Variations in the Electrical Characteristics of the Buffers

Input Slew	R_b (Ω)			C_b (fF)			D_b (ps)
Input Slew	μ	σ_WID	σ_D2D	μ	σ_WID	σ_D2D	μ	σ_WID	σ_D2D
47 (mV/ps)	371	18.8	15.3	4.9	0.04	0.03	19.9	1.04	0.85
47 (mV/ps)	σ/μ	5.1%	4.1%	σ/μ	0.8%	0.7%	σ/μ	5.2%	4.3%
16 (mV/ps)	349	17.8	14.7	5.7	0.31	0.16	24.8	1.49	1.21
16 (mV/ps)	σ/μ	5.1%	4.2%	σ/μ	2.3%	2.1%	σ/μ	6.0%	4.9%
6 (mV/ps)	345	16.7	13.7	7.2	0.08	0.06	30.1	2.19	1.79
6 (mV/ps)	σ/μ	4.8%	4.0%	σ/μ	1.1%	0.9%	σ/μ	7.3%	5.9%

As reported in Table 17.1 and discussed in [632,638] and [639], σ/μ of the transistor characteristics is typically less than 5%. The 3σ variation is smaller than 15% of the nominal R_b and 10% for ΔC_b. Since ΔC_b and ΔR_b are modeled as Gaussian distributions, for more than 99.7% of buffers, ΔC_bΔR_b is lower than 1.5% C_bR_b. Moreover, from the nominal value and standard deviation of C_b, R_b, and D_b, as reported in Table 17.1, 0.69ΔC_bΔR_b and 0.69C_bR_b are, respectively, much lower than ΔD_b and D_b. Consequently, approximating δ_i with (17.26) does not introduce a significant loss of accuracy.

As mentioned previously, ΔR_b(i), ΔC_b(i), and ΔD_b(i) are approximated as Gaussian distributions and can be assumed to be fully correlated. According to (17.24) and (17.26), Δd_i is approximated as a Gaussian distribution.

${Δ d}_{i} ~ N (0, σ_{d_{i}^{D 2 D}}^{2} + σ_{d_{i}^{WID}}^{2}),$ ${Δ d}_{i} ~ N (0, σ_{d_{i}^{D 2 D}}^{2} + σ_{d_{i}^{WID}}^{2}),$ (17.27)

(17.27)

$σ_{d_{i}^{D 2 D}}^{2} = {\begin{matrix} {{(σ}_{1} + σ_{2} + σ_{3})}^{2} + σ_{4}^{2} if buffers i and j are in different tiers, \end{matrix}$ $σ_{d_{i}^{D 2 D}}^{2} = {\begin{matrix} {{(σ}_{1} + σ_{2} + σ_{3})}^{2} + σ_{4}^{2} if buffers i and j are in different tiers, \end{matrix}$ (17.28a)

(17.28a)

$σ_{d_{i}^{D 2 D}}^{2} = {\begin{matrix} {{(σ}_{1} + σ_{2} + σ_{3} + σ_{4})}^{2} if buffers i and j are in the same tier, \end{matrix}$ $σ_{d_{i}^{D 2 D}}^{2} = {\begin{matrix} {{(σ}_{1} + σ_{2} + σ_{3} + σ_{4})}^{2} if buffers i and j are in the same tier, \end{matrix}$ (17.28b)

(17.28b)

$σ_{d_{i}^{WID}}^{2} = {{(σ}_{5} + σ_{6} + σ_{7})}^{2} + σ_{8}^{2} + 2 corr (i, j) {(σ}_{5} + σ_{6} + σ_{7}) σ_{8} .$ $σ_{d_{i}^{WID}}^{2} = {{(σ}_{5} + σ_{6} + σ_{7})}^{2} + σ_{8}^{2} + 2 corr (i, j) {(σ}_{5} + σ_{6} + σ_{7}) σ_{8} .$ (17.29)

(17.29)

The terms σ₁ to σ₈ are, respectively,

$\begin{matrix} σ_{1} = 0.69 R_{in (i)}^{'} σ_{C_{b (i)}^{D 2 D}}, & σ_{2} = 0.69 C_{l (i)}^{'} σ_{R_{b (i)}^{D 2 D}}, \\ σ_{5} = 0.69 R_{in (i)}^{'} σ_{C_{b (i)}^{WID}}, & σ_{6} = 0.69 C_{l (i)}^{'} σ_{R_{b (i)}^{WID}}, \end{matrix} \begin{matrix} σ_{3} = σ_{D_{b (i)}^{D 2 D}}, & σ_{4} = 0.69 R_{b (i)}^{'} σ_{C_{b (i)}^{D 2 D}}, \\ σ_{7} = σ_{D_{b (i)}^{WID}}, & σ_{8} = 0.69 R_{b (i)}^{'} σ_{C_{b (i)}^{WID}} . \end{matrix}$ $\begin{matrix} σ_{1} = 0.69 R_{in (i)}^{'} σ_{C_{b (i)}^{D 2 D}}, & σ_{2} = 0.69 C_{l (i)}^{'} σ_{R_{b (i)}^{D 2 D}}, \\ σ_{5} = 0.69 R_{in (i)}^{'} σ_{C_{b (i)}^{WID}}, & σ_{6} = 0.69 C_{l (i)}^{'} σ_{R_{b (i)}^{WID}}, \end{matrix} \begin{matrix} σ_{3} = σ_{D_{b (i)}^{D 2 D}}, & σ_{4} = 0.69 R_{b (i)}^{'} σ_{C_{b (i)}^{D 2 D}}, \\ σ_{7} = σ_{D_{b (i)}^{WID}}, & σ_{8} = 0.69 R_{b (i)}^{'} σ_{C_{b (i)}^{WID}} . \end{matrix}$

The correlation between buffers i and j is denoted by corr(i, j). A model describing this spatial correlation is discussed in Appendix E.

Consequently, for a 3-D clock path to a sink u that includes n_u clock buffers, the variation of the delay is expressed as the summation of (17.24) of each buffer along the path. The variance of the distribution of a 3-D clock path is a Gaussian distribution consisting of WID and D2D variations of the buffers,

$Δ D_{u} = \sum_{i = 1}^{n_{u}} Δ d_{i},$ $Δ D_{u} = \sum_{i = 1}^{n_{u}} Δ d_{i},$ (17.30)

(17.30)

$Δ D_{u} ~ N (0, σ_{D_{u}^{D 2 D}}^{2} + σ_{D_{u}^{WID}}^{2}) .$ $Δ D_{u} ~ N (0, σ_{D_{u}^{D 2 D}}^{2} + σ_{D_{u}^{WID}}^{2}) .$ (17.31)

(17.31)

The variation of the delay of a 3-D clock path due to D2D process variations is the sum of the D2D variations of the buffer delay in all of the tiers,

$Δ D_{u}^{D 2 D} = \sum_{j = 1}^{m} Δ D_{u (j)}^{D 2 D},$ $Δ D_{u}^{D 2 D} = \sum_{j = 1}^{m} Δ D_{u (j)}^{D 2 D},$ (17.32)

(17.32)

$Δ D_{u (j)}^{D 2 D} = \sum_{j = 1}^{n_{u (j)}} Δ D_{u (j, i)}^{D 2 D},$ $Δ D_{u (j)}^{D 2 D} = \sum_{j = 1}^{n_{u (j)}} Δ D_{u (j, i)}^{D 2 D},$ (17.33)

(17.33)

where m is the number of tiers spanned by the clock tree. $Δ D_{u (j)}^{D 2 D}$ $Δ D_{u (j)}^{D 2 D}$ is the variation of the delay of the clock path from the clock source to sink u in tier j. The number of buffers located in tier j along this clock path is denoted by n_u(j). The variation of the delay related to the i^th buffer in tier j is denoted by ${Δ D}_{u (j, i)}$ ${Δ D}_{u (j, i)}$ . Since the D2D variations equally affect the buffers within the same tier, according to (17.27), (17.28), and (17.32), the distribution of $Δ D_{u (j)}^{D 2 D}$ $Δ D_{u (j)}^{D 2 D}$ is a Gaussian distribution. The D2D variations affect the buffers in different tiers independently and, therefore, $Δ D_{u (j)}^{D 2 D}$ $Δ D_{u (j)}^{D 2 D}$ is independent of $Δ D_{u (k)}^{D 2 D}$ $Δ D_{u (k)}^{D 2 D}$ for any j ≠ k. Consequently, according to (17.32), the distribution of $Δ D_{u}^{D 2 D}$ $Δ D_{u}^{D 2 D}$ is also a Gaussian distribution,

$Δ D_{u}^{D 2 D} ~ N (0, σ_{{Δ D}_{u}^{D 2 D}}^{2}),$ $Δ D_{u}^{D 2 D} ~ N (0, σ_{{Δ D}_{u}^{D 2 D}}^{2}),$ (17.34)

(17.34)

$σ_{D_{u}^{D 2 D}}^{2} = \sum_{j = 1}^{m} σ_{D_{u (j)}^{D 2 D}}^{2} = \sum_{j = 1}^{m} {(\sum_{i = 1}^{n_{u (j)}} σ_{D_{u (j, i)}^{D 2 D}})}^{2} .$ $σ_{D_{u}^{D 2 D}}^{2} = \sum_{j = 1}^{m} σ_{D_{u (j)}^{D 2 D}}^{2} = \sum_{j = 1}^{m} {(\sum_{i = 1}^{n_{u (j)}} σ_{D_{u (j, i)}^{D 2 D}})}^{2} .$ (17.35)

(17.35)

Alternatively, the variation of the delay of a 3-D clock path affected by WID variations is the sum of the WID variations of all of the buffers along this path. Consequently, according to (17.29), the distribution of $Δ D_{u}^{WID}$ $Δ D_{u}^{WID}$ is also a Gaussian distribution. The resulting variance of the delay of sink u due to WID variations is

$Δ D_{u}^{WID} ~ N (0, σ_{{Δ D}_{u}^{WID}}^{2}),$ $Δ D_{u}^{WID} ~ N (0, σ_{{Δ D}_{u}^{WID}}^{2}),$ (17.36)

(17.36)

$σ_{D_{u}^{WID}}^{2} = \sum_{i = 1}^{n_{u}} σ_{d_{i}^{WID}}^{2} + 2 \sum_{1 \leq i < j \leq n_{u}} corr (i, j) σ_{d_{i}^{WID}} σ_{d_{j}^{WID}},$ $σ_{D_{u}^{WID}}^{2} = \sum_{i = 1}^{n_{u}} σ_{d_{i}^{WID}}^{2} + 2 \sum_{1 \leq i < j \leq n_{u}} corr (i, j) σ_{d_{i}^{WID}} σ_{d_{j}^{WID}},$ (17.37)

(17.37)

where corr(i, j) is the correlation between the WID variations of buffers i and j. If buffers i and j are located in different tiers, corr(i, j)=0. The correlation of the impact of WID variations on different buffers within the same tier can be classified as systematic or random. The systematic WID variations typically exhibit a spatial correlation [638,640–642]. For those buffers located within the same tier, two types of correlations for WID variations exist, as described in Appendix E.

17.2.3 Clock Skew Distribution in Three-Dimensional Clock Trees

The clock skew between any pair of sinks in a 3-D clock tree is the difference in the clock delay between these sinks. For a 3-D clock tree with n_sink sinks distributed in m tiers, the nominal and variation of the clock skew s_u,v between sinks u and v are, respectively,

$s_{u, v}^{'} = D_{u}^{'} - D_{v}^{'},$ $s_{u, v}^{'} = D_{u}^{'} - D_{v}^{'},$ (17.38)

(17.38)

${Δ s}_{u, v} = Δ s_{u, v}^{WID} + Δ s_{u, v}^{D 2 D} = Δ D_{u}^{WID} - Δ D_{v}^{WID} + Δ D_{u}^{D 2 D} - Δ D_{v}^{D 2 D} .$ ${Δ s}_{u, v} = Δ s_{u, v}^{WID} + Δ s_{u, v}^{D 2 D} = Δ D_{u}^{WID} - Δ D_{v}^{WID} + Δ D_{u}^{D 2 D} - Δ D_{v}^{D 2 D} .$ (17.39)

(17.39)

The mean value of Δs_u,v is E(Δs_u,v)=E( $Δ s_{u, v}^{WID}$ $Δ s_{u, v}^{WID}$ )=E( $Δ s_{u, v}^{D 2 D}$ $Δ s_{u, v}^{D 2 D}$ )=0. The terms ΔD_u^WID−ΔD_v^WID and ΔD_u^D2D−ΔD_v^D2D are independent. Consequently, $Δ s_{u, v}^{D 2 D}$ $Δ s_{u, v}^{D 2 D}$ and $Δ s_{u, v}^{WID}$ $Δ s_{u, v}^{WID}$ are treated separately. The correlation between every two terms in the expression of $Δ s_{u, v}^{D 2 D}$ $Δ s_{u, v}^{D 2 D}$ is one or zero (i.e., respectively, fully correlated or uncorrelated). According to (17.32), $Δ s_{u, v}^{D 2 D}$ $Δ s_{u, v}^{D 2 D}$ is the sum of the terms in different tiers,

$Δ s_{u, v}^{D 2 D} = \sum_{j = 1}^{m} {Δ s}_{{(u, v)}_{j}}^{D 2 D}, Δ s_{{(u, v)}_{j}}^{D 2 D} = \sum_{i = 1}^{n_{u (j)}} {Δ D}_{u (j, i)}^{D 2 D} - \sum_{i = 1}^{n_{v (j)}} {Δ D}_{v (j, i)}^{D 2 D},$ $Δ s_{u, v}^{D 2 D} = \sum_{j = 1}^{m} {Δ s}_{{(u, v)}_{j}}^{D 2 D}, Δ s_{{(u, v)}_{j}}^{D 2 D} = \sum_{i = 1}^{n_{u (j)}} {Δ D}_{u (j, i)}^{D 2 D} - \sum_{i = 1}^{n_{v (j)}} {Δ D}_{v (j, i)}^{D 2 D},$ (17.40)

(17.40)

where ${Δ D}_{u (j, i)}^{D 2 D}$ ${Δ D}_{u (j, i)}^{D 2 D}$ is the D2D delay variation related to the i^th buffer in the j^th tier along the clock path ending at sink u. The number of buffers in the j^th tier along this path is denoted as n_u(j).

All of the buffers in the same tier are equally affected by D2D variations, meaning that the correlation between each pair of variables in (17.40) is one. Since ${Δ D}_{u (j, i)}^{D 2 D}$ ${Δ D}_{u (j, i)}^{D 2 D}$ and ${Δ D}_{v (j, i)}^{D 2 D}$ ${Δ D}_{v (j, i)}^{D 2 D}$ are both modeled as a Gaussian distribution, ${Δ D}_{s {(u, v)}_{j}}^{D 2 D}$ ${Δ D}_{s {(u, v)}_{j}}^{D 2 D}$ is also a Gaussian distribution. In (17.40), ∀ j₁ ≠ j₂ (1≤j₁, j₂≤m) ${Δ D}_{s {(u, v)}_{j_{1}}}^{D 2 D}$ ${Δ D}_{s {(u, v)}_{j_{1}}}^{D 2 D}$ is independent of ${Δ D}_{s {(u, v)}_{j_{2}}}^{D 2 D}$ ${Δ D}_{s {(u, v)}_{j_{2}}}^{D 2 D}$ . Consequently, $Δ s_{u, v}^{D 2 D}$ $Δ s_{u, v}^{D 2 D}$ is also described by a Gaussian distribution,

$Δ s_{u, v}^{D 2 D} ~ N (0, σ_{s_{u, v}^{D 2 D}}^{2}),$ $Δ s_{u, v}^{D 2 D} ~ N (0, σ_{s_{u, v}^{D 2 D}}^{2}),$ (17.41)

(17.41)

$σ_{D_{u}^{D 2 D}}^{2} = \sum_{j = 1}^{m} σ_{s_{u, v (j)}^{D 2 D}}^{2} = \sum_{j = 1}^{m} {(\sum_{i = 1}^{n_{u (j)}} σ_{D_{u (j, i)}^{D 2 D}} - \sum_{i = 1}^{n_{v (j)}} σ_{D_{v (j, i)}^{D 2 D}})}^{2} .$ $σ_{D_{u}^{D 2 D}}^{2} = \sum_{j = 1}^{m} σ_{s_{u, v (j)}^{D 2 D}}^{2} = \sum_{j = 1}^{m} {(\sum_{i = 1}^{n_{u (j)}} σ_{D_{u (j, i)}^{D 2 D}} - \sum_{i = 1}^{n_{v (j)}} σ_{D_{v (j, i)}^{D 2 D}})}^{2} .$ (17.42)

(17.42)

According to (17.37), the distribution of $Δ s_{u, v}^{WID}$ $Δ s_{u, v}^{WID}$ is also a Gaussian distribution,

$Δ s_{u, v}^{WID} ~ N (0, σ_{s_{u, v}^{WID}}^{2}),$ $Δ s_{u, v}^{WID} ~ N (0, σ_{s_{u, v}^{WID}}^{2}),$ (17.43)

(17.43)

$σ_{s_{u, v}^{WID}}^{2} = \sum_{i = n_{u, v} + 1}^{n_{u}} σ_{D_{u (i)}^{WID}}^{2} + \sum_{j = n_{u, v} + 1}^{n_{v}} σ_{D_{v (j)}^{WID}}^{2} + 2 \sum_{\begin{matrix} i, j = n_{u, v} + 1 \\ i < j \end{matrix}}^{n_{u}} corr (i, j) σ_{D_{u (i)}^{WID}} σ_{D_{u (j)}^{WID}}$ $σ_{s_{u, v}^{WID}}^{2} = \sum_{i = n_{u, v} + 1}^{n_{u}} σ_{D_{u (i)}^{WID}}^{2} + \sum_{j = n_{u, v} + 1}^{n_{v}} σ_{D_{v (j)}^{WID}}^{2} + 2 \sum_{\begin{matrix} i, j = n_{u, v} + 1 \\ i < j \end{matrix}}^{n_{u}} corr (i, j) σ_{D_{u (i)}^{WID}} σ_{D_{u (j)}^{WID}}$

$+ 2 \sum_{\begin{matrix} i, j = n_{u, v} + 1 \\ i < j \end{matrix}}^{n_{v}} corr (i, j) σ_{D_{v (i)}^{WID}} σ_{D_{v (j)}^{WID}} - 2 \sum_{\begin{matrix} n_{u, v} + 1 \leq i \leq n_{u} \\ n_{u, v} + 1 \leq j \leq n_{v} \end{matrix}} corr (i, j) σ_{D_{u (i)}^{WID}} σ_{D_{v (j)}^{WID}},$ $+ 2 \sum_{\begin{matrix} i, j = n_{u, v} + 1 \\ i < j \end{matrix}}^{n_{v}} corr (i, j) σ_{D_{v (i)}^{WID}} σ_{D_{v (j)}^{WID}} - 2 \sum_{\begin{matrix} n_{u, v} + 1 \leq i \leq n_{u} \\ n_{u, v} + 1 \leq j \leq n_{v} \end{matrix}} corr (i, j) σ_{D_{u (i)}^{WID}} σ_{D_{v (j)}^{WID}},$ (17.44)

(17.44)

where n_u,v is the number of buffers shared by the clock paths ending at sinks u and v, as depicted in Fig. 17.8. Downstream the buffers n_u,v, the subpaths to u and v do not share a buffer.

Figure 17.8 Clock paths to sinks u and v where the paths share n_u,v buffers.

According to (17.39) through (17.44), the variation of the clock skew Δs_u,v between sinks u and v in a 3-D clock tree is modeled as a Gaussian distribution,

$Δ s_{u, v} ~ N (0, σ_{s_{u, v}^{WID}}^{2} + σ_{s_{u, v}^{D 2 D}}^{2}) .$ $Δ s_{u, v} ~ N (0, σ_{s_{u, v}^{WID}}^{2} + σ_{s_{u, v}^{D 2 D}}^{2}) .$ (17.45)

(17.45)

If the maximum tolerant skew variation is ΔS≥0, the probability that a 3-D clock tree satisfies this constraint is

$P (| s_{u, v} | \leq Δ S) = \int_{- Δ S - s_{u, v}^{'}}^{Δ S - s_{u, v}^{'}} f_{Δ s_{u, v}} (t) dt,$ $P (| s_{u, v} | \leq Δ S) = \int_{- Δ S - s_{u, v}^{'}}^{Δ S - s_{u, v}^{'}} f_{Δ s_{u, v}} (t) dt,$ (17.46)

(17.46)

$f_{Δ s_{u, v}} (t) = \frac{1}{\sqrt{2 π σ_{s_{u, v}}^{2}}} e^{- t^{2} / (2 σ_{s_{u, v}}^{2})} .$ $f_{Δ s_{u, v}} (t) = \frac{1}{\sqrt{2 π σ_{s_{u, v}}^{2}}} e^{- t^{2} / (2 σ_{s_{u, v}}^{2})} .$ (17.47)

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 17. Variability Issues in Three-Dimensional ICs

Create new playlist

Sign In

Sign Up