The oscillators used in RF transceivers are usually embedded in a “synthesizer” environment so as to precisely define their output frequency. Synthesizer design has for decades proved a difficult task, leading to hundreds of RF synthesis techniques. RF synthesizers typically employ phase-locking and must deal with the generic PLL issues described in Chapter 9. In this chapter, we study one class called “integer-N” synthesizers. The chapter outline is shown below. The reader is encouraged to first review Chapters 8 and 9.
Recall from Chapter 3 that each wireless standard provides a certain number of frequency channels for communication. For example, Bluetooth has 80 1-MHz channels in the range of 2.400 GHz to 2.480 GHz. At the beginning of each communication session, one of these channels, fj, is allocated to the user, requiring that the LO frequency be set (defined) accordingly (Fig. 10.1). The synthesizer performs this precise setting.
The reader may wonder why a synthesizer is necessary. It appears that the control voltage of a VCO can be simply changed to establish the required LO frequency. The VCOs studied in Chapter 8 are called “free-running” because, for a given control voltage, their output frequency is defined by the circuit and device parameters. The frequency therefore varies with temperature, process, and supply voltage. It also drifts with time due to low-frequency phase noise components. For these reasons, VCOs are controlled by a phase-locked loop such that their output frequency can track a precise reference frequency (typically derived from a crystal oscillator).
The high precision expected of the LO frequency should not come as a surprise. After all, narrow, tightly-spaced channels in wireless standards tolerate little error in transmit and receive carrier frequencies. For example, as shown in Fig. 10.2, a slight shift leads to significant spillage of a high-power interferer into a desired channel. We know from the wireless standards studied in Chapter 3 that the channel spacing can be as small as 30 kHz while the center frequency lies in the gigahertz range.
Figure 10.3 shows the conceptual picture we have thus far formed of a synthesizer. The output frequency is generated as a multiple of a precise reference, fREF, and this multiple is changed by the channel selection command so as to cover the carrier frequencies required by the standard.
In addition to accuracy and channel spacing, several other aspects of synthesizers impact a transceiver’s performance: phase noise, sidebands, and “lock time.” We studied the effect of phase noise in Chapter 8 and deal with sidebands and lock time here. We know that, if the control voltage of a VCO is periodically disturbed, then the output spectrum contains sidebands symmetrically disposed around the carrier. This indeed occurs if a VCO is placed in a phase-locked loop and experiences the ripple produced by the PFD and CP nonidealities. We therefore wish to understand the effect of such sidebands (“spurs”). Illustrated in Fig. 10.4, the effect of sidebands is particularly troublesome in the receiver path. Suppose the synthesizer (the LO) output consists of a carrier at ωLO and a sideband at ωS, while the received signal is accompanied by an interferer at ωint. Upon downconversion mixing, the desired channel is convolved with the carrier and the interferer with the sideband. If ωint − ωS = ω0 − ωLO (=ωIF), then the downconverted interferer falls atop the desired channel. For example, if the interferer is 60 dB above the desired signal and the sideband is 70 dB below the carrier, then the corruption at the IF is 10 dB below the signal—barely an acceptable value in some standards.
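The dB bookkeeping in this example can be checked directly (a minimal sketch using the illustrative levels quoted above):

```python
# Spur-induced downconversion: the interferer level relative to the desired
# signal plus the sideband level relative to the carrier gives the level of
# the downconverted interferer relative to the downconverted desired channel.
interferer_rel_signal_dB = +60   # interferer 60 dB above the desired signal
sideband_rel_carrier_dB = -70    # LO sideband 70 dB below the carrier
corruption_dB = interferer_rel_signal_dB + sideband_rel_carrier_dB
print(corruption_dB)             # -10 -> corruption only 10 dB below the signal
```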
When the digital channel selection command in Fig. 10.3 changes value, the synthesizer takes a finite time to settle to a new output frequency (Fig. 10.6). Called the “lock time” for synthesizers that employ PLLs, this settling time directly subtracts from the time available for communication. The following example elaborates on this point.
Lock times required in typical RF systems vary from a few tens of milliseconds to a few tens of microseconds. (Exceptional cases such as ultra-wideband systems stipulate a lock time of less than 10 ns.) But how is the lock time defined? Illustrated in Fig. 10.8, the lock time is typically specified as the time required for the output frequency to reach within a certain margin (e.g., 100 ppm) around its final value.
Recall from Chapter 9 that a PLL employing a feedback divide ratio of N multiplies the input frequency by the same factor. Based on this concept, integer-N synthesizers produce an output frequency that is an integer multiple of the reference frequency (Fig. 10.9). If N increases by 1, then fout increases by fREF; i.e., the minimum channel spacing is equal to the reference frequency.
How does the synthesizer of Fig. 10.9 cover a desired frequency range, f1 to f2? The divide ratio must be programmable from, say, N1 to N2 so that N1fREF = f1 and N2fREF = f2. We therefore recognize two conditions for the choice of fREF: it must be equal to the desired channel spacing and it must be the greatest common divisor of f1 and f2. The choice may be dominated by one of the two conditions; e.g., the minimum channel spacing may be smaller than the greatest common divisor of f1 and f2.
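For example, for the Bluetooth band cited earlier, a minimal sketch (assuming fREF is chosen equal to the 1-MHz channel spacing) shows the required range of divide ratios:

```python
f_ref = 1e6                      # Hz; equal to the Bluetooth channel spacing
f1, f2 = 2400e6, 2480e6          # Hz; band edges to be covered
N1, N2 = int(f1 / f_ref), int(f2 / f_ref)
print(N1, N2)                    # 2400 2480 -> N must step in unity increments over this range
```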
The simplicity of the integer-N synthesizer makes it an attractive choice. Behaving as a standard PLL, this architecture lends itself to the analyses carried out in Chapter 9. In particular, the PFD/CP nonidealities and design techniques described in Chapter 9 directly apply to integer-N synthesizers.
Our study of PLL dynamics in Chapter 9 has dealt with frequency or phase changes at the input, a rare event in RF synthesizers. Instead, the transients of interest are those due to (1) a change in the feedback divide ratio, i.e., as the synthesizer hops from one channel to another, or (2) the startup switching, i.e., the synthesizer has been off to save power and is now turned on.
Let us consider the case of channel switching. Interestingly, a small change in N yields the same transient behavior as does a small change in the input frequency. This can be proved with the aid of the feedback system shown in Fig. 10.11, where the feedback factor A changes by a small amount, ΔA, at t = 0. The output after t = 0 is equal to
implying that the change is equivalent to multiplying X(s) by (1 − ΔA/A) while retaining the same transfer function. Since in the synthesizer environment, x(t) (the input frequency) is constant before t = 0, i.e., x(t) = f0, we can view multiplication by (1 − ΔA/A) as a step function from f0 to f0(1 − ΔA/A),1 i.e., a frequency jump of −(ΔA/A)f0.
The foregoing analysis suggests that, when the divide ratio changes, the loop responds as if an input frequency step were applied, requiring a finite time to settle within an acceptable margin around its final value. As shown in Fig. 10.12, the worst case occurs when the synthesizer output frequency must go from the first channel, N1fREF, to the last, N2 fREF, or vice versa.
In order to estimate the settling time, we combine the above result with the equations derived in Chapter 9, assuming N2 − N1 ≪ N1. If the divide ratio jumps from N1 to N2, this change is equivalent to an input frequency step of Δωin = (N2 − N1)ωREF/N1.
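As a numerical illustration (using the Bluetooth divide ratios from the earlier example), the equivalent input step is quite small:

```python
f_ref = 1e6                           # Hz
N1, N2 = 2400, 2480                   # worst-case hop across the band
delta_f_in = (N2 - N1) * f_ref / N1   # equivalent frequency step referred to the input
print(delta_f_in)                     # ~33.3 kHz
```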
We must also note that (a) the PLL settling equations are multiplied by the divide ratio, N1 (≈ N2) in an integer-N synthesizer, and (b) to obtain the settling time, the settling equations must be normalized to the final frequency, N2ωREF. For the normalized error to fall below a certain amount, α, we have
where
For example, if ζ < 1 (an underdamped response), Eqs. (10.7) and (10.8) yield
where ts denotes the settling time. A sufficient condition for the settling is that the exponential envelope decay to small values:
where the factor of 1/√(1 − ζ²) represents the resultant of the cosine and the sine in Eq. (10.11). We therefore obtain the settling time, ts, for a normalized error of α by equating this envelope to α.
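A closed-form estimate consistent with this envelope argument is sketched below; it assumes an underdamped (ζ < 1) second-order response whose error, normalized to the final frequency N2ωREF, decays as exp(−ζωnt)/√(1 − ζ²). The loop parameters in the example are illustrative, not taken from the text.

```python
import math

def lock_time(N1, N2, alpha, zeta, omega_n):
    """Estimate the settling time by equating the normalized exponential
    envelope (N2 - N1)/N2 * exp(-zeta*omega_n*t)/sqrt(1 - zeta**2) to alpha."""
    envelope0 = (N2 - N1) / (N2 * math.sqrt(1 - zeta**2))
    return math.log(envelope0 / alpha) / (zeta * omega_n)

# Example: full-band hop with fREF = 1 MHz, alpha = 100 ppm, zeta = 0.7,
# and omega_n = 2*pi*(50 kHz) (assumed loop values).
print(lock_time(2400, 2480, 100e-6, 0.7, 2*math.pi*50e3))   # ~28 us
```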
As observed in Chapter 9 and in this section, the loop bandwidth trades with a number of critical parameters, including the settling time and the suppression of the VCO phase noise. Another important trade-off is that between the loop bandwidth and the magnitude of the reference sidebands. In the locked condition, the charge pump inevitably disturbs the control voltage at every phase comparison instant, modulating the VCO. In order to reduce this disturbance, the second capacitor (C2 in Fig. 10.9) must be increased, and so must the main capacitor, C1, because C2 ≤ 0.2C1.
The principal drawback of the integer-N architecture is that the output channel spacing is equal to the input reference frequency. We recognize that both the lock time (≈ 100 input cycles) and the loop bandwidth (≈ 1/10 of the input frequency) are tightly related to the channel spacing. Consequently, synthesizers designed for narrow-channel applications suffer from a long lock time and only slightly reduce the VCO phase noise.
The trade-off between the loop bandwidth and the level of reference spurs has motivated extensive work on methods of spur reduction without sacrificing the bandwidth. Indeed, the techniques described in Chapter 9 alleviating issues such as charge sharing, channel-length modulation, and Up and Down current mismatch fall in this category. In this section, we study additional approaches that lower the ripple on the control voltage.
A key point in devising spur reduction techniques is that the disturbance of the control voltage occurs primarily at the phase comparison instant. In other words, Vcont is disturbed for a short duration and remains relatively constant for the rest of the input period. We therefore surmise that the output sidebands can be lowered if Vcont is isolated from the disturbance for that duration. For example, consider the arrangement shown in Fig. 10.13(a), where S1 turns off just before phase comparison begins and turns on slightly after the disturbance is finished. As a result, C2 senses only the stable value at X and holds this value when S1 is off. (The charge injection and clock feedthrough of S1 still slightly disturb Vcont.)
Unfortunately, the arrangement of Fig. 10.13(a) leads to an unstable PLL. To understand this point, we recognize that the isolation of Vcont from the disturbance also eliminates the role of R1. Since we keep S1 off until the disturbance is finished, the voltage sensed by C2 when S1 turns on is independent of the value of R1. That is, the circuit’s behavior does not change if R1 = 0. (As explained in Chapter 9, to create a zero, R1 must produce a slight jump on the control voltage each time a finite, random phase error is detected.)
Let us now swap the two sections of the loop filter as shown in Fig. 10.13(b), where S1 is still switched according to the waveforms in Fig. 10.13(a). Can this topology yield a stable PLL? Yes, it can. Upon experiencing a jump due to a finite phase error, node X delivers this jump to Vcont after S1 turns on. Thus, the role of R1 is maintained. On the other hand, the short-duration disturbance due to PFD and CP nonidealities is “masked” by S1, thereby leading to lower sidebands at the output [1, 2]. In practice, about half of C2 is tied to Vcont so as to suppress the charge injection and clock feedthrough of S1 [1, 2]. This “sampling loop filter” employs complementary transistors for S1 to accommodate a rail-to-rail control voltage.
In order to arrive at another method of spur reduction, let us return to the open-loop transfer function of a type-II second-order PLL,
and recall from Chapter 9 that R1 is added in series with C1 so as to create a zero in Hopen(s). We may then ask, is it possible to add a constant to KVCO/s rather than to 1/(C1s)? That is, can we realize
so as to obtain a zero? The loop now contains a zero at KVCO/K1, whose magnitude can be chosen to yield a reasonable damping factor.
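A symbolic sketch of this idea is shown below, using the Chapter 9 charge-pump factor IP/(2πC1s) (IP denotes the pump current; this restates the idea of adding K1 to KVCO/s and is not a reproduction of the exact form of Eq. (10.18)):

```python
# Adding a constant K1 to KVCO/s creates a zero without a resistor in series
# with C1: the numerator of the open-loop transfer function vanishes at
# s = -KVCO/K1.
import sympy as sp

s = sp.symbols('s')
K1, Kvco, Ip, C1 = sp.symbols('K1 K_VCO I_P C1', positive=True)
H_open = Ip / (2*sp.pi*C1*s) * (Kvco/s + K1)
num, _ = sp.fraction(sp.together(H_open))
print(sp.solve(sp.Eq(num, 0), s))    # [-K_VCO/K1] -> zero at KVCO/K1
```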
Before computing the damping factor, we ponder the meaning of K1 in Eq. (10.18). Since K1 simply adds to the output phase of the VCO, we may surmise that it denotes a constant delay after the VCO. But the transfer function of such a delay [of the form exp(−K1s)] would be multiplied by KVCO/s. To avoid this confusion, we construct a block diagram representing Eq. (10.18) [Fig. 10.14(a)], recognizing that KVCO/s + K1 is, in fact, the transfer function from Vcont to φ1, i.e.,
That is, K1 denotes a block that is controlled by Vcont. Indeed, K1 represents a variable-delay stage [3] having a “gain” of K1:
where Td is the delay of the stage [Fig. 10.14(b)].
The key advantage of the topology shown in Fig. 10.14(b) over a standard type-II PLL is that, by avoiding the resistor in series with C1, it allows this capacitor to absorb the PFD/CP nonidealities. By contrast, in the standard PLL, only the smaller capacitor plays this role.
In order to determine the damping factor, we use Eq. (10.18) to write the closed-loop transfer function:
It follows that
In Problem 10.1, we prove that, with a feedback divider, these parameters are revised as
Equation (10.25) implies that K1 must be scaled in proportion to N so as to maintain a reasonable value for ζ, a difficult task because delay stages that accommodate high frequencies inevitably exhibit a short delay. The architecture is therefore modified to that shown in Fig. 10.15, where the variable delay line appears after the divider [3]. The reader can show that
A retiming flipflop can be inserted between the delay line and the PFD to remove the phase noise of the former (Section 10.6.7).
In addition to the modulator and transmitter architectures introduced in Chapter 4, a number of other topologies can be realized that merge the modulation and frequency synthesis functions. We study two in this section and a few more in Chapter 12.
In addition to frequency synthesis, PLLs can also perform modulation. Recall from Chapter 3 that FSK and GMSK modulation can be realized by means of a VCO that senses the binary data. Figure 10.16(a) depicts a general case where the filter smoothes the time-domain transitions to some extent, thereby reducing the required bandwidth.2 The principal issue here is the poor definition of the carrier frequency: the VCO center frequency drifts with time and temperature with no bound. One remedy is to phase-lock the VCO periodically to a reference so as to reset its center frequency. Illustrated in Fig. 10.16(b), such a system first disables the baseband data path and enables the PLL, allowing fout to settle to NfREF. Next, the PLL is disabled and xBB(t) is applied to the VCO.
The arrangement of Fig. 10.16(b) requires periodic “idle” times during the communication to phase-lock the VCO, a serious drawback. Also, the output signal bandwidth depends on KVCO, a poorly-controlled parameter. Moreover, the free-running VCO frequency may shift from NfREF due to a change in its load capacitance or supply voltage. Specifically, as depicted in Fig. 10.17, if the power amplifier is ramped at the beginning of transmission, its input impedance ZPA changes considerably, thus altering the capacitance seen at the input of the buffer (“load pulling”). Also, upon turning on, the PA draws a very high current from the system supply, reducing its voltage by tens or perhaps hundreds of millivolts and changing the VCO frequency (“supply pushing”).
To alleviate the foregoing issues, the VCO can remain locked while sensing the baseband data. That is, the PLL in Fig. 10.16(b) continuously monitors and corrects the VCO output (i.e., the CP is always enabled). Of course, to impress the data upon the carrier successfully, the design must select a very slow loop so that the desired phase modulation at the output is not corrected by the PLL. Called “in-loop modulation,” this approach offers two advantages over the quadrature upconversion techniques studied in Chapter 4. First, in contrast to a quadrature GMSK modulator, it requires much less processing of the baseband data. Second, it obviates the need for the quadrature phases of the LO. Of course, this method can be applied only to constant-envelope modulation schemes.
In-loop modulation entails two drawbacks: (1) due to the very small PLL bandwidth, the VCO phase noise remains mostly uncorrected, and (2) the modulated signal bandwidth is a function of KVCO, a process- and temperature-dependent parameter.
A stringent requirement imposed by GSM has led to a transmitter architecture that employs a PLL with “offset mixing.” The requirement relates to the maximum noise that a GSM transmitter is allowed to emit in the GSM receive band, namely, −129 dBm/Hz. Illustrated in Fig. 10.19 is a situation where the TX noise becomes critical: user B receives a weak signal around f1 from user A while user C, located in close proximity to user B, transmits a high power around f2 along with significant broadband noise. As shown here, the noise transmitted by user C corrupts the desired signal around f1.
The problem of broadband noise is particularly pronounced in direct-conversion transmitters. As depicted in Fig. 10.20, each stage in the signal path contributes noise, producing high output noise in the RX band even if the baseband LPF suppresses the out-of-channel DAC output noise. In Problem 10.2, we observe that the far-out phase noise of the LO also manifests itself as broadband noise at the PA output.
In order to reduce the TX noise in the RX band, a duplexer filter can be interposed between the antenna and the transceiver. (Recall from Chapter 4 that a duplexer is otherwise unnecessary in GSM because the TX and RX do not operate simultaneously.) However, the duplexer loss (2–3 dB) lowers the transmitted power and raises the receiver noise figure.
Alternatively, the upconversion chain can be modified so as to produce a small amount of broadband noise. For example, consider the topology shown in Fig. 10.21(a), where the baseband signal is upconverted and applied to a PLL. If the PLL bandwidth is only large enough to accommodate the signal, then xout(t) ≈ x1(t), but the broadband noise traveling to the antenna arises primarily from the far-out phase noise of the VCO. That is, unlike the TX chain in Fig. 10.20, this architecture need only minimize the broadband noise of one building block. Note that x1(t) has a constant envelope.
The above approach dictates that the PFD and CP operate at the carrier frequency, a relatively difficult requirement. We therefore add a feedback divider to the PLL to proportionally reduce the carrier frequency of x1(t) [Fig. 10.21(b)]. If x1(t) = A1 cos[ω1t + φ(t)], where φ(t) denotes GMSK or other types of frequency or phase modulation, and if the PLL bandwidth is large enough, then
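(A sketch of the resulting output; A2 denotes the output amplitude, a label assumed here, and the factor-of-N scaling of both the carrier and the excess phase follows from the divide-by-N feedback.)

```latex
x_{out}(t) \approx A_2 \cos\!\left[ N\omega_1 t + N\phi(t) \right]
```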
Unfortunately, the PLL multiplies the phase by a factor of N, altering the signal bandwidth and modulation.
Let us now modify the architecture as shown in Fig. 10.21(c) [4]. Here, an “offset mixer,” MX1, downconverts the output to a center frequency of fREF, and the result is separated into quadrature phases, mixed with the baseband signals, and applied to the PFD.4 With the loop locked, x1(t) must become a faithful replica of the reference input, thus containing no modulation. Consequently, yI(t) and yQ(t) “absorb” the modulation information of the baseband signal. This architecture is called an “offset-PLL” transmitter or a “translational” loop.
The local oscillator waveform driving the offset mixer in Fig. 10.21(c) must, of course, be generated by another PLL according to the synthesis methods and concepts studied thus far in this chapter. However, the presence of two VCOs on the same chip raises concern with respect to mutual injection pulling between them. To ensure a sufficient difference between their frequencies, the offset frequency, fREF, must be chosen high enough (e.g., 20% of fLO). Additionally, two other reasons call for a large offset: (1) the stages following MX1 must not degrade the phase margin of the overall loop, and (2) the center frequency of the 90° phase shift must be much greater than the signal bandwidth to allow accurate quadrature separation. Another variant of offset PLLs returns the output of the mixer directly to the PFD [5].
The feedback divider used in integer-N synthesizers presents interesting design challenges: (1) the divider modulus, N, must change in unity steps, (2) the first stage of the divider must operate as fast as the VCO, (3) the divider input capacitance and required input swing must be commensurate with the VCO drive capability, (4) the divider must consume low power, preferably less than the VCO. In this section, we describe divider designs that meet these requirements.
It is important to note that divider design typically assumes the VCO to have certain voltage swings and output drive capability and, as such, must be carried out in conjunction with the VCO design. Shown in Fig. 10.23 is an example where the VCO runs at twice the carrier frequency to avoid injection-pulling and is followed by a ÷2 stage. This divider may need to drive a considerable load capacitance, CL, making it necessary to use wide transistors therein and hence present a large capacitance, Cdiv, to the VCO. A buffer can be inserted at the input and/or output of the divider but at the cost of greater power dissipation.
A common realization of the feedback divider that allows unity steps in the modulus is called the “pulse swallow divider.” Shown in Fig. 10.24, the circuit consists of three blocks:
1. A “dual-modulus prescaler”; this counter provides a divide ratio of N + 1 or N according to the logical state of its “modulus control” input.
2. A “swallow counter”; this circuit divides its input frequency by a factor of S, which can be set to a value of 1 or higher in unity steps by means of the digital input.5 This counter controls the modulus of the prescaler and also has a reset input.
3. A “program counter”; this divider has a constant modulus, P. When the program counter “fills up” (after it counts P pulses at its input), it resets the swallow counter.
We now show that the overall pulse swallow divider of Fig. 10.24(a) provides a divide ratio of NP + S. Suppose all three dividers begin from reset. The prescaler counts by N + 1, giving one pulse to the swallow counter (at point A) for every N + 1 pulses at the main input. The program counter counts the output pulses of the prescaler (point B). This continues until the swallow counter fills up, i.e., it receives S pulses at its input. [The main input therefore receives (N + 1)S pulses.] The swallow counter then changes the modulus of the prescaler to N and begins from zero again. Note that the program counter has thus far counted S pulses, requiring another P − S pulses to fill up. Now, the prescaler divides by N, producing P − S pulses so as to fill up the program counter. In this mode, the main input must receive N(P − S) pulses. Adding the total number of the pulses at the prescaler input in the two modes, we have (N + 1)S + N(P − S) = NP + S. That is, for every NP + S pulses at the main input, the program counter generates one pulse at the output. The operation repeats after the swallow counter is reset. Note that P must be greater than S.
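A behavioral sketch of this counting argument appears below; it simply tallies the input pulses consumed per output pulse (the function name and example numbers are ours):

```python
def pulse_swallow_period(N, P, S):
    """Input pulses per output pulse of the pulse swallow divider of Fig. 10.24:
    dual-modulus prescaler (N+1 or N), swallow counter modulus S, program
    counter modulus P, with P > S."""
    assert P > S
    input_pulses = 0
    swallow = 0                      # prescaler pulses counted by the swallow counter
    for _ in range(P):               # program counter fills after P prescaler pulses
        if swallow < S:              # swallow counter not full -> prescaler divides by N+1
            input_pulses += N + 1
            swallow += 1
        else:                        # swallow counter full -> prescaler divides by N
            input_pulses += N
    return input_pulses              # swallow counter is reset at this point

print(pulse_swallow_period(8, 300, 10))   # 8*300 + 10 = 2410
```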
Sensing the high-frequency input, the prescaler proves the most challenging of the three building blocks. For this reason, numerous prescaler topologies have been introduced. We study some in the next section. As a rule of thumb, dual-modulus prescalers are about a factor of two slower than ÷2 circuits.
The swallow counter is typically designed as an asynchronous circuit for the sake of simplicity and power savings. Figure 10.26 shows a possible implementation, where cascaded ÷2 stages count the input and the NAND gates compare the count with the digital input, DnDn−1...D1. Once the count reaches the digital input, Y goes high, setting the RS latch. The latch output then disables the ÷2 stages. The circuit remains in this state until the main reset is asserted (by the program counter).
An alternative approach to realizing the feedback divider in a synthesizer is described in [7]. This method incorporates ÷2/3 stages in a modular form so as to reduce the design complexity. Shown in Fig. 10.27, the divider employs n ÷2/3 blocks, each receiving a modulus control from the next stage (except for the last stage). The digital inputs set the
overall divide ratio according to
As mentioned above, dual-modulus prescalers pose the most difficult challenge in divider design. We also note from our analysis of the pulse swallow divider in Section 10.6.1 that the modulus change must be instantaneous, an obvious condition but not necessarily met in all dual-modulus designs. As explained in the following sections, circuits such as the Miller divider and injection-locked dividers take a number of input cycles to reach the steady state.
Let us begin our study of dual-modulus prescalers with a divide-by-2/3 circuit. Recall from Chapter 4 that a ÷2 circuit can be realized as a D-flipflop placed in a negative feedback loop. A ÷3 circuit, on the other hand, requires two flipflops. Shown in Fig. 10.28 is an example,6 where an AND gate (whose output is denoted X) drives the D input of FF2. Suppose the circuit begins in the state Q1Q2 = 00. After the first clock, the state becomes 01, and in the next three cycles it goes to 10, 11, and 01, repeating this sequence thereafter. Note that the state 00 does not occur again because it would require a combination of previous values at the flipflop inputs that is prohibited by the AND gate.
The circuit of Fig. 10.28 can now be modified so as to have two moduli. Illustrated in Fig. 10.31, the ÷2/3 circuit employs an OR gate to permit ÷3 operation if the modulus control, MC, is low (why?) or ÷2 operation if it is high. In the latter case, only FF2 divides the clock by 2 while FF1 plays no role. Thus, the output can be provided by only FF2.
It is possible to rearrange the ÷2/3 stage so as to reduce the loading on the second flipflop. Illustrated in Fig. 10.32 [6], the circuit precedes each flipflop with a NOR gate. If MC is low, then the output of FF1 is simply inverted by G2—as if FF2 directly followed FF1. The circuit thus reduces to the ÷3 stage depicted in Fig. 10.29(c). If MC is high, Q2 remains low, allowing G1 and FF1 to divide by two. Note that the output can be provided by only FF1. This circuit has a 40% speed advantage over that in Fig. 10.31 [6].
Figure 10.33 shows a ÷3/4 stage. If MC = ONE, G2 produces a ONE, allowing G1 to simply pass the output of FF1 to the D input of FF2. The circuit thus resembles four latches in a loop and hence divides by 4. If MC = 0, G2 passes its other input to G1, reducing the circuit to that in Fig. 10.28. We observe that the critical path (around FF2) contains a greater delay in this circuit than in the ÷3 stage of Fig. 10.28. A transistor-level design of such a divider is presented in Chapter 13.
The dual-modulus dividers studied thus far employ synchronous operation, i.e., the flipflops are clocked simultaneously. For higher moduli, a synchronous core having small moduli is combined with asynchronous divider stages. Figure 10.34 shows a ÷8/9 prescaler as an example. The ÷2/3 circuit (D23) of Fig. 10.31 is followed by two asynchronous ÷2 stages, and MC1 is defined as the NAND of their outputs with the main modulus control, MC2. If MC2 is low, MC1 is high, allowing D23 to divide by 2. The overall circuit thus operates as a ÷8 circuit. If MC2 is high and the ÷2 stages begin from a reset state, then MC1 is also high and D23 divides by 2. This continues until both A and B are high, at which point MC1 falls, forcing D23 to divide by 3 for one clock cycle before A and B return to zero. The circuit therefore divides by 9 in this mode.
An important issue in employing both synchronous and asynchronous sections in Fig. 10.35 is the potential for race conditions when the circuit divides by 15. To understand the problem, first suppose FF3 and FF4 change their output state on the rising edge of their clock inputs. If MC is low, the circuit continues to divide by 16, i.e., the state of the ÷3/4 core goes through the cycle 01, 11, 10, 00, until both asynchronous outputs are low. As depicted in Fig. 10.36(a), the core then skips the state 00 after the state 10. Since three CKin cycles elapse from the time the asynchronous outputs go low until the time the core skips this state, the propagation delay through FF3 and G3 need not be less than one cycle of CKin.
Now consider a case where FF3 and FF4 change their output state on the falling edge of their clock inputs. Then, as shown in Fig. 10.36(b), immediately after the asynchronous outputs have fallen to 00, the ÷3/4 circuit must skip the state 00, mandating that the delay through FF3, FF4, and G3 be less than half of a CKin cycle. This is in general difficult to achieve, complicating the design and demanding higher power dissipation. Thus, the first choice is preferable.
The pulse swallow divider of Fig. 10.24 provides a divide ratio of NP + S, allowing some flexibility in the choice of these three parameters. For example, to cover the Bluetooth channels from 2400 MHz to 2480 MHz, we can choose N = 4, P = 575, and S = 100, ..., 180, or N = 10, P = 235, and S = 50, ..., 130. (Recall that P must remain greater than S.) What trade-offs do we face in these choices? One aspect of the design calls for using a large N, and another for a small N.
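A quick check of these two choices (with fREF = 1 MHz, so that NP + S directly gives the output frequency in megahertz):

```python
for N, P, S_values in [(4, 575, range(100, 181)), (10, 235, range(50, 131))]:
    freqs_MHz = [N * P + S for S in S_values]
    print(N, P, min(freqs_MHz), max(freqs_MHz))   # both choices span 2400 ... 2480 MHz
```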
Returning to the race condition studied in the ÷15/16 circuit of Fig. 10.35, we make the following observation. With the proper choice of the clock edge, D34 begins ÷4 operation as its modulus control changes and continues for two more input cycles before it goes into the ÷3 mode. More generally, for a synchronous ÷(N + 1)/N circuit followed by asynchronous stages, proper choice of the clock edge allows the circuit to divide by N + 1 for N − 1 input cycles before its modulus is changed to N. This principle applies to the pulse swallow divider as well, requiring a large N so as to permit a long delay through the asynchronous stages and the feedback loop.
The above perspective encourages a large N for the prescaler. On the other hand, a larger prescaler modulus leads to a higher power dissipation if the stages within the prescaler incorporate current steering to operate at high speeds (Section 10.6.4). For this reason, the prescaler modulus is determined by careful simulations. It is also possible to pipeline the output of the RS latch in Fig. 10.37, thus allowing a smaller N but additional cycles for the modulus change [6].
The divider blocks in the feedback loop of a synthesizer can be realized by means of various logic styles. The choice of a divider topology is governed by several factors: the input swing (e.g., that available from the VCO), the input capacitance (e.g., that presented to the VCO), the maximum speed, the output swing (as required by the subsequent stage), the minimum speed (i.e., dynamic logic versus static logic), and the power dissipation. In this section, we study divider design in the context of different logic families.
Affording the fastest circuits, current-steering logic, also known as “current-mode logic” (CML), operates with moderate input and output swings. CML circuits provide differential outputs and hence a natural inversion; e.g., a single stage serves as both a NAND gate and an AND gate. CML derives its speed from the property that a differential pair can be rapidly enabled and disabled through its tail current source.
Figure 10.38(a) shows a CML AND/NAND gate. The top differential pair senses the differential inputs, A and its complement, and is controlled by M3 and hence by B and its complement. If B is high, M1 and M2 remain on, X tracks the complement of A, and Y = A. If B is low, M1 and M2 are off, X is at VDD, and Y is pulled down by M4 to VDD − RDISS. From another perspective, we note from Fig. 10.38(b) that M1 and M3 resemble a NAND branch, and M2 and M4 a NOR branch. The circuit is typically designed for a single-ended output swing of RDISS = 300 mV, and the transistors are sized such that they experience complete switching with such input swings.
For the differential pairs in Fig. 10.38(a) to switch with moderate input swings, the transistors must not enter the triode region. For example, if M3 is in the triode region when it is on, then the swings at B and its complement must be considerably larger than 300 mV so as to turn M3 off and M4 on. Thus, the common-mode level of B and its complement must be below that of A and its complement by at least one overdrive voltage, making the design of the preceding stages difficult. Figure 10.39 depicts an example, where the NAND gate is preceded by two representative CML stages. Here, A and its complement swing between VDD and VDD − R1ISS1. On the other hand, by virtue of the level-shift resistor RT, B and its complement vary between VDD − RTISS2 and VDD − RTISS2 − R2ISS2. That is, RT shifts the CM level of B and its complement by RTISS2. The addition of RT appears simple, but now the high level of F and its complement is constrained if M5 and M6 must not enter the triode region. That is, this high level must not exceed VDD − RTISS2 − R2ISS2 + VTH.
The stacking of differential pairs in the NAND gate of Fig. 10.38(a) does not lend itself to low supply voltages. The CML NOR/OR gate, on the other hand, avoids stacking. Shown in Fig. 10.40(a), the circuit steers the tail current to the left if A or B is high, producing a low level at X and a high level at Y. Unfortunately, however, this stage operates only with single-ended inputs, demanding great attention to the CM level of A and B and the choice of Vb. Specifically, Vb must be generated such that it tracks the CM level of A and B. As illustrated in Fig. 10.40(b), Vb is established by a branch replicating the circuitry that produces A. The CM level of A is equal to VDD − R2ISS2/2, and so is the value of Vb. For very high speeds, a capacitor may be tied from Vb to VDD, thereby maintaining a solid ac ground at the gate of M3.
At low supply voltages, we design the logic in the dividers to incorporate the NOR gate of Fig. 10.40(a) rather than the NAND circuit of Fig. 10.38(a). The ÷2/3 circuit of Fig. 10.32 exemplifies this principle. To ensure complete switching of M1-M3 in the NOR stage, the input swings must be somewhat larger than our rule of thumb of 300 mV, or the transistors must be wider.
Another commonly-used gate is the XOR circuit, shown in Fig. 10.41. The topology is identical to the Gilbert cell mixer studied in Chapter 6, except that both input ports are driven by large swings to ensure complete switching. As with the CML NAND gate, this circuit requires proper CM level shift for B and its complement and does not easily operate with low supply voltages.
Figure 10.42 depicts an XOR gate that avoids stacking [8]. If A or B is high, M3 turns off; that is, ID3 is nonzero only if both A and B are low. Similarly, ID6 is nonzero only if both A and B are high. The summation of ID3 and ID6 at node X is equivalent to an OR operation, and the flow of the sum through RD produces an inversion.
In contrast to the XOR gate of Fig. 10.41, this circuit exhibits perfect symmetry with respect to A and B, an attribute that proves useful in some applications.
While lending itself to low supply voltages, the XOR topology of Fig. 10.42 senses each of the inputs in single-ended form, facing issues similar to those of the NOR gate of Fig. 10.40(a). In other words, Vb must be defined carefully and the input voltage swings and/or transistor widths must be larger than those required for the XOR of Fig. 10.41. Also, to provide differential outputs, the circuit must be duplicated with A and its complement (or B and its complement) swapped.
The speed advantage of CML circuits is especially pronounced in latches. Figure 10.43(a) shows a CML D latch. The circuit consists of an input differential pair, M1-M2, a latch or “regenerative” pair, M3-M4, and a clocked pair, M5-M6. In the “sense mode,” CK is high, and M5 is on, allowing M1-M2 to sense and amplify the difference between D and its complement. That is, X and Y track the input. In the transition to the “latch mode” (or “regeneration mode”), CK goes down, turning M1-M2 off, and the complement of CK goes up, turning M3-M4 on. The circuit now reduces to that in Fig. 10.43(b), where the positive feedback around M3 and M4 regeneratively amplifies the difference between VX and VY. If the loop gain exceeds unity, the regeneration continues until one transistor turns off, e.g., VX rises to VDD and VY falls to VDD − RDISS. This state is retained until CK changes and the next sense mode begins.
In order to understand the speed attributes of the latch, let us examine its voltage waveforms as the circuit goes from the sense mode to the latch mode. As shown in Fig. 10.43(c), D and its complement cross at t = t1, and VX and VY at t = t2. Even though VX and VY have not reached their full swings at t = t3, the circuit can enter the latch mode because the regenerative pair continues the amplification after t = t3. Of course, the latch mode must be long enough for VX and VY to approach their final values. We thus conclude that the latch operates properly even with a limited bandwidth at X and Y if (a) in the sense mode, VX and VY begin from their full levels and cross, and (b) in the latch mode, the initial difference between VX and VY can be amplified to a final value of ISSRD.
It is possible to merge logic with a latch, thus reducing both the delay and the power dissipation. For example, the NOR and the master latch of FF1 depicted in Fig. 10.32 can be realized as shown in Fig. 10.46. The circuit performs a NOR/OR operation on A and B in the sense mode and stores the result in the latch mode.
Let us construct a ÷2 circuit by placing two D latches in a negative feedback loop (Fig. 10.47). Note that the total capacitance seen at the clock input is twice that of a single latch. The design of the circuit begins with three known parameters: the power budget, the clock swing, and the load capacitance (the input capacitance of the next stage). We then follow these steps: (1) Select ISS based on the power budget; (2) Select RDISS ≈ 300 mV; (3) Select (W/L)1,2 such that the differential pair experiences nearly complete switching for a differential input of 300 mV; (4) Select (W/L)3,4 such that the small-signal gain around the regenerative loop exceeds unity; (5) Select (W/L)5,6 such that the clocked pair steers most of the tail current with the specified clock swing. It is important to ensure that the feedback around the loop is negative (why?).
The rough design thus obtained, along with the specified load capacitance, reaches a speed higher than the limit predicted by Eq. (10.49) because the voltage swings at X and Y in each latch need not reach the full amount of ISSRD for proper operation. In other words, as the clock frequency exceeds the limit given by (10.49), the output swings become smaller—up to a point where VX and VY simply do not have enough time to cross and the circuit fails.
In practice, the transistor widths may need to exceed those obtained above for three reasons: (a) the tail node voltages of M1-M2 and M3-M4 may be excessively low, driving M5-M6 into the triode region; (b) the tail node voltage of M5-M6 may be so low as to leave little headroom for ISS; and (c) at very high speeds, the voltage swings at X and Y do not reach RDISS, demanding wider transistors for steering the currents.
The stacking of the differential and regenerative pairs atop the clocked pair in Figs. 10.43 and 10.47 does not lend itself to low supply voltages. This issue is alleviated by omitting the tail current source, but the bias currents of the circuit must still be defined accurately. Figure 10.50 shows an example [9], where the bias of the clocked pair is defined by a current mirror and the clock is coupled capacitively. Without a current mirror, i.e., if the gates of M5 and M6 are directly tied to the preceding stage, the bias currents and hence the latch output swings would heavily depend on the process, temperature, and supply voltage. The value of the coupling capacitors is chosen about 5 to 10 times the gate capacitance of M5 and M6 to minimize the attenuation of the clock amplitude. Resistors RB1 and RB2 together with C1 and C2 yield a time constant much longer than the clock period. Note that capacitive coupling may be necessary even with a tail current if the VCO output CM level is incompatible with the latch input CM level (Chapter 8).
In the above circuit, large clock swings allow transistors M5 and M6 to operate in the class AB mode, i.e., their peak currents well exceed their bias current. This attribute improves the speed of the divider [9].
Recall from Chapter 4 that a VCO/÷2 circuit cascade proves useful in both generating the I and Q phases of the LO and avoiding injection pulling by the PA. This topology, however, dictates operation at twice the carrier frequency of interest. For the ÷2 stage to run at high frequencies, the speed of the D latch of Fig. 10.43 or 10.50 must be maximized. For example, inductive peaking raises the bandwidth at the output nodes [Fig. 10.52(a)]. From a small-signal perspective, we observe that the inductors rise in impedance at higher frequencies, allowing more of the currents produced by the transistors to flow through the capacitors and hence generate a larger output voltage. This behavior can be formulated with the aid of the equivalent circuit shown in Fig. 10.52(b). We have
It is common to rewrite this transfer function as
where
is the “damping factor” and
is the “natural frequency.” To determine the −3-dB bandwidth, we equate the squared magnitude of Eq. (10.51) to (1/2)(2ζ/ωn)²(1/CD)²:
It follows that
For example, noting that ωn = 2ζ/(RDCD), we obtain ω−3dB ≈ 1.8/(RDCD) if ζ = √2/2, i.e., the bandwidth increases by 80%. If LD is increased further so that ζ falls to approximately 0.6, then ω−3dB ≈ 1.85/(RDCD). On the other hand, a conservative value of ζ = 1 yields ω−3dB = 1.41/(RDCD).
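These numbers can be verified numerically, assuming the standard shunt-peaked load transfer function Vout/Iin = (RD + LDs)/(LDCDs² + RDCDs + 1), which is consistent with the relation ωn = 2ζ/(RDCD) used above (a sketch, normalized to RD = CD = 1):

```python
import numpy as np

def bw_improvement(zeta, RD=1.0, CD=1.0):
    """Return the -3-dB bandwidth of the shunt-peaked load normalized to
    1/(RD*CD), using zeta = (RD/2)*sqrt(CD/LD) and wn = 1/sqrt(LD*CD)."""
    LD = (RD / (2*zeta))**2 * CD
    w = np.linspace(1e-3, 10/(RD*CD), 200000)
    H = (RD + 1j*w*LD) / (1 - LD*CD*w**2 + 1j*w*RD*CD)
    w3db = w[np.nonzero(np.abs(H) <= RD/np.sqrt(2))[0][0]]   # first -3-dB crossing
    return w3db * RD * CD

print(round(bw_improvement(np.sqrt(2)/2), 2))   # ~1.80 -> 80% wider than 1/(RD*CD)
print(round(bw_improvement(1.0), 2))            # ~1.41
print(round(max(bw_improvement(z) for z in np.linspace(0.5, 0.8, 31)), 2))  # ~1.85 peak
```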
In practice, parasitics of on-chip inductors yield a bandwidth improvement less than the values predicted above. One may consider the Q of the inductor unimportant as RD appears in series with LD in Fig. 10.52. However, Q does play a role because of the finite parasitic capacitance of the inductor. For this reason, it is preferable to tie LD to VDD and RD to the output node rather than LD to the output node and RD to VDD; in the circuit of Fig. 10.52(a), about half of the distributed capacitance of LD is absorbed by VDD.7 This also allows a symmetric inductor and hence a higher Q.
The topology depicted in Fig. 10.52(a) is called “shunt peaking” because the resistor-inductor branch appears in parallel with the output port. It is also possible to incorporate “series peaking,” whereby the inductor is placed in series with the unwanted capacitance. Illustrated in Fig. 10.53, the circuit provides the following transfer function:
which is similar to Eq. (10.50) but for a zero. The −3-dB bandwidth is computed as
where ζ and ωn are given by (10.52) and (10.53), respectively. For example, if ζ = √2/2, then ω−3dB = ωn ≈ 1.41/(RDCD), i.e., series peaking increases the bandwidth by about 40%. The reader can prove that the frequency response exhibits peaking if ζ < √2/2. Note that VX/Iin satisfies the shunt peaking transfer function of (10.50).
Circuits employing series peaking are generally more complex than the situation portrayed in Fig. 10.53. Specifically, the transistor generating Iin suffers from an output capacitance, which can be represented by CD, but the next stage also exhibits an input capacitance, which is part of CD in Fig. 10.52(a) but not included in Fig. 10.53. A more complete model is shown in Fig. 10.55(a) and two circuit examples employing series peaking are depicted in Fig. 10.55(b). The transfer function of the topology in Fig. 10.55(a) is of third order, making it difficult to compute the bandwidth, but simulations can be used to quantify the performance. Compared to shunt peaking, series peaking typically requires a smaller inductor value.
As LD increases from zero in Fig. 10.52(a), the maximum operation frequency of the divider rises. Of course, the need for at least two (symmetric) inductors for a ÷2 circuit complicates the layout. Moreover, as the value of LD becomes so large that LDω/RD (the Q of the series combination) exceeds unity at the maximum output frequency, the lower end of the operation frequency range increases. That is, the circuit begins to fail at low frequencies. Illustrated in Fig. 10.56(a), this phenomenon occurs because the circuit approaches a quadrature LC oscillator that is injection-locked to the input clock. Figure 10.56(b) shows the simplified circuit, revealing resemblance to the quadrature topology studied in Chapter 8. As the Q of the tank exceeds unity, the injection lock range of the circuit becomes narrower.
CML dividers have reached very high speeds in deep-submicron CMOS technologies. For example, the use of inductive peaking and class-AB operation has afforded a maximum clock frequency of 96 GHz for a ÷2 circuit [10].
Another logic style often employed in divider design is “true single-phase clocking” (TSPC) [11]. Figure 10.57(a) shows a TSPC flipflop. Incorporating dynamic logic, the circuit operates as follows. When CK is high, the first stage operates as an inverter, impressing the complement of D at A and E. When CK goes low, the first stage is disabled and the second stage becomes transparent, “writing” the complement of A at B and C and hence making Q equal to A. The logical high at E and the logical low at B are degraded, but the levels at A and C ensure proper operation of the circuit.
Does the clock completely disable each of the first two stages? Suppose CK is low. If D goes from low to high, A remains constant. On the other hand, if D falls [Fig. 10.57(b)], A rises, but the state at B does not change because M4 turns off and M6 remains off. (For a state to change, a transistor must turn on.) Now suppose CK is high, keeping M2 on and M5 off. Does a change in D propagate to Q? For example, if Q is high, and D rises, then A falls and B rises, turning M7 off. Thus, Q remains unchanged.
Since the flipflop of Fig. 10.57 contains one inversion, it can serve as a ÷2 circuit if Q is tied to D. An alternative ÷2 TSPC circuit is shown in Fig. 10.58 [11]. This topology achieves relatively high speeds with low power dissipation, but, unlike CML dividers, it requires rail-to-rail clock swings for proper operation. Moreover, it does not provide quadrature outputs. Note that (a) the circuit consumes no static power,8 and (b) as a dynamic logic topology, the divider fails at very low clock frequencies due to the leakage of the transistors. For example, if a synthesizer is designed with a reference frequency of 1 MHz, then the last few stages of the program counter in Fig. 10.24 must operate at a few megahertz, possibly failing to retain states stored on transistor capacitances.
The TSPC FF of Fig. 10.57 can readily incorporate logic at its input. For example, a NAND gate can be merged with the master latch as shown in Fig. 10.59. Thus, circuits such as the ÷3 stage of Fig. 10.31 can be realized by TSPC logic as well. In the design of TSPC circuits, one observes that wider clocked devices [e.g., M2 and M5 in Fig. 10.57(a)] raise the maximum speed, but at the cost of loading the preceding stage, e.g., the VCO.
A variant of TSPC logic that achieves higher speeds is depicted in Fig. 10.60 [12]. Here, the first stage operates as the master D latch and the last two as the slave D latch. The slave latch is designed as “ratioed” logic, i.e., both NMOS devices are strong enough to pull down B and Q even if M4 or M6 is on. When CK is high, the first stage reduces to an inverter, the second stage forces a ZERO at B, and the third stage is in the store mode. When CK goes down, B remains low if A is high, or it rises if A is low, with Q tracking B because M7 and M6 act as a ratioed inverter.
The second and third stages in the circuit of Fig. 10.60 consume static power when their clocked transistor fights the input device. At high speeds, however, the dynamic power dominates, making this drawback less objectionable. In a typical design (with PMOS mobility about half of NMOS mobility), all transistors in the circuit can have equal dimensions, except for W5, which must be two to three times the other transistor widths to maximize the speed. The input can also incorporate logic in a manner similar to that in Fig. 10.59. This technique allows ÷2 speeds around 10 GHz and ÷3 speeds around 6 GHz in 65-nm CMOS technology. We incorporate this logic style in the design of a ÷3/4 circuit in Chapter 13.
The TSPC circuit and its variants operate with rail-to-rail swings but do not provide differential or quadrature outputs. A complementary logic style resolving this issue is described in Chapter 13.
As explained in Section 10.6.4, CML dividers achieve a high speed by virtue of current steering and moderate voltage swings. If the required speed exceeds that provided by CML circuits, one can consider the “Miller divider” [13], also known as the “dynamic divider.”9 Depicted in Fig. 10.61(a) and providing a divide ratio of 2, the Miller topology consists of a mixer and a low-pass filter, with the LPF output fed back to the mixer. If the circuit operates properly, fout = fin/2, yielding two components, 3fin/2 and fin/2, at node X. The former is attenuated by the LPF, and the latter circulates around the loop. In other words, correct operation requires that the loop gain for the former component be sufficiently small and that for the latter exceed unity.
The Miller divider can achieve high speeds for two reasons: (1) the low-pass behavior can simply be due to the intrinsic time constant at the output node of the mixer, and (2) the circuit does not rely on latching and hence fails more gradually than flipflops as the input frequency increases. Note, however, that the divider loop requires some cycles to reach steady state, i.e., it does not divide correctly instantaneously.
Figure 10.61(b) shows an example of the Miller divider realization. A double-balanced mixer senses the input at its LO port, with C1 and C2 returning the output to the RF port.10 The loop gain at mid-band frequencies is equal to (2/π)gm1,2RD (Chapter 6) and must remain above unity. The maximum speed of the circuit is roughly given by the frequency at which the loop gain falls to unity. Note that RD and the total capacitance at the output nodes define the corner of the LPF. Of course, at high frequencies, the roll-off due to the pole at the drains of M1 and M2 may also limit the speed.
The reader may wonder why we said above that the component at 3fin/2 must be sufficiently small. After all, this component is merely the third harmonic of the desired output and would seem to only sharpen the output edges. However, this harmonic in fact creates a finite lower bound on the divider operation frequency. As fin decreases and the component at 3fin/2 falls below the corner frequency of the LPF, the circuit fails to divide.
To understand this point, let us open the loop as shown in Fig. 10.63(a) and write the output of the mixer as
where α is related to the mixer conversion gain. Illustrated in Fig. 10.63(b), this sum exhibits additional zero crossings, prohibiting frequency division if traveling through the LPF unchanged [14]. Thus, the third harmonic must be attenuated—at least by a factor of three [14]—to avoid the additional zero crossings.
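The factor-of-three condition can be checked numerically; the sketch below models the filtered sum as cos(x) + a·cos(3x), where x = ωint/2 and a denotes the relative amplitude of the third harmonic after the LPF (a label assumed here). Because cos(x) + a·cos(3x) = cos(x)·(1 − 3a + 4a·cos²x), extra zero crossings appear only for a > 1/3.

```python
import numpy as np

def zero_crossings(a, periods=4):
    """Count zero crossings of cos(x) + a*cos(3x) over 'periods' periods of cos(x)."""
    x = np.linspace(0.0, periods * 2*np.pi, 200001)
    y = np.cos(x) + a * np.cos(3*x)
    return int(np.sum(np.diff(np.sign(y)) != 0))

print(zero_crossings(1/3.5))   # 8 crossings in 4 periods: the half-rate waveform is preserved
print(zero_crossings(1/2.0))   # 24 crossings: additional zero crossings, division fails
```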
To avoid the additional zero crossings shown in Fig. 10.63(b), it is also possible to introduce phase shift in cos(ωint/2) and/or cos(3ωint/2). For example, if cos(3ωint/2) is attenuated by a factor of 2 but shifted by 45°, then the two components add up to the waveform shown in Fig. 10.65. In other words, the Miller divider operates properly if the third harmonic is attenuated and shifted so as to avoid the additional zero crossings [14]. This observation suggests that the topology of Fig. 10.61(b) divides successfully only if the pole at the drains of M1 and M2 provides enough phase shift, a difficult condition.
The topology of Fig. 10.61(b) suffers from the same gain-headroom trade-offs as those described for active mixers in Chapter 6. The limited voltage drop across the load resistors makes it difficult to achieve a high conversion gain. Wider input and LO transistors alleviate this issue but at the cost of speed. If the load resistors are replaced with inductors, the gain-headroom and gain-speed trade-offs are greatly relaxed, but the lower end of the frequency range rises. Also, the inductor complicates the layout. Figure 10.66 shows a Miller divider using inductive loads. Since the tanks significantly suppress the third harmonic of the desired output, this circuit proves more robust than the topology of Fig. 10.61(b) [14].
The design of this circuit proceeds as follows. We assume a certain tolerable capacitance at the LO port and a certain load capacitance at the output node. The width of M3-M6 is chosen according to the former, and the LO common-mode level is preferably chosen equal to VDD, leaving maximum headroom for M1 and M2. Inductors L1 and L2 must resonate with the total capacitance at X and Y at about half of the input “mid-band” frequency. For example, if we wish to accommodate an input range of 40 GHz to 50 GHz, we may choose a resonance frequency around 22.5 GHz. In this step, the capacitance of M1 and M2 is unknown, requiring a reasonable guess and possible iterations. With the value of L1 and L2 known, their equivalent parallel resistance, Rp, must provide enough gain along with M1 and M2. [Recall that the mixer has a conversion gain of (2/π)gm1,2Rp.] The width and bias current of M1 and M2 are therefore chosen so as to maximize the gain. Of course, an excessively high bias current results in a low value for VP and VQ, driving M1 and M2 into the triode region. Some optimization is thus necessary. It can be shown that the input frequency range across which the circuit operates properly is given by
where ω0 and Q denote the tank resonance frequency and quality factor, respectively [14].
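The tank-sizing steps above can be sketched as follows; the 40–50-GHz plan is from the text, while the capacitance, inductor Q, and gm values are assumed purely for illustration:

```python
import math

f_in_lo, f_in_hi = 40e9, 50e9                 # input range from the text
f_res = (f_in_lo + f_in_hi) / 2 / 2           # ~22.5 GHz: half of the input mid-band
C_tank = 60e-15                               # assumed total capacitance at X (or Y)
L_tank = 1 / ((2*math.pi*f_res)**2 * C_tank)  # resonance at f_res
Q_ind = 8                                     # assumed inductor Q
Rp = Q_ind * 2*math.pi*f_res * L_tank         # equivalent parallel tank resistance
gm = 5e-3                                     # assumed transconductance of M1, M2
loop_gain = (2/math.pi) * gm * Rp             # mid-band conversion gain of the mixer
print(L_tank, Rp, loop_gain > 1)              # ~0.83 nH, ~940 ohms, True
```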
It is possible to construct a Miller divider using passive mixers. Figure 10.68 depicts an example, where M1-M4 constitute a passive mixer and M5-M6 an amplifier [16]. Since the output CM level is near VDD, the feedback path incorporates capacitive coupling, allowing the sources and drains of M1-M4 to remain about 0.4 V above ground. (As explained in Chapter 6, the LO CM level must still be near VDD to provide sufficient overdrive for M1-M4.) The cross-coupled pair M7-M8 can be added to increase the gain by virtue of its negative resistance. If excessively strong, however, this pair oscillates with the tanks, leading to a narrow frequency range (Section 10.6.6).
While achieving speeds exceeding 100 GHz in CMOS technology [16], the Miller divider does not provide quadrature outputs, a drawback with respect to RF transceivers that operate the oscillator at twice the carrier frequency and employ a divider to generate the I and Q phases.
In his original paper, Miller also contemplates the use of dividers within the feedback loop of his topology so as to produce moduli other than 2. Shown in Fig. 10.69 is an example, where a ÷N circuit in the feedback path creates fb = fout/N, yielding fin ± fout/N at X. If the sum is suppressed by the LPF, then fout = fin − fout/N and hence fout = [N/(N + 1)]fin. Also, the divider output is given by fb = fin/(N + 1).
For example, if N = 2, the input frequency is divided by 1.5 and 3, two moduli that are difficult to obtain at high speeds by means of flipflop-based dividers.
An important issue in the topology of Fig. 10.69 is that the sum component at X comes closer to the difference component as N increases, dictating a sharper LPF roll-off. In the above example, these two components lie at 4fin/3 and 2fin/3, respectively, i.e., only one octave apart. Consequently, the circuit suffers from a more limited frequency range.
Another critical issue in the Miller divider of Fig. 10.69 relates to the port-to-port feedthroughs of the mixer. The following example illustrates this point.
Does the original topology of Fig. 10.61(a) also suffer from spurs? In Problem 10.17, we prove that it does not.
The Miller divider frequency range can be extended through the use of a single-sideband mixer. Illustrated in Fig. 10.71(a), the idea is to suppress the sum component by SSB mixing rather than filtering, thereby avoiding the problem of additional zero crossings depicted in Fig. 10.63. In the absence of the sum component, the circuit divides properly at arbitrarily low frequencies. Unfortunately, however, this approach requires a broadband 90° phase shift, a very difficult design.
Nonetheless, the use of SSB mixing does prove useful if the loop contains a divider that generates quadrature outputs [17]. Shown in Fig. 10.71(b) is an example employing a ÷2 circuit and generating fin/3 at the output [17]. This topology achieves a wide frequency range and generates quadrature outputs, a useful property for multiband applications. By contrast, the flipflop-based ÷3 circuit in Fig. 10.28 does not provide quadrature outputs. We note from Example 10.28 that the ÷1.5 output available at X exhibits spurs and may not be suited to stringent systems.
The principal drawback of the circuit of Fig. 10.71(b) and its variants is that it requires quadrature LO phases. As explained in Chapter 8, quadrature oscillators exhibit a higher phase noise and two possible modes.
Another class of dividers is based on oscillators that are injection-locked to a harmonic of their oscillation frequency [15]. To understand this principle, let us return to the Miller divider of Fig. 10.68 and assume the cross-coupled pair is strong enough to produce oscillation. Transistors M5 and M6 can now be viewed as devices that couple the mixer output to the oscillator.11 The overall loop can thus be modeled as shown in Fig. 10.72, where the connection between X and the oscillator denotes injection rather than frequency control. If the loop operates properly, then fout = fin/2, yielding both fin/2 and 3fin/2 at X. The former couples to and locks the oscillator while the latter is suppressed by the selectivity of the oscillator. If fin varies across a certain “lock range,” the oscillator remains injection-locked to the fout − fin component at node X. On the other hand, if fin falls outside the lock range, the oscillator is injection-pulled, thus producing a corrupted output.
The reader may wonder if the injection-locked divider (ILD) of Fig. 10.72 is any different from the Miller loop of Fig. 10.61(a). The fundamental difference between the two is that the former oscillates even with the input amplitude set to zero, whereas the latter does not. For this reason, ILDs generally exhibit a narrower operation frequency range than Miller dividers.
Let us now implement an ILD. While the topology of Fig. 10.72 serves as a candidate, an even simpler arrangement can be conceived if we recognize that the cross-coupled pair within an oscillator can also operate as a mixer. Indeed, we utilized this property to study the effect of the tail noise current on the phase noise in Chapter 8. The loop shown in Fig. 10.72 thus reduces to a single cross-coupled pair providing a negative resistance at its drains and a mixing input at its tail node. Figure 10.74(a) depicts the result: Iin (= gm3Vin) is commutated by M1 and M2 and hence translated to fout ± fin as it emerges at the drains of these transistors. The circuit can be equivalently viewed as shown in Fig. 10.74(b), where Ieq represents the current components at fout ± fin, with the amplitude of each component given by 2/π times the amplitude of Iin.12
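The 2/π factor just quoted can be verified numerically by modeling the commutation of the tail current by M1 and M2 as multiplication by a ±1 square wave: the square wave's fundamental has an amplitude of 4/π, and each first-order mixing product at fout ± fin receives half of that. A minimal Python check (higher-order products are ignored here):

import numpy as np

t = np.linspace(0, 1, 200000, endpoint=False)       # one period of the switching waveform
square = np.sign(np.cos(2 * np.pi * t))              # +/-1 commutation by the cross-coupled pair
fund = 2 * np.mean(square * np.cos(2 * np.pi * t))   # fundamental Fourier coefficient
print(fund, 4 / np.pi)      # both approximately 1.2732
print(fund / 2, 2 / np.pi)  # per-sideband factor: both approximately 0.6366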
Since the sum component is greatly attenuated by the oscillator, we can consider only the difference component as the input to the oscillator. From Chapter 8, the two-sided injection lock range is given by (ω0/Q)(Iinj/Iosc), where Iinj = (2/π)Iin. Thus, the output frequency range across which the circuit remains locked is given by [18]
Δωout = (2/π)(ω0/Q)(Iin/Iosc).    (10.73)
The input lock range is twice this value:
Δωin = 2Δωout = (4/π)(ω0/Q)(Iin/Iosc).    (10.74)
As explained in Chapter 8, the phase noise of an injection-locked oscillator approaches that of the unlocked circuit if the oscillator is locked near the edge of the lock range. Equation (10.74) therefore points to a trade-off between the lock range and the phase noise: as Q is lowered, the former widens but the latter degrades.
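As a numerical illustration of this trade-off, the sketch below evaluates the lock-range expressions above for a few values of Q; the center frequency and current levels are assumed, not taken from the text.

import math

f0 = 20e9        # assumed output (oscillator) center frequency, Hz
I_in = 1e-3      # assumed injected tail-current amplitude, A
I_osc = 4e-3     # assumed oscillator core current amplitude, A

for Q in (4, 8, 16):
    df_out = (2 / math.pi) * (f0 / Q) * (I_in / I_osc)   # output lock range, Hz
    df_in = 2 * df_out                                    # input lock range, Hz
    print(f"Q = {Q:2d}: output lock range = {df_out / 1e9:.2f} GHz, "
          f"input lock range = {df_in / 1e9:.2f} GHz")

Halving Q doubles both lock ranges, but, as noted above, it also degrades the phase noise of the locked oscillator.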
As mentioned in Section 10.6.4, flipflop-based dividers can also be considered injection-locked ring oscillators. Achieving a much wider lock range than their LC oscillator counterparts, these dividers exhibit a low phase noise by virtue of strong locking to the input. It is important to bear in mind that LC oscillators exhibit injection-locking dynamics that require a settling time commensurate with their Q; that is, LC-based injection-locked dividers do not begin to operate correctly instantaneously.
It is also critical to note that, in a PLL environment, the divider lock range must exceed the VCO tuning range. During the lock transient, the VCO frequency swings up or down and, if the divider fails at any VCO frequency, the PLL may simply not lock.
A divider incorporating asynchronous logic may experience a significant delay from the input to the output. For example, in the pulse swallow divider of Fig. 10.24, all three counters contribute delay: on one edge of the main input, the prescaler incurs some delay before it produces a transition at node A, which must then propagate through the program counter to reach the output.
What is the effect of divider delay on an integer-N synthesizer? The transfer function of a stage having a constant delay of ΔT is given by exp(−ΔT · s), yielding an overall open-loop transfer function of
Hopen(s) = (IP/(2π))[R1 + 1/(C1 s)](KVCO/s)(1/N) exp(−ΔT · s).
If the delay is small with respect to the time scales of interest, we can write exp(−ΔT · s) ≈ 1 − ΔT · s, recognizing that the delay results in a zero in the right-half plane. Such a zero contributes negative phase shift, −tan−1(ΔT · ω), making the loop less stable. Plotted in Fig. 10.75 is the open-loop frequency response in this case, revealing that the zero has two undesirable effects: it flattens the gain, pushing the gain crossover frequency to higher values (in principle, infinity), and it bends the phase profile downward. Thus, this zero, which lies at (ΔT)−1, must remain well above the original unity-gain bandwidth of the loop.
In most practical designs, the divider delay satisfies the above condition, thus negligibly affecting the loop dynamics. Otherwise, the divider must employ more synchronous logic to reduce the delay.
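As a quick check of this condition, the Python sketch below computes the extra phase lag, tan−1(ΔT · ωu), contributed at the gain crossover for a few assumed divider delays and an assumed unity-gain bandwidth.

import math

f_u = 1e6                       # assumed original unity-gain bandwidth of the loop, Hz
w_u = 2 * math.pi * f_u

for dT in (5e-9, 50e-9, 500e-9):                      # assumed divider delays, s
    phase_loss = math.degrees(math.atan(dT * w_u))    # extra lag from the RHP zero
    # rule of thumb (assumed): treat the lag as negligible if dT * w_u < 0.1
    verdict = "negligible" if dT * w_u < 0.1 else "significant"
    print(f"dT = {dT * 1e9:5.0f} ns: extra phase lag at crossover = {phase_loss:.1f} deg ({verdict})")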
The divider phase noise may also prove troublesome. As shown in Fig. 10.76, the output phase noise of the divider, φn,div, directly adds to the input phase noise, φn,in, experiencing the same low-pass response as it propagates to φout. In other words, φn,div is also multiplied by a factor of N within the loop bandwidth. Thus, for the divider to contribute negligible phase noise, we must have
φn,div ≪ φn,in.
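In decibel terms, the in-band multiplication by N corresponds to 20 log10 N; the sketch below compares the output contributions of an assumed reference phase noise and an assumed divider phase noise for an illustrative divide ratio.

import math

N = 100          # assumed divide ratio (e.g., ~2.5 GHz from a 25-MHz reference)
L_ref = -150.0   # assumed in-band reference phase noise, dBc/Hz
L_div = -160.0   # assumed divider output phase noise, dBc/Hz

gain_db = 20 * math.log10(N)    # in-band multiplication by N
print(f"in-band amplification: {gain_db:.1f} dB")
print(f"reference contribution at the output: {L_ref + gain_db:.1f} dBc/Hz")
print(f"divider contribution at the output:   {L_div + gain_db:.1f} dBc/Hz")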
In narrowband synthesizers, of course, the output phase noise is dominated by that of the VCO, making φn,in and φn,div less critical. Nonetheless, if the divider phase noise is significant, a retiming flipflop can be used to suppress its effect. Illustrated in Fig. 10.77(a), the idea is to sample the divider output by the VCO waveform, thus presenting the edges to the PFD only at the VCO transitions. As depicted in Fig. 10.77(b), VY changes only when the VCO output changes, avoiding the jitter (phase noise) in VX (if the jitter is less than one cycle of Vout). In essence, the retiming operation bypasses the phase noise accumulated in the divider chain.
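The retiming action can be mimicked with a toy Python simulation in which jittery divider edges are snapped to the next edge of an ideally clean VCO clock; the jitter vanishes as long as it remains well within one VCO period (all numbers are assumed for illustration).

import numpy as np

rng = np.random.default_rng(0)
T_vco = 400e-12                         # assumed VCO period (2.5 GHz)
N = 100                                 # assumed divide ratio
T_div = N * T_vco                       # nominal divider output period
t_d = 0.5 * T_vco                       # assumed divider delay, safely mid-period

ideal = np.arange(1, 1001) * T_div
div_edges = ideal + t_d + rng.normal(0, 20e-12, ideal.size)   # 20-ps rms divider jitter
retimed = np.ceil(div_edges / T_vco) * T_vco                  # snap to the next clean VCO edge

print(f"divider edge jitter (rms): {np.std(div_edges - ideal) * 1e12:.1f} ps")
print(f"retimed edge jitter (rms): {np.std(retimed - ideal) * 1e12:.1f} ps")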
A concern in the use of the retiming FF in Fig. 10.77(a) arises if the VCO output edge occurs close to the transition at node X. Under this condition, the FF becomes “metastable,” i.e., it takes a long time to produce a well-defined logical level. Depicted in Fig. 10.77(c), this effect results in a distorted transition at node Y, confusing the PFD. It is therefore essential to guarantee by design that the sampling edges of the VCO remain safely away from the transitions at node X. If the divider delay varies by as much as one VCO period with process and temperature, then it becomes extremely difficult to avoid metastability.
[1] S. E. Meninger and M. H. Perrott, “A 1-MHz Bandwidth 3.6-GHz 0.18-μm CMOS Fractional-N Synthesizer Utilizing a Hybrid PFD/DAC Structure for Reduced Broadband Phase Noise,” IEEE J. of Solid-State Circuits, vol. 41, pp. 966–981, April 2006.
[2] K. J. Wang, A. Swaminathan, and I. Galton, “Spurious Tone Suppression Techniques Applied to a Wide-Bandwidth 2.4 GHz Fractional-N PLL,” IEEE J. of Solid-State Circuits, vol. 43, pp. 2787–2797, Dec. 2008.
[3] A. Zolfaghari, A. Y. Chan, and B. Razavi, “A 2.4-GHz 34-mW CMOS Transceiver for Frequency-Hopping and Direct-Sequence Applications,” ISSCC Dig. Tech. Papers, pp. 418–419, Feb. 2001.
[4] G. Irvine et al., “An Upconversion Loop Transmitter IC for Digital Mobile Telephones,” ISSCC Dig. Tech. Papers, pp. 364–365, Feb. 1998.
[5] T. Yamawaki et al., “A 2.7-V GSM RF Transceiver IC,” IEEE J. of Solid-State Circuits, vol. 32, pp. 2089–2096, Dec. 1997.
[6] C. Lam and B. Razavi, “A 2.6-GHz/5.2-GHz Frequency Synthesizer in 0.4-μm CMOS Technology,” IEEE J. of Solid-State Circuits, vol. 35, pp. 788–794, May 2000.
[7] C. S. Vaucher et al., “A Family of Low-Power Truly Modular Programmable Dividers in Standard 0.35-μm CMOS Technology,” IEEE J. of Solid-State Circuits, vol. 35, pp. 1039–1045, July 2000.
[8] B. Razavi, Y. Ota, and R. G. Swartz, “Design Techniques for Low-Voltage High-Speed Digital Bipolar Circuits,” IEEE J. of Solid-State Circuits, vol. 29, pp. 332–339, March 1994.
[9] J. Lee and B. Razavi, “A 40-Gb/s Clock and Data Recovery Circuit in 0.18-μm CMOS Technology,” IEEE J. of Solid-State Circuits, vol. 38, pp. 2181–2190, Dec. 2003.
[10] D. D. Kim, K. Kim, and C. Cho, “A 94GHz Locking Hysteresis-Assisted and Tunable CML Static Divider in 65nm SOI CMOS,” ISSCC Dig. Tech. Papers, pp. 460–461, Feb. 2008.
[11] J. Yuan and C. Svensson, “High-Speed CMOS Circuit Technique,” IEEE J. of Solid-State Circuits, vol. 24, pp. 62–70, Feb. 1989.
[12] B. Chang, J. Park, and W. Kim, “A 1.2-GHz CMOS Dual-Modulus Prescaler Using New Dynamic D-Type Flip-Flops,” IEEE J. of Solid-State Circuits, vol. 31, pp. 749–754, May 1996.
[13] R. L. Miller, “Fractional-Frequency Generators Utilizing Regenerative Modulation,” Proc. IRE, vol. 27, pp. 446–456, July 1939.
[14] J. Lee and B. Razavi, “A 40-GHz Frequency Divider in 0.18-μm CMOS Technology,” IEEE J. of Solid-State Circuits, vol. 39, pp. 594–601, Apr. 2004.
[15] H. R. Rategh and T. H. Lee, “Superharmonic Injection-Locked Frequency Dividers,” IEEE J. of Solid-State Circuits, vol. 34, pp. 813–821, June 1999.
[16] B. Razavi, “A Millimeter-Wave CMOS Heterodyne Receiver with On-Chip LO and Divider,” IEEE J. of Solid-State Circuits, vol. 43, pp. 477–485, Feb. 2008.
[17] C.-C. Lin and C.-K. Wang, “A Regenerative Semi-Dynamic Frequency Divider for Mode-1 MB-OFDM UWB Hopping Carrier Generation,” ISSCC Dig. Tech. Papers, pp. 206–207, Feb. 2005.
[18] B. Razavi, “A Study of Injection Locking and Pulling in Oscillators,” IEEE J. of Solid-State Circuits, vol. 39, pp. 1415–1424, Sep. 2004.
10.1. Prove that insertion of a feedback divider in Fig. 10.14(b) results in Eqs. (10.25) and (10.26).
10.2. Prove that the far-out phase noise of the LO in Fig. 10.20 also appears as noise in the RX band. Neglecting other sources of noise, determine the phase noise at 25-MHz offset for GSM.
10.3. Does the sampling filter shown in Fig. 10.13(b) remove the effect of the mismatch between the Up and Down currents?
10.4. In practice, the sampling filter of Fig. 10.13(b) employs another capacitor tied from Vcont to ground. Explain why. How should this capacitor and C2 be chosen to negligibly degrade the loop stability?
10.5. Suppose S1 in Fig. 10.13(b) has a large on-resistance. Does this affect the loop stability?
10.6. Explain whether or not the charge injection and clock feedthrough of S1 in Fig. 10.13(b) produce ripple on the control voltage after the loop has locked.
10.7. An in-loop modulation scheme is shown in Fig. 10.78. Consider two cases: the baseband bit period is (I) much shorter, or (II) much longer than the loop time constant.
(a) Sketch the output waveform of the VCO for both cases.
(b) Sketch the output waveform of the divider for both cases.
10.8. In the pulse swallow divider of Fig. 10.24, the modulus control of the prescaler is accidentally inverted. Explain what happens.
10.9. In the PLL shown in Fig. 10.79, the control voltage has a sinusoidal ripple at a frequency of fREF. Plot the spectra at A and B.
10.10. In the divider of Fig. 10.31, gate G1 is mistakenly realized as a NAND gate. Explain what happens.
10.11. How can the XOR of Fig. 10.42 be modified to provide differential outputs?
10.12. In the circuit of Fig. 10.44(a), the resistors are replaced with ideal current sources. Explain what happens.
10.13. In our study of oscillators in Chapter 8, we concluded that a loop containing only two poles cannot oscillate (unless both are at the origin). Why then does the circuit of Fig. 10.49 oscillate?
10.14. Example 10.18 suggests that τreg is independent of RD if gm3,4RD ≫ 1. Explain this property intuitively.
10.15. Must the clock transition be abrupt for the D latch of Fig. 10.43(a) to operate properly? Consider a clock transition time (a) on the order of the time constant at X and Y, and (b) much longer than this time constant.
10.16. For the Miller divider of Fig. 10.68, determine the loop gain. Assume the tanks in the drains of M1 and M2 can be replaced by a resistance of Rp at resonance. Neglect all transistor capacitances and assume the resistors tied to the gates of M5 and M6 are large.
10.17. Prove that the Miller divider of Fig. 10.61(a) does not exhibit spurs at the output even if the mixer suffers from port-to-port feedthroughs.
10.18. Repeat the above problem if the mixer suffers from nonlinearity at each port.
10.19. Study the spurious response of the Miller divider shown in Fig. 10.69 if the mixer exhibits (a) port-to-port feedthrough, or (b) port nonlinearity.
10.20. Of the active mixer topologies studied in Chapter 6, which ones are suited to the Miller divider of Fig. 10.61(a)?