Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 8

Congestion Control in Ethernet Networks

This chapter is on the topic of congestion control in Ethernet networks. Traditionally, Ethernet, which operates at Layer 2, has left the task of congestion control to the TCP layer. However, recent developments such as the spread of Ethernet use in applications such as Storage Area Networks (SANs) has led the networking community to revisit this design because SANs have a very strict requirement that no packets be dropped. As a result, the Institute of Electrical and Electronics Engineers (IEEE) 802.1 Standards group has recently proposed a congestion control algorithm called IEEE802.1Qau or Quantum Congestion Notification (QCN) for use in Ethernet networks. This algorithm uses several advances in congestion control techniques described in the previous chapters, such as the use of rate averaging at the sender and Active Queue Management (AQM) feedback, which takes the occupancy as well as the rate of change of the buffer size into account.

Keywords

Ethernet congestion control; Quantum Congestion Notification; QCN; Reaction Point or RP algorithm; Control Point or CP Algorithm; QCN Stability Analysis; IEEE802.1Qau

8.1 Introduction

Ethernet is the most prevalent Layer 2 technology used in Internet Protocol (IP) networks; it is almost as old as IP itself. Traditionally, Ethernet left the job of congestion control to IP while providing basic medium access control (its original purpose in shared media links) and later Layer 2 bridging and switching functionalities. This state of affairs started to change in the past 10 to 15 years as the use of Ethernet expanded to wider domains. One of these domains is data center networking, and Chapter 7 discusses server-to-server networking in DCNs using Ethernet as the switching fabric with TCP running on top.

An aspect of data center networks (DCNs) not discussed in Chapter 7 is that of communications between the servers and storage devices. The networks that are used to interconnect them together are known as Storage Area Networks (SANs). Fiber Channel (FC) is a Layer 1–2 high-speed point-to-point interconnect technology that is used to create SANs. A very strong requirement in SAN networks is that the packet drop rate caused by buffer overflows should be zero. FC networks use a hop-by-hop congestion control method to accomplish this goal, whereby a node that is running out of buffers sends a signal to the node upstream from it to stop it from transmitting.

Link speeds in FC increased over time, but they were not able to keep pace with the more rapid increases in Ethernet link speeds that have gone from 1 to 10 Gbps and then to 100 Gbps in the past decade and a half. In addition, Ethernet adaptors and switches come at a lower cost because of their wider adoption in the industry. Motivated by these considerations, the industry created a FC over Ethernet (FCoE) standard, in which Ethernet is used as the Layer 1–2 substrate in SANs. However, using TCP as the congestion control mechanism will not work in an FCoE SAN because TCP does drop packets during the course of its normal operation. As a result, an effort was started in the Institute of Electrical and Electronics Engineers (IEEE) 802.2 Standards Body to make modifications to Ethernet so that it is suitable for deployment in a SAN (or a DCN in general), called Data Center Bridging (DCB) [1]. One of the outcomes of this effort is a congestion control protocol called IEEE 802.1Qau [2], also known as Quantum Congestion Notification (QCN), which operates at the Ethernet layer and aims to satisfy the special requirements in DCB networks [3].

There is more design latitude in the design of QCN compared with TCP because Ethernet provides 6 bits of congestion feedback (as opposed to 1 bit in TCP), and the QCN design takes advantage of this by implementing a proportional-integral (PI) controller at the Ethernet switch, which we know to be superior than Random Early Detection (RED) controllers. The QCN increase–decrease protocol has some novel features described in Section 8.4. Alizadeh et al. [4] carried out a comprehensive control theoretic analysis of the fluid model of the QCN algorithm that forms the subject matter of Section 8.5.

8.2 Differences between Switched Ethernet and IP Networks

Alizadeh et al. listed the following contrasts between switched Ethernet networks and IP networks [3]. We already mentioned one of them in the Introduction (i.e., packets should not be dropped); the others are:

• Ethernet does not use per-packet ACKs: This means that (1) packet transmissions are not self-clocked, (2) round trip latencies are unknown, and (3) congestion has to be signaled directly by the switches back to the source.

• There are no Packet Sequence Numbers in Ethernet.

• Sources start at Line Rate because there is no equivalent of the TCP Slow-Start mechanism. Because Ethernet implements its congestion control in hardware, a Slow-Start–like algorithm would have required a rate limiter, which are few in number and used up by the main congestion control algorithm.

The next three differences are more specific to DCNs in general and served as constraints in the design of a transport level congestion control algorithm as well:

• Very shallow buffers: Ethernet switch buffers are of the order of 100s of kilobytes in size.

• Small number of simultaneously active connections.

• Multipathing: Traditional Layer 2 networks used Spanning Trees for routing, which restricted the number of paths to one. The use of Equal Cost Multi-Path (ECMP) opens up the system to using more than 1 path.

8.3 Objectives of the Quantum Congestion Notification Algorithm

The QCN algorithm was designed based on all the learning from the design and modeling of TCP congestion control algorithms over the previous 2 decades. As a result, it incorporates features that keep it from falling into performance and stability traps that TCP often runs into. Some of the objectives that it seeks to satisfy include the following:

• Stability: We will use this term in the same sense as in the rest of this book, that is, the bottleneck queue length in a stable system should not fluctuate widely, thus causing overflows and underflows. Whereas the former leads to excessive packet drops, the latter causes link underutilization.

• Responsiveness to link bandwidth fluctuations

• Fairness

• Implementation simplicity: Because the algorithm is implemented entirely in hardware, simplicity is a must.

8.4 Quantum Congestion Notification Algorithm Description

The algorithm is composed of two main parts: Switch or Control Point (CP) Dynamics and Rate Limiter or Reaction Point (RP) Dynamics.

8.4.1 The Control Point or CP Algorithm

The CP algorithm runs at the network nodes, and its objective is to maintain the node’s buffer occupancy at the operating point b_eq (Figure 8.1). It computes a congestion measure F_b and randomly samples an incoming packet with a probability that that depends on the severity of the congestion (Figure 8.2). It sends the value of F_b back to the source of the sampled packet after quantizing it to 6 bits.

Figure 8.1 Congestion detection at the Control Point (CP).

Figure 8.2 Sampling probability at the Control Point (CP) as a function of |Fb|.

Define the following:

b: Value of the current queue length

b_old: Value of the buffer occupancy when the last feedback message was generated

b_off:=b−b_eq

b_d:=b−b_old

Then F_b is given by the formula:

$F_{b} = b_{o f f} + w b_{d}$ $F_{b} = b_{o f f} + w b_{d}$ (1)

(1)

where w is a non-negative constant, set equal to 2 for the baseline implementation. Note that equation 1 is basically the PI Active Queue Management AQM controller from Chapter 3. The first term on the RHS is the offset from the target operating point (i.e., the buffer oversubscription), and the second term is proportional to the rate at which the queue size is changing. As per equation 55 in Chapter 3, this is proportional to the difference between the link capacity and the total traffic rate flowing through the link (i.e., the link oversubscription).

When $F_{b} < 0$ $F_{b} < 0$ , there is no congestion, and no feedback messages are sent. When $F_{b} \geq 0$ $F_{b} \geq 0$ , then either the buffers or the link or both are oversubscribed, and control action needs to be taken. An incoming packet is sampled with probability $p_{s} = ϕ (F_{b})$ $p_{s} = ϕ (F_{b})$ (see Figure 8.2), and if p_s=1 and $F_{b} \geq 0$ $F_{b} \geq 0$ , then a congestion feedback message is sent back to the source.

8.4.2 The Reaction Point or RP Algorithm

The RP algorithm runs at the end systems and controls the rate at which Ethernet packets are transmitted in to the network. Unlike TCP, the RP algorithm does not get positive ACKs from the network and hence needs alternative mechanisms for increasing its sending rate. Define the following:

Current rate (R_C): The transmission rate of the source

Target rate (R_T): The transmission rate of the source just before the arrival of the last feedback message

Byte counter: A counter at the RP for counting transmitted bytes; it is used to time rate increases

Timer: A clock at the RP that is used for timing rate increases. It allows the source to rapidly increase its sending rate from a low value when a lot of bandwidth becomes available.

Initially assuming only the Byte Counter is available, RP uses the following rules for increasing and decreasing its rate.

8.4.2.1 Rate Decreases

A rate decrease is only done when a feedback message is received, and in this case, CR and TR are updates as follows:

$R_{T} \leftarrow R_{C}$ $R_{T} \leftarrow R_{C}$ (2)

(2)

$R_{C} \leftarrow R_{C} (1 - G_{d} | F_{b} |)$ $R_{C} \leftarrow R_{C} (1 - G_{d} | F_{b} |)$ (3)

(3)

The constant G_d is chosen so that $G_{d} | F_{b \max} | = \frac{1}{2}$ $G_{d} | F_{b \max} | = \frac{1}{2}$ (i.e., the rate can decrease by at most 50%). Because only 6 bits are available for feedback, it follows that $F_{b_{\max}} = 64$ $F_{b_{\max}} = 64$ , so that $G_{d} = 1 / 128$ $G_{d} = 1 / 128$ accomplishes this objective.

8.4.2.2 Rate Increases

Rate Increase is done in two phases: Fast Recovery and Active Increase (Figure 8.3)

Figure 8.3 Quantum Congestion Notification (QCN) Control Point (CP) operation.

Fast Recovery (FR): The source enters the FR state immediately after a rate decrease event, at which point the Byte Counter is reset. FR consists of 5 cycles, in each of which 150 Kbytes of data are transmitted (100 packets of 1500 bytes each), as counted by the Byte Counter. At the end of each cycle, R_T remains unchanged, and R_C is updated as follows:

$R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ $R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ (4)

(4)

The rationale behind this rule is if the source is able to transmit 100 packets without receiving another Rate Decrease message (which are sent by the CP once every 100 packets on the average since p_s=0.01), then it can conclude that the CP is uncongested, and therefore it increases its rate. Note that FR is similar to the way in which a Binary Increase Congestion Control (BIC) TCP source increases its window size after a packet drop (see Chapter 5). This mechanism was discovered independently by Alizadeh et al. [3].

Active Increase (AI): After 5 cycles of FR, the source enters the AI state, where it probes for extra bandwidth. AI consists of multiple cycles of 50 packets each. During this phase, R_T and R_C are updates as follows:

$R_{T} \leftarrow R_{T} + R_{A I}$ $R_{T} \leftarrow R_{T} + R_{A I}$ (5)

(5)

$R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ $R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ (6)

(6)

where R_AI is a constant, set to 5 mbps by default.

When R_C is extremely small after a rate decrease, then the time required to send out 150 Kbyes can be excessive. To speed this up, the source also uses a Timer, which is used as follows: The Timer is reset when the rate decrease message arrives. The source then enters FR and counts out 5 cycles of T ms duration (T=10 ms in the baseline implementation), and in the AI state, each cycle is T/2 ms long.

• In the AI state, the R_C is updated when either the Bye Counter or the Timer completes a cycle.

• The source is in the AI state if and only if either the Byte Counter or the Timer is in the AI state. In this case, when either completes a cycle, R_T and R_C are updated according to equations 5 and 6.

• The source is in the Hyper-Active Increase (HAI) state if both the Bye Counter and the Timer are in AI. In this case, at the completion of the i^th Byte Counter or Timer cycle, R_T and R_C are updated as follows:

$R_{T} \leftarrow R_{T} + i R_{H A I}$ $R_{T} \leftarrow R_{T} + i R_{H A I}$ (7)

(7)

$R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ $R_{C} \leftarrow \frac{R_{C} + R_{T}}{2}$ (8)

(8)

where R_HAI is set to 50 mbps in the baseline.

8.5 Quantum Congestion Notification Stability Analysis

The stability analysis is done on a simplified model of the type shown in Chapter 3, Figure 3.2, with N connections with the same round trip latency, passing through a single bottleneck node with capacity C [4]. Following the usual recipe, we will first write down the differential equations in the fluid limit and then linearize them around an operating point, which allows us to analyze their stability using tools such as the Nyquist stability criterion. We will assume that p_s is fixed at p_s=1% in this section to simplify the analysis. Also, all connections are assumed to have the same round trip latency equal to $τ$ $τ$ seconds.

In contrast to other congestion control protocols, two variables R_C(t) and R_T(t), are needed to describe the source behavior:

$\frac{d R_{C}}{d t} = - G_{d} F_{b} (t - τ) R_{C} (t) R_{C} (t - τ) p_{r} (t - τ) + (\frac{R_{T} (t) - R_{C} (t)}{2}) \frac{R_{C} (t - τ) p_{r} (t - τ)}{{(1 - p_{r} (t - τ))}^{- 100} - 1}$ $\frac{d R_{C}}{d t} = - G_{d} F_{b} (t - τ) R_{C} (t) R_{C} (t - τ) p_{r} (t - τ) + (\frac{R_{T} (t) - R_{C} (t)}{2}) \frac{R_{C} (t - τ) p_{r} (t - τ)}{{(1 - p_{r} (t - τ))}^{- 100} - 1}$ (9)

(9)

$\frac{d R_{T}}{d t} = - (R_{T} (t) - R_{C} (t)) R_{C} (t - τ) p_{r} (t - τ) + R_{A I} R_{C} (t - τ) \frac{p_{r} (t - τ)}{{(1 - p_{r} (t - τ))}^{- 500} - 1}$ $\frac{d R_{T}}{d t} = - (R_{T} (t) - R_{C} (t)) R_{C} (t - τ) p_{r} (t - τ) + R_{A I} R_{C} (t - τ) \frac{p_{r} (t - τ)}{{(1 - p_{r} (t - τ))}^{- 500} - 1}$ (10)

(10)

$\frac{d b}{d t} = {\begin{array}{l} N R_{C} (t) - C & i f & b (t) > 0 \\ \max {N R_{C} (t) - C, 0} & i f & b (t) = 0 \end{array}$ $\frac{d b}{d t} = {\begin{array}{l} N R_{C} (t) - C & i f & b (t) > 0 \\ \max {N R_{C} (t) - C, 0} & i f & b (t) = 0 \end{array}$ (11)

(11)

$F_{b} (t) = b (t) - b_{e q} + \frac{w}{C p_{s}} (N R_{C} (t) - C)$ $F_{b} (t) = b (t) - b_{e q} + \frac{w}{C p_{s}} (N R_{C} (t) - C)$ (12)

(12)

$p_{r} (t) = p_{s} 1_{[F_{b} (t) > 0]}$ $p_{r} (t) = p_{s} 1_{[F_{b} (t) > 0]}$ (13)

(13)

To justify the negative first terms in equations 9 and 10, note that $R_{C} (t - τ) p_{r} (t - τ)$ $R_{C} (t - τ) p_{r} (t - τ)$ is the rate at which negative ACKs are arriving at the source. Each of these causes R_C to decrease by $R_{C} (t) G_{d} F_{b} (t - τ)$ $R_{C} (t) G_{d} F_{b} (t - τ)$ and RT to decrease by $R_{T} (t) - R_{C} (t)$ $R_{T} (t) - R_{C} (t)$ .

To derive the positive second term in equation 9, consider the following: The rate R_C is increased on the transmission of 150 Kbytes, or 100 packets of 1500 bytes each, if no negative ACK is received in the interim. The change in R_C when this happens is given by

$\begin{array}{l} Δ R_{C} (t) & = \frac{R_{C} (t) + R_{T} (t)}{2} - R_{C} (t) \\ = \frac{R_{T} (t) - R_{C} (t)}{2} \end{array}$ $\begin{array}{l} Δ R_{C} (t) & = \frac{R_{C} (t) + R_{T} (t)}{2} - R_{C} (t) \\ = \frac{R_{T} (t) - R_{C} (t)}{2} \end{array}$

To compute the rate at which the R_C rate increase events occur, consider the Markov chain in Figure 8.4: It is in state k if k packets have been transmitted back to back without the receipt of a single negative ACK. Starting from state 0, in general, the system may undergo several cycles where it returns back to state 0 before it finally transmits 100 packets back to back and gets to state 100. It can be shown that the average number of packets transmitted to get to state 100 when starting from state 0, is given by

$E_{0} (T_{100}) = \frac{{(1 - p_{r})}^{- 100} - 1}{p_{r}}$ $E_{0} (T_{100}) = \frac{{(1 - p_{r})}^{- 100} - 1}{p_{r}}$ (14)

(14)

Figure 8.4 Markov chain governing the rate R_C.

This can be derived as follows: Define $u_{i} = E_{i} (T_{100}), 0 \leq i \leq 99$ $u_{i} = E_{i} (T_{100}), 0 \leq i \leq 99$ as the average number of packets transmitted to get to state 100 when starting from state i. Based on the Markov chain in Figure 8.4, the sequence u_i satisfies the following set of equations (with q_r=1−p_r):

$\begin{array}{l} u_{0} = (1 + u_{1}) q_{r} + (1 + u_{0}) p_{r} \\ u_{1} = (1 + u_{2}) q_{r} + (1 + u_{0}) p_{r} \\ \dots \\ u_{98} = (1 + u_{99}) q_{r} + (1 + u_{0}) p_{r} \\ u_{99} = q_{r} + (1 + u_{0}) p_{r} \end{array}$ $\begin{array}{l} u_{0} = (1 + u_{1}) q_{r} + (1 + u_{0}) p_{r} \\ u_{1} = (1 + u_{2}) q_{r} + (1 + u_{0}) p_{r} \\ \dots \\ u_{98} = (1 + u_{99}) q_{r} + (1 + u_{0}) p_{r} \\ u_{99} = q_{r} + (1 + u_{0}) p_{r} \end{array}$

This set of equations can be solved recursively for u₀, and results in equation 14.

Because the average time between packet transmissions is given by $\frac{1}{R_{C} (t - τ)}$ $\frac{1}{R_{C} (t - τ)}$ , it follows that the average time between increase events is given by

$Δ T = \frac{{(1 - p_{r} (t - τ))}^{- 100} - 1}{R_{C} (t - τ) p_{r} (t - τ)}$ $Δ T = \frac{{(1 - p_{r} (t - τ))}^{- 100} - 1}{R_{C} (t - τ) p_{r} (t - τ)}$ (15)

(15)

The second term on the RHS of equation 9 follows by dividing equation 14 by equation 15. The second term on the RHS of equation 10 is derived using similar considerations. In this case, R_T increments by R_AI when 500 packets are transmitted back to back without returning to state 0, which results in a Markov chain just like that in Figure 8.4, except that it extends up to state 500.

We now compute the equilibrium values of the variables in equations 9 to 13. In equilibrium, we replace p_r by p_s throughout.

Define

$η (p_{s}) = \frac{p_{s}}{{(1 - p_{s})}^{- 100} - 1}, ζ (p_{s}) = \frac{p_{s}}{{(1 - p_{s})}^{- 500} - 1}$ $η (p_{s}) = \frac{p_{s}}{{(1 - p_{s})}^{- 100} - 1}, ζ (p_{s}) = \frac{p_{s}}{{(1 - p_{s})}^{- 500} - 1}$

The fluid flow model in equations 9 to 13 has the following equilibrium points:

$R_{C}^{*} = \frac{C}{N}$ $R_{C}^{*} = \frac{C}{N}$ (16)

(16)

Equation 16 follows by setting db/dt=0 in equation 11.

$R_{T}^{*} = \frac{C}{N} + \frac{ς (p_{s}) R_{A I}}{p_{s}}$ $R_{T}^{*} = \frac{C}{N} + \frac{ς (p_{s}) R_{A I}}{p_{s}}$ (17)

(17)

Equation 17 follows by setting dR_T/dt=0 in equation 10.

Note that from equation 12, it follows that

$b^{*} = b_{e q} + F_{b}^{*}$ $b^{*} = b_{e q} + F_{b}^{*}$ (18)

(18)

and equation 9 implies that

$F_{b}^{*} = \frac{(R_{T}^{*} - R_{C}^{*})}{2 R_{C}^{*} ({(1 - p_{s})}^{- 100} - 1}$ $F_{b}^{*} = \frac{(R_{T}^{*} - R_{C}^{*})}{2 R_{C}^{*} ({(1 - p_{s})}^{- 100} - 1}$ (19)

(19)

Substituting for $R_{C}^{*}$ $R_{C}^{*}$ and $R_{T}^{*}$ $R_{T}^{*}$ from equations 16 and 17 into equation 19, we finally obtain

$b^{*} = b_{e q} + \frac{η (p_{s}) ς (p_{s}) N R_{A I}}{2 p_{s}^{2} G_{d} C}$ $b^{*} = b_{e q} + \frac{η (p_{s}) ς (p_{s}) N R_{A I}}{2 p_{s}^{2} G_{d} C}$ (20)

(20)

We now proceed to linearize the equations 9 to 11 around the equilibrium points $(R_{C}^{*}, R_{T}^{*}, b^{*})$ $(R_{C}^{*}, R_{T}^{*}, b^{*})$ . Define the following deltas around the equilibrium:

$δ R_{C} (t) = R_{C} (t) - R_{C}^{*}, δ R_{T} (t) = R_{T} (t) - R_{T}^{*}, δ b (t) = b (t) - b^{*}$ $δ R_{C} (t) = R_{C} (t) - R_{C}^{*}, δ R_{T} (t) = R_{T} (t) - R_{T}^{*}, δ b (t) = b (t) - b^{*}$

Using the Linearization procedure described in Appendix 3.A of Chapter 3, it can be shown that the following equations result:

$\frac{d δ R_{C}}{d t} = - a_{1} δ R_{C} (t) + a_{2} δ R_{T} (t) - a_{3} δ R_{C} (t - τ) - a_{4} δ b (t - τ)$ $\frac{d δ R_{C}}{d t} = - a_{1} δ R_{C} (t) + a_{2} δ R_{T} (t) - a_{3} δ R_{C} (t - τ) - a_{4} δ b (t - τ)$ (21)

(21)

$\frac{d δ R_{T}}{d t} = g δ R_{C} (t) - g δ R_{T} (t)$ $\frac{d δ R_{T}}{d t} = g δ R_{C} (t) - g δ R_{T} (t)$ (22)

(22)

$\frac{d δ b}{d t} = N δ R_{C} (t)$ $\frac{d δ b}{d t} = N δ R_{C} (t)$

where

$a_{1} = \frac{η (p_{s})}{2} R_{C}^{*} + \frac{η (p_{s}) ς (p_{s})}{2 p_{s}} R_{A I}$ $a_{1} = \frac{η (p_{s})}{2} R_{C}^{*} + \frac{η (p_{s}) ς (p_{s})}{2 p_{s}} R_{A I}$

$a_{2} = \frac{η (p_{s})}{2} R_{C}^{*}, a_{3} = G_{d} w R_{C}^{*}, a_{4} = p_{s} G_{d} {(R_{C}^{*})}^{2}, g = p_{s} R_{C}^{*}$ $a_{2} = \frac{η (p_{s})}{2} R_{C}^{*}, a_{3} = G_{d} w R_{C}^{*}, a_{4} = p_{s} G_{d} {(R_{C}^{*})}^{2}, g = p_{s} R_{C}^{*}$

This system of equations can be shown to have the following characteristic function:

$1 + G (s) = 0$ $1 + G (s) = 0$ (23)

(23)

where

$G (s) = e^{- s τ} \frac{a_{3} (s + g) (s + γ)}{s (s^{2} + β s + α)}$ $G (s) = e^{- s τ} \frac{a_{3} (s + g) (s + γ)}{s (s^{2} + β s + α)}$ (24)

(24)

with

$γ = \frac{C p_{s}}{w}, β = g + a_{1}, α = g (a_{1} - a_{2}) .$ $γ = \frac{C p_{s}}{w}, β = g + a_{1}, α = g (a_{1} - a_{2}) .$

We now state the stability result for QCN. Let

$τ^{*} = \frac{1}{ω^{*}} (\tan^{- 1} \frac{ω^{*}}{g} - \tan^{- 1} \frac{ω^{*}}{β} + \tan^{- 1} \frac{ω^{*}}{γ})$ $τ^{*} = \frac{1}{ω^{*}} (\tan^{- 1} \frac{ω^{*}}{g} - \tan^{- 1} \frac{ω^{*}}{β} + \tan^{- 1} \frac{ω^{*}}{γ})$ (25)

(25)

where

$ω^{*} = \sqrt{\frac{a_{3}^{2}}{2} + \sqrt{\frac{a_{3}^{4}}{4} + γ^{2} a_{3}^{2}}}$ $ω^{*} = \sqrt{\frac{a_{3}^{2}}{2} + \sqrt{\frac{a_{3}^{4}}{4} + γ^{2} a_{3}^{2}}}$ (26)

(26)

Then $τ^{*} > 0$ $τ^{*} > 0$ , and the system in equations 21 and 22 is stable for all $τ \leq τ^{*}$ $τ \leq τ^{*}$ .

To prove $τ^{*} > 0$ $τ^{*} > 0$ , note that because $β > g$ $β > g$ , it follows that $\tan^{- 1} \frac{ω^{*}}{g} > \tan^{- 1} \frac{ω^{*}}{β}$ $\tan^{- 1} \frac{ω^{*}}{g} > \tan^{- 1} \frac{ω^{*}}{β}$ .

To apply the Nyquist criterion, we pass to the frequency domain and write equation 24 as

$G (j ω) = | G (j ω) | e^{- j arg (G (j ω))}$ $G (j ω) = | G (j ω) | e^{- j arg (G (j ω))}$

where

$\begin{array}{l} | G (j ω) |^{2} & = \frac{a_{3}^{2} (ω^{2} + g^{2}) (ω^{2} + γ^{2})}{ω^{2} ({(ω^{2} - α)}^{2} + β^{2} ω^{2})} \\ < \frac{a_{3}^{2} (ω^{2} + g^{2}) (ω^{2} + γ^{2})}{ω^{4} (ω^{2} + β^{2} - 2 α)} \\ < \frac{a_{3}^{2} (ω^{2} + γ^{2})}{ω^{4}} \end{array}$ $\begin{array}{l} | G (j ω) |^{2} & = \frac{a_{3}^{2} (ω^{2} + g^{2}) (ω^{2} + γ^{2})}{ω^{2} ({(ω^{2} - α)}^{2} + β^{2} ω^{2})} \\ < \frac{a_{3}^{2} (ω^{2} + g^{2}) (ω^{2} + γ^{2})}{ω^{4} (ω^{2} + β^{2} - 2 α)} \\ < \frac{a_{3}^{2} (ω^{2} + γ^{2})}{ω^{4}} \end{array}$

The last inequality follows from the fact that $β^{2} - 2 α > g^{2}$ $β^{2} - 2 α > g^{2}$ , which can verified by substituting $β = g + a_{1}, α = g (a_{1} - a_{2})$ $β = g + a_{1}, α = g (a_{1} - a_{2})$ . Note that setting $\frac{a_{3}^{2} (ω^{2} + γ^{2})}{ω^{4}} = 1$ $\frac{a_{3}^{2} (ω^{2} + γ^{2})}{ω^{4}} = 1$ , implies that $ω = ω^{*}$ $ω = ω^{*}$ . Hence, it follows that $| G (j ω^{*}) | < 1$ $| G (j ω^{*}) | < 1$ . Because $| G (j ω) |$ $| G (j ω) |$ is a monotonically decreasing function of $ω$ $ω$ , it follows that the critical frequency $ω_{c}$ $ω_{c}$ at which $| G (j ω_{c}) | = 1$ $| G (j ω_{c}) | = 1$ , is such that $ω_{c} < ω^{*}$ $ω_{c} < ω^{*}$ . By the Nyquist criterion, if we can show that $arg (G (j ω)) < π$ $arg (G (j ω)) < π$ for all $0 \leq ω < ω^{*}$ $0 \leq ω < ω^{*}$ , then the system is stable. This can be done as follows:

$\begin{array}{l} arg (G (j ω)) & = \frac{π}{2} + ω τ + \tan^{- 1} (\frac{β ω}{α - ω^{2}}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = \frac{π}{2} + ω τ + \frac{π}{2} - \tan^{- 1} (\frac{α - ω^{2}}{β ω}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = π + ω τ + \tan^{- 1} (\frac{ω^{2} - α}{β ω}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \end{array}$ $\begin{array}{l} arg (G (j ω)) & = \frac{π}{2} + ω τ + \tan^{- 1} (\frac{β ω}{α - ω^{2}}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = \frac{π}{2} + ω τ + \frac{π}{2} - \tan^{- 1} (\frac{α - ω^{2}}{β ω}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = π + ω τ + \tan^{- 1} (\frac{ω^{2} - α}{β ω}) - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \end{array}$

Because $α > 0$ $α > 0$ , it follows that

$\begin{array}{l} arg (G (j ω)) & < π + ω τ + \tan^{- 1} \frac{ω}{β} - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + ω^{2}}) - \tan^{- 1} \frac{ω}{γ} \\ \leq π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + {(ω^{*})}^{2}}) - \tan^{- 1} \frac{ω}{γ} \end{array}$ $\begin{array}{l} arg (G (j ω)) & < π + ω τ + \tan^{- 1} \frac{ω}{β} - \tan^{- 1} \frac{ω}{g} - \tan^{- 1} \frac{ω}{γ} \\ = π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + ω^{2}}) - \tan^{- 1} \frac{ω}{γ} \\ \leq π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + {(ω^{*})}^{2}}) - \tan^{- 1} \frac{ω}{γ} \end{array}$ (27)

(27)

The last inequality follows from the fact that $ω \leq ω^{*}$ $ω \leq ω^{*}$ . Defining

$Ψ (ω) = π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + {(ω^{*})}^{2}}) - \tan^{- 1} \frac{ω}{γ}$ $Ψ (ω) = π + ω τ - \tan^{- 1} (\frac{(β - g) ω}{β g + {(ω^{*})}^{2}}) - \tan^{- 1} \frac{ω}{γ}$

note that $Ψ (0) = 0$ $Ψ (0) = 0$ and for $τ \leq τ^{*}$ $τ \leq τ^{*}$ , $Ψ (ω^{*}) \leq π$ $Ψ (ω^{*}) \leq π$ . Moreover $Ψ (ω)$ $Ψ (ω)$ is convex for $0 \leq ω \leq ω^{*}$ $0 \leq ω \leq ω^{*}$ , which implies that $Ψ (ω) \leq π$ $Ψ (ω) \leq π$ for $ω \in [0, ω^{*}]$ $ω \in [0, ω^{*}]$ , and from equation 27, it follows that $arg (G (j ω)) < π$ $arg (G (j ω)) < π$ in this range, so that the Nyquist stability criterion is satisfied.

8.5.1 Discussion of the Stability Result

Writing out the formula for $| G (j ω) |$ $| G (j ω) |$ in detail, we get

${| G (j ω) |}^{2} = \frac{{(\frac{G_{d} w C}{N})}^{2} (ω^{2} + {(\frac{p_{s} C}{N})}^{2}) (ω^{2} + {(\frac{p_{s} C}{w})}^{2})}{ω^{2} ({(ω^{2} - \frac{g η (p_{s}) ζ (p_{s}) R_{A I}}{2 p_{s}})}^{2} + ω^{2} {(\frac{p_{s} C}{N} + \frac{η (p_{s}) C}{2 N} + \frac{η (p_{s}) ζ (p_{s}) R_{A I}}{2 p_{s}})}^{2})}$ ${| G (j ω) |}^{2} = \frac{{(\frac{G_{d} w C}{N})}^{2} (ω^{2} + {(\frac{p_{s} C}{N})}^{2}) (ω^{2} + {(\frac{p_{s} C}{w})}^{2})}{ω^{2} ({(ω^{2} - \frac{g η (p_{s}) ζ (p_{s}) R_{A I}}{2 p_{s}})}^{2} + ω^{2} {(\frac{p_{s} C}{N} + \frac{η (p_{s}) C}{2 N} + \frac{η (p_{s}) ζ (p_{s}) R_{A I}}{2 p_{s}})}^{2})}$ (28)

(28)

From equation 28, we can see that the loop gain K for the system is of the order given by

$K ~ O (\frac{C^{2}}{N})$ $K ~ O (\frac{C^{2}}{N})$ (29)

(29)

This is in contrast to TCP Reno (see Chapter 3, Section 3.4), whose loop gain (without RED) is of the order

$K_{R E N O} ~ O (\frac{C^{3} τ^{3}}{N^{2}})$ $K_{R E N O} ~ O (\frac{C^{3} τ^{3}}{N^{2}})$

The absence of the round trip latency from the loop gain in equation 29 is attributable to the fact that QCN is not a window-based congestion control algorithm. In both cases, an increase in link capacity C or a decrease in number of connections drives the system toward instability. Because $τ < < 1$ $τ < < 1$ , the window based feedback loop plays a role in stabilizing Reno compared with QCN by reducing the system loop gain.

Also note that the critical frequency can be written as

$\begin{array}{l} ω^{*} & = a_{3} \sqrt{\frac{1}{2} + \sqrt{\frac{1}{4} + {(\frac{γ}{a_{3}})}^{2}}} \\ = \frac{G_{d} w C}{N} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} \end{array}$ $\begin{array}{l} ω^{*} & = a_{3} \sqrt{\frac{1}{2} + \sqrt{\frac{1}{4} + {(\frac{γ}{a_{3}})}^{2}}} \\ = \frac{G_{d} w C}{N} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} \end{array}$ (30)

(30)

It follows that

$\frac{ω^{*}}{γ} = \frac{G_{d} w^{2}}{N p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}$ $\frac{ω^{*}}{γ} = \frac{G_{d} w^{2}}{N p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}$ (31)

(31)

$\frac{ω^{*}}{g} = \frac{G_{d} w}{p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} and$ $\frac{ω^{*}}{g} = \frac{G_{d} w}{p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} and$ (32)

(32)

$\frac{ω^{*}}{β} = \frac{G_{d} w}{[p_{s} + \frac{η (p_{s})}{2} + \frac{η (p_{s}) ς (p_{s}) N R_{A I}}{2 C p_{s}}]} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}$ $\frac{ω^{*}}{β} = \frac{G_{d} w}{[p_{s} + \frac{η (p_{s})}{2} + \frac{η (p_{s}) ς (p_{s}) N R_{A I}}{2 C p_{s}}]} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}$ (33)

(33)

The last two terms in the denominator of equation 33 are much smaller than the first term. As a result, it follows that $\frac{ω^{*}}{g} \approx \frac{ω^{*}}{β}$ $\frac{ω^{*}}{g} \approx \frac{ω^{*}}{β}$ , so that the stability threshold for latency is given by

$τ^{*} \approx \frac{\tan^{- 1} ⌊ \frac{G_{d} w^{2}}{N p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} ⌋}{\frac{G_{d} w C}{N} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}}$ $τ^{*} \approx \frac{\tan^{- 1} ⌊ \frac{G_{d} w^{2}}{N p_{s}} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}} ⌋}{\frac{G_{d} w C}{N} \sqrt{0.5 + \sqrt{0.25 + {(\frac{N p_{s}}{G_{d} w^{2}})}^{2}}}}$ (34)

(34)

Substituting G_d=1/128, w=2 and p_s=0.01, we obtain

$ω^{*} = \frac{C}{64 N} \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}}$ $ω^{*} = \frac{C}{64 N} \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}}$ (35)

(35)

and

$τ^{*} \approx \frac{64 N}{C \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}}} \tan^{- 1} ⌊ \frac{3.125}{N} \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}} ⌋$ $τ^{*} \approx \frac{64 N}{C \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}}} \tan^{- 1} ⌊ \frac{3.125}{N} \sqrt{0.5 + \sqrt{0.25 + 0.1024 N^{2}}} ⌋$ (36)

(36)

Hence, the stability threshold for the round trip latency is inversely proportional to C and directly proportional to $\sqrt{N}$ $\sqrt{N}$ . For example, substituting C=1 Gbps (=83,333 packets/s) and N=10 yields $τ^{*} \approx 2.158 ms$ $τ^{*} \approx 2.158 ms$ .

8.6 Further Reading

In addition to QCN, there were two other algorithms that were considered as candidates for the IEEE802.1Qau protocol. The algorithm by Jiang et al. [5] uses an AQM scheme with explicit rate calculation at the network nodes that is fed back to the source. This scheme has some similarities to the RCP algorithm from Chapter 5. The algorithm by Bergamasco and Pan [6] has some similarities to QCN because it is also based on an AQM scheme that provides PI feedback back to the source using a quantized congestion number F_b. The source nodes then use this number to adjust the parameters of their additive increase/multiplicative decrease (AIMD) scheme, such that the rate is additively increased if F_b>0 and multiplicatively decreased if F_b<0.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 8. Congestion Control in Ethernet Networks

Create new playlist

Sign In

Sign Up

Congestion Control in Ethernet Networks

Keywords

8.1 Introduction

8.2 Differences between Switched Ethernet and IP Networks

8.3 Objectives of the Quantum Congestion Notification Algorithm

8.4 Quantum Congestion Notification Algorithm Description

8.4.1 The Control Point or CP Algorithm

8.4.2 The Reaction Point or RP Algorithm

8.4.2.1 Rate Decreases

8.4.2.2 Rate Increases

8.5 Quantum Congestion Notification Stability Analysis

8.5.1 Discussion of the Stability Result

8.6 Further Reading

Table of Contents for
Chapter 8. Congestion Control in Ethernet Networks