Chapter 3. Measuring Availability

Measuring availability is important to keeping your system highly available. Only by measuring availability can you understand how your application is performing now and examine how your application’s availability changes over time.

The most widely held mechanism for measuring the availability of a web application is calculating the percent of time it’s accessible for use by customers. We can describe this by using the following formula for a given period:

Site availability percentage equals StartFraction t o t a l normal bar s e c o n d s normal bar i n normal bar p e r i o d minus s e c o n d s normal bar s y s t e m normal bar i s normal bar d o w n Over t o t a l normal bar s e c o n d s normal bar i n normal bar p e r i o d EndFraction

Let’s consider an example. Suppose that over the month of April, your website was down twice; the first time it was down for 37 minutes, and the second time it was down for 15 minutes. What is the availability of your website?

Example 3-1. Availability percentage
Total Number of Seconds Down equals left-parenthesis 37 plus 15 right-parenthesis asterisk 60 equals 3 comma 120 s e c o n d s
Total Number of Seconds in Month equals 30 d a y s asterisk 86 comma 400 s e c o n d s slash d a y equals 2 comma 592 comma 000 s e c o n d s
Site availability percentage equals StartFraction t o t a l normal bar s e c o n d s normal bar i n normal bar p e r i o d minus s e c o n d s normal bar s y s t e m normal bar i s normal bar d o w n Over t o t a l normal bar s e c o n d s normal bar i n normal bar p e r i o d EndFraction
Site availability percentage equals StartFraction 2 comma 592 comma 000 s e c o n d s minus 3 comma 120 s e c o n d s Over 2 comma 592 comma 000 s e c o n d s EndFraction
Site availability percentage equals 99.8795

Your site availability is 99.8795%.

You can see from this example that it only takes a small amount of outage to have an impact on your availability percentage.

The Nines

Often you will hear availability described as “the nines.” This is a shorthand way of indicating high availability percentages. Table 3-1 illustrates what it means.

Table 3-1. The Nines
Nines Percentage Monthly outage a

2 Nines

99%

432 minutes

3 Nines

99.9%

43 minutes

4 Nines

99.99%

4 minutes

5 Nines

99.999%

26 seconds

6 Nines

99.9999%

2.6 seconds

a This assumes a 30-day month with 43,200 minutes in the month.

In Example 3-1, we see that the website has fallen just short of the 3 nines metric (99.8795% compared to 99.9%). For a website that maintains 5 nines of availability, there can be only 26 seconds of downtime every month.

What’s Reasonable?

What’s a reasonable availability number in order to consider your system as high availability?

It is impossible to give a single answer to this question because it depends dramatically on your website, your customer expectations, your business needs, and your business expectations. You need to determine for yourself what number is required for your business.

Often, for basic web applications, 3 nines is considered acceptable availability. Using Table 3-1, this amounts to 43 minutes of downtime every month. For a web application to be considered highly available, often an indication of 5 nines is used. This amounts to only 26 seconds of downtime every month.

Don’t Be Fooled

Don’t be fooled into thinking your site is highly available when it isn’t. Planned and regular maintenance that involves your application being unavailable still count against availability.

Here’s a comment that I often overhear: “Our application never fails. That’s because we regularly perform system maintenance. By scheduling weekly two-hour maintenance windows, and performing maintenace during these windows, we keep our availability high.”

Does this group keep its application’s availability high?

Let’s find out.

Example 3-2. Maintenance Window Example Availability
Site availability percentage equals StartFraction t o t a l normal bar h o u r s normal bar i n normal bar p e r i o d minus h o u r s normal bar s y s t e m normal bar i s normal bar d o w n Over t o t a l normal bar h o u r s normal bar i n normal bar p e r i o d EndFraction
h o u r s normal bar i n normal bar w e e k equals 7 d a y s asterisk 24 h o u r s equals 168 h o u r s
h o u r s normal bar u n a v a i l a b l e normal bar e a c h normal bar w e e k equals 2 h o u r s
Site availability left-parenthesis no failures right-parenthesis equals StartFraction 168 h o u r s minus 2 h o u r s Over 168 h o u r s EndFraction equals 98.8 percent-sign
Site availability left-parenthesis no failures right-parenthesis equals 98.8 percent-sign

Without having a single failure of its application, the best this organization can achieve is 98.8% availability. This falls short of even 2 nines of availability (98.8% versus 99%).

Planned maintenance hurts nearly as much as unplanned outages. If your customer expects your application to be available and it isn’t, your customer has a negative experience. It doesn’t matter if you planned for the outage or not.

Availability by the Numbers

Measuring availability is important to keeping your system highly available, now and in the future. This chapter discussed a common mechanism for measuring availability and provided some guidelines on what is considered reasonable availability.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.116.20.52