Chapter 4: Measuring Availability

Chapter 4.

Measuring availability is important to keeping your system highly available. Only by measuring availability can you understand how your application is performing now and examine how your application’s availability changes over time.

The most widely held mechanism for measuring the availability of a web application is calculating the percent of time it’s accessible for use by customers. We can describe this by using the following formula for a given period:

Site availability percentage = total_seconds_in_period − seconds_system_is_down / total_seconds_in_period

Figure 4 -1. Availability formula

Let’s consider an example. Suppose that over the month of April, your website was down twice; the first time it was down for 37 minutes, and the second time it was down for 15 minutes. What is the availability of your website?

Example 4-1. Availability percentage

Total Number of Seconds Down = (37 + 15)*60 = 3,120 seconds

Total Number of Seconds in Month = 30 days*86,400 seconds/day = 2,592,000 seconds

Site availability percentage = total_seconds_in_period − seconds_system_is_down / total_seconds_in_period

Site availability percentage = 2,592,000 seconds − 3,120 seconds / 2,592,000 seconds

Site availability percentage = 99.8795

Your site availability is 99.8795%.

You can see from Example 4-1 that it only takes a small amount of outage to have an impact on your availability percentage.

The Nines

Often you will hear availability described as “the nines.” This is a shorthand way of indicating high availability percentages. Table 4-1 illustrates what it means.

Nines Percentage Monthly outage
2 Nines 99% 432 minutes
3 Nines 99.9% 43 minutes
4 Nines 99.99% 4 minutes
5 Nines 99.999% 26 seconds
6 Nines 99.9999% 2.6 seconds

Table 4-1. The Nines

In Example , we see that the website has fallen just short of the 3 nines metric (99.8795% compared to 99.9%). For a website that maintains 5 nines of availability, there can be only 26 seconds of downtime every month.

What’s Reasonable?

What’s a reasonable availability number in order to consider your system as high availability?

It is impossible to give a single answer to this question because it depends dramatically on your website, your customer expectations, your business needs, and your business expectations. You need to determine for yourself what number is required for your business.

Often, for basic web applications, 3 nines is considered acceptable availability. Using Table #table(thenines), this amounts to 43 minutes of downtime every month. For a web application to be considered highly available, often an indication of 5 nines is used. This amounts to only 26 seconds of downtime every month.

Don’t Be Fooled

Don’t be fooled into thinking your site is highly available when it isn’t. Planned and regular maintenance that involves your application being unavailable still count against availability.

Here’s a comment that I often overhear: “Our application never fails. That’s because we regularly perform system maintenance. By scheduling weekly two-hour maintenance windows, and performing maintenance during these windows, we keep our availability high.”

Does this group keep its application’s availability high?

Let’s find out.

Example 4- 2 . Maintenance Window Example Availability

Site availability percentage = total_hours_in_period − hours_system_is_down / total_hours_in_period

hours_in_week = 7 days*24 hours = 168 hours

hours_unavailable_each_week = 2 hours

Site availability (no failures) = 168 hours − 2 hours / 168 hours = 98.8%

Site availability (no failures) = 98.8%

Without having a single failure of its application, the best this organization can achieve is 98.8% availability. This falls short of even 2 nines of availability (98.8% versus 99%).

Planned maintenance hurts nearly as much as unplanned outages. If your customer expects your application to be available and it isn’t, your customer has a negative experience. It doesn’t matter if you planned for the outage or not.

Availability by the Numbers

Measuring availability is important to keeping your system highly available, now and in the future. This chapter discussed a common mechanism for measuring availability and provided some guidelines on what is considered reasonable availability.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.191.202.72