11.3 Disk Scheduling

The most important hardware device used as secondary memory is the magnetic disk drive. File systems stored on these drives must be accessed in an efficient manner. It turns out that transferring data to and from secondary memory is the worst bottleneck in a general computer system.

Recall from Chapter 10 that the CPU and main memory operate much faster than data can be transferred to and from secondary memory such as a magnetic disk. That’s why a process that must perform disk I/O is made to wait while the data is transferred, giving another process a chance to use the CPU.

Because secondary I/O is the slowest aspect of a general computer system, the techniques for accessing data on a disk drive are of crucial importance to file systems. As a computer deals with multiple processes over a period of time, requests to access the disk accumulate. The technique that the operating system uses to determine which requests to satisfy first is called disk scheduling. We examine several specific disk-scheduling algorithms in this section.

Recall from Chapter 5 that a magnetic disk drive is organized as a stack of platters, where each platter is divided into tracks, and each track is divided into sectors. The set of corresponding tracks on all platters is called a cylinder. FIGURE 11.6 revisits the disk drive depicted in Chapter 5 to remind you of this organization.

FIGURE 11.6 A magnetic disk drive

Of primary importance to us in this discussion is the fact that the set of read/write heads hovers over a particular cylinder along all platters at any given point in time. The seek time is the amount of time it takes for the heads to reach the appropriate cylinder. The latency is the additional time it takes the platter to rotate into the proper position so that the data can be read or written. Seek time is the more restrictive of these two parameters and, therefore, is the primary issue dealt with by the disk-scheduling algorithms.
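To make the two delays concrete, here is a rough sketch in Python. The rotation speed (7200 RPM) and the seek time (9 ms) are illustrative assumptions, not values from this chapter, and the function names are our own. On average the platter must turn half a rotation before the desired sector arrives under the heads, so the average latency is half the time of one rotation. Transfer time is ignored here.

```python
def average_rotational_latency_ms(rpm: float) -> float:
    """Average latency: half of one full rotation, in milliseconds."""
    ms_per_rotation = 60_000.0 / rpm  # 60,000 ms in one minute
    return ms_per_rotation / 2.0


def positioning_time_ms(seek_ms: float, rpm: float) -> float:
    """Seek time plus average latency (data transfer time is ignored)."""
    return seek_ms + average_rotational_latency_ms(rpm)


# Assumed, illustrative values (not taken from the text).
rpm = 7200
seek_ms = 9.0

print(round(average_rotational_latency_ms(rpm), 2))  # 4.17
print(round(positioning_time_ms(seek_ms, rpm), 2))   # 13.17
```

With these assumed numbers the seek accounts for most of the delay, and it is also the part that depends on the order in which requests are served, which is why scheduling concentrates on it.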

At any point in time, a disk drive may have a set of outstanding requests that must be satisfied. For now, we consider only the cylinder (the parallel concentric circles) to which the requests refer. A disk may have thousands of cylinders. To keep things simple, let’s also assume a range of 110 cylinders. Suppose at a particular time the following cylinder requests have been made, in this order:

49, 91, 22, 61, 7, 62, 33, 35

Suppose also that the read/write heads are currently at cylinder 26. Now a question arises: To which cylinder should the disk heads move next? Different algorithms produce different answers to this question.

First-Come, First-Served Disk Scheduling

In Chapter 10 we examined a CPU-scheduling algorithm called first come, first served (FCFS). An analogous algorithm can be used for disk scheduling. It is one of the easiest to implement, though not usually the most efficient.

In FCFS, we process the requests in the order they arrive, without regard to the current position of the heads. Therefore, under an FCFS algorithm, the heads move from cylinder 26 (the current position) to cylinder 49. After the request for cylinder 49 is satisfied (that is, the data is read or written), the heads move from 49 to 91. After processing the request at 91, the heads move to cylinder 22. Processing continues in this manner, satisfying requests in the order that they were received.

Note that at one point the heads move from cylinder 91 all the way back to cylinder 22, during which they pass over several cylinders whose requests are currently pending.
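The cost of FCFS can be measured by adding up how many cylinders the heads cross. The following Python sketch does this for our example. The function name fcfs and the idea of counting cylinders crossed as the cost are our own choices for illustration.

```python
def fcfs(start, requests):
    """Serve requests in arrival order; return the order and total head movement."""
    order = list(requests)
    total = 0
    position = start
    for cylinder in order:
        total += abs(cylinder - position)  # cylinders crossed for this move
        position = cylinder
    return order, total


requests = [49, 91, 22, 61, 7, 62, 33, 35]
order, total = fcfs(26, requests)
print(order)  # [49, 91, 22, 61, 7, 62, 33, 35]
print(total)  # 313
```

The long trips from 91 to 22 and from 7 to 62 account for a large share of the 313 cylinders traveled.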

Shortest-Seek-Time-First Disk Scheduling

The shortest-seek-time-first (SSTF) disk-scheduling algorithm moves the heads by the minimum amount necessary to satisfy any pending request. This approach could potentially result in the heads changing directions after each request is satisfied.

Let’s process our hypothetical situation using this algorithm. From our starting point at cylinder 26, the closest cylinder among all pending requests is 22. So, ignoring the order in which the requests came, the heads move to cylinder 22 to satisfy that request. From 22, the closest request is for cylinder 33, so the heads move there. The closest unsatisfied request to 33 is at cylinder 35. From 35, the closest pending request is at cylinder 49, so the heads move there next. Continuing that approach, the remaining cylinders are visited in the following order: 61, 62, 91, 7.
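The same walk-through can be sketched in Python. The function name sstf is our own, and one detail is an assumption the text does not address: if two pending requests are equally close, this sketch picks the one that arrived first.

```python
def sstf(start, requests):
    """Repeatedly serve the pending request closest to the current position."""
    pending = list(requests)
    position = start
    order = []
    total = 0
    while pending:
        # min() returns the earliest-arrived request among equally close ones.
        nearest = min(pending, key=lambda c: abs(c - position))
        pending.remove(nearest)
        total += abs(nearest - position)
        position = nearest
        order.append(nearest)
    return order, total


requests = [49, 91, 22, 61, 7, 62, 33, 35]
order, total = sstf(26, requests)
print(order)  # [22, 33, 35, 49, 61, 62, 91, 7]
print(total)  # 157
```

For this set of requests the heads travel 157 cylinders, about half of what the same measure gives for FCFS (313).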

This approach does not guarantee the smallest overall head movement, but it generally offers an improvement in performance over the FCFS algorithm. However, a major problem can arise with this approach. Suppose requests for cylinders continue to build up while existing ones are being satisfied. And suppose those new requests are always closer to the current position than an earlier request. It is theoretically possible that the early request never gets processed because requests keep arriving that take priority. This situation is called starvation. By contrast, FCFS disk scheduling cannot suffer from starvation.
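Starvation is easy to reproduce in a small simulation. In the Python sketch below, a request for cylinder 100 is already waiting, and one new request arrives before each move. The arrival pattern and the function name are made up for illustration; the point is only that every new arrival is closer to the heads than the old request.

```python
def sstf_with_arrivals(start, pending, arrivals):
    """Serve one request per step; arrivals[i] joins the queue before step i."""
    pending = list(pending)
    position = start
    served = []
    for new_request in arrivals:
        pending.append(new_request)
        nearest = min(pending, key=lambda c: abs(c - position))
        pending.remove(nearest)
        position = nearest
        served.append(nearest)
    return served, pending


# Hypothetical stream of requests that always land near the heads.
arrivals = [27, 25, 28, 24, 29, 23]
served, still_pending = sstf_with_arrivals(26, [100], arrivals)
print(served)         # [27, 25, 28, 24, 29, 23]
print(still_pending)  # [100]
```

As long as nearby requests keep arriving, the request for cylinder 100 stays in the queue. Under FCFS it would have been served first, because it arrived first.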

SCAN Disk Scheduling

A classic example of algorithm analysis in computing comes from the way an elevator is designed to visit floors that have people waiting. In general, an elevator moves from one extreme to the other (say, the top of the building to the bottom), servicing requests as appropriate. It then travels from the bottom to the top, servicing those requests.

The SCAN disk-scheduling algorithm works in a similar way, except that instead of moving up and down, the read/write heads move in toward the spindle, then out toward the platter edge, then back toward the spindle, and so forth.

Let’s use this algorithm to satisfy our set of requests. Unlike in the other approaches, though, we need to decide which way the heads are moving initially. Let’s assume they are moving toward the lower cylinder values (and are currently at cylinder 26).

As the read/write heads move from cylinder 26 toward cylinder 1, they satisfy the requests at cylinders 22 and 7 (in that order). After reaching cylinder 1, the heads reverse direction and move all the way out to the other extreme. Along the way, they satisfy the following requests, in order: 33, 35, 49, 61, 62, 91.

New requests are not given any special treatment under this scheme. They may or may not be serviced before earlier requests—it depends on the current location of the heads and direction in which they are moving. If a new request arrives just before the heads reach that cylinder, it is processed right away. If it arrives just after the heads move past that cylinder, it must wait for the heads to return. There is no chance for starvation because each cylinder is processed in turn.
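For a fixed set of pending requests, the SCAN service order can be sketched in Python as follows. The function name scan and the "down"/"up" direction labels are our own. The sketch reports only the order in which requests are satisfied, and it assumes a request for the cylinder the heads are currently on is served right away.

```python
def scan(start, requests, direction="down"):
    """Order in which SCAN satisfies a fixed set of pending requests."""
    if direction == "down":
        # Sweep toward lower cylinders first, then reverse and sweep upward.
        first = sorted((c for c in requests if c <= start), reverse=True)
        second = sorted(c for c in requests if c > start)
    else:
        # Sweep toward higher cylinders first, then reverse and sweep downward.
        first = sorted(c for c in requests if c >= start)
        second = sorted((c for c in requests if c < start), reverse=True)
    return first + second


requests = [49, 91, 22, 61, 7, 62, 33, 35]
print(scan(26, requests, "down"))  # [22, 7, 33, 35, 49, 61, 62, 91]
print(scan(26, requests, "up"))    # [33, 35, 49, 61, 62, 91, 22, 7]
```

The initial direction matters: with the heads moving toward the higher cylinders instead, the requests at 22 and 7 would be served last.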

Some variations on this algorithm can improve its performance. For example, a request at the edge of the platter may have to wait for the heads to move almost all the way to the spindle and all the way back. To improve the average wait time, the Circular SCAN algorithm treats the disk as if it were a ring and not a disk. That is, when it reaches one extreme, the heads return all the way to the other extreme without processing requests.

Tony Hoare

Tony Hoare’s interest in computing was awakened in the early 1950s, when he studied philosophy (together with Latin and Greek) at Oxford University under the tutelage of John Lucas. He was fascinated by the power of mathematical logic as an explanation of the apparent certainty of mathematical truth. During his National Service (1956–1958), he studied Russian in the Royal Navy. Then he took a qualification in statistics, and incidentally a course in programming given by Leslie Fox. In 1959, as a graduate student at Moscow State University, he studied the machine translation of languages (together with probability theory) in the school of Kolmogorov. To assist in efficient lookup of words in a dictionary, he discovered the well-known sorting algorithm Quicksort.

On return to England in 1960, he worked as a programmer for Elliott Brothers, a small scientific computer manufacturer. He led a team (including his later wife, Jill) in the design and delivery of the first commercial compiler for the programming language ALGOL 60. He attributes the success of the project to the use of ALGOL itself as the design language for the compiler, although the implementation used decimal machine code. Promoted to the rank of chief engineer, he then led a larger team on a disastrous project to implement an operating system. After managing a recovery from the failure, he moved as chief scientist to the computing research division, where he worked on the hardware and software architecture for future machines.

Courtesy of Inamori Foundation

These machines were cancelled when the company merged with its rivals, and in 1968 Tony took a chance to apply for the Professorship of Computing Science at the Queen’s University, Belfast. His research goal was to understand why operating systems were so much more difficult than compilers, and to see if advances in programming theory and languages could help with the problems of concurrency. In spite of civil disturbances, he built up a strong teaching and research department and published a series of papers on the use of assertions to prove correctness of computer programs. He knew that this was long-term research, unlikely to achieve industrial application within the span of his academic career.

In 1977 he moved to Oxford University and undertook to build up the Programming Research Group, founded by Christopher Strachey. With the aid of external funding from government initiatives, industrial collaborations, and charitable donations, Oxford now teaches a range of degree courses in computer science, including an external master’s degree for software engineers from industry. The research of his teams at Oxford pursued an ideal that takes provable correctness as the driving force for the accurate specification, design, and development of computing systems, both critical and noncritical. Well-known results of the research include the Z specification language and the CSP concurrent programming model. A recent personal research goal has been the unification of a diverse range of theories applying to different programming languages, paradigms, and implementation technologies.

Throughout more than 30 years as an academic, Tony has maintained strong contacts with industry through consultation, teaching, and collaborative research projects. He took a particular interest in the sustenance of legacy code, where assertions are now playing a vital role, not for his original purpose of program proof, but rather in instrumentation of code for testing purposes. On reaching retirement age at Oxford, he welcomed an opportunity to return to industry as a senior researcher with Microsoft Research in Cambridge. He hopes to expand the opportunities for industrial application of good academic research and to encourage academic researchers to continue the pursuit of deep and interesting questions in areas of long-term interest to the software industry and its customers.

Note: This biographical sketch was written by Sir Tony Hoare himself and reprinted with his permission. What he does not say is that he received the Turing Award in 1980 for his fundamental contributions to the definition and design of programming languages, and he was awarded a Knighthood in 1999 for his services to education and computer science.

Another variation is to minimize the extreme movements at the spindle and at the edge of the platter. Instead of going to the edge, the heads move only as far out (or in) as the outermost (or innermost) request. Before moving on to tackle the next request, the list of pending requests is examined to see whether movement in the current direction is warranted. This variation is referred to as the LOOK disk-scheduling algorithm, because it looks ahead to see whether the heads should continue in the current direction.
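Here is a Python sketch of LOOK applied to the requests used earlier in this section (49, 91, 22, 61, 7, 62, 33, 35, with the heads at cylinder 26 and moving toward the lower cylinders). The function name look and the direction labels are our own. The heads reverse at the last pending request in the current direction, so the total movement is simply the sum of the moves between served cylinders.

```python
def look(start, requests, direction="down"):
    """LOOK: like SCAN, but reverse at the last pending request, not the edge."""
    if direction == "down":
        first = sorted((c for c in requests if c <= start), reverse=True)
        second = sorted(c for c in requests if c > start)
    else:
        first = sorted(c for c in requests if c >= start)
        second = sorted((c for c in requests if c < start), reverse=True)
    order = first + second
    total = 0
    position = start
    for cylinder in order:
        total += abs(cylinder - position)
        position = cylinder
    return order, total


requests = [49, 91, 22, 61, 7, 62, 33, 35]
order, total = look(26, requests, "down")
print(order)  # [22, 7, 33, 35, 49, 61, 62, 91]
print(total)  # 103
```

The service order is the same as under SCAN, but the heads turn around at cylinder 7 rather than continuing to cylinder 1. That saves the round trip between 7 and 1, which is 12 cylinders in this example.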

SUMMARY

A file system defines the way our secondary memory is organized. A file is a named collection of data with a particular internal structure. Text files are organized as a stream of characters; binary files have a particular format that is meaningful only to applications set up to handle that format.

File types are often indicated by the file extension of the file name. The operating system maintains a list of recognized file types so that it may open them in the correct kind of application and display the appropriate icons in the graphical user interface. The file extension can be associated with any particular kind of application that the user chooses.

The operations performed on files include creating, deleting, opening, and closing files. Of course, files must be able to be read from and written to as well. The operating system provides mechanisms to accomplish the various file operations. In a multiuser system, the operating system must also provide file protection to ensure that only authorized users have access to files.

Directories are used to organize files on disk. They can be nested to form hierarchical tree structures. Path names that specify the location of a particular file or directory can be absolute, originating at the root of the directory tree, or relative, originating at the current working directory.

Disk-scheduling algorithms determine the order in which pending disk requests are processed. First-come, first-served disk scheduling takes all requests in order but is not very efficient. Shortest-seek-time-first disk scheduling is more efficient but could suffer from starvation. SCAN disk scheduling employs the same strategy as an elevator, sweeping from one end of the disk to the other.

ETHICAL ISSUES

Privacy: Opt-In or Opt-Out?^1,2

These terms—opt-in and opt-out—refer to privacy policies. When you sign up at a banking institution, do you want the bank to share your information with other financial institutions? If you buy an item on the Internet from Company A, do you want to receive email from a similar Company B? When you apply for a credit card, do you want to get offers from other credit card companies?

Opt-in says that you must explicitly say you want to share your information. Opt-out says that the information will be shared unless you explicitly say you do not want to share your information. That is, a website’s default is either opt-in or opt-out.

In the United States, the CAN-SPAM Act covers commercial email messages; the E.U. directive covers all email marketing messages in the European Union. The CAN-SPAM Act allows direct marketing to anyone until the recipient requests the email to stop (an opt-out policy: you are included by default and must act to be removed). The E.U. directive says that email can only be sent to subscribers who have given their prior consent (an opt-in policy: you are excluded by default and must act to be included). Companies using opt-out must give the recipients a way to cancel the emails.

The different approaches to privacy, evidenced by how the United States and the E.U. handle email, have surfaced in other areas. One of the authors went online, Googled their own name, and received over 7 million hits in 0.23 seconds. The Spanish government has ordered Google to stop indexing information about 90 citizens who filed a formal complaint. In 2012, the European Commission proposed a “right to be forgotten” regulation. In contrast, U.S. courts have consistently found that the right to publish the truth supersedes any right to privacy.

Parallel questions are: Should the people in pictures posted on the Internet be identified without the person’s consent? Should cyber maps give pictures of a residence without the owner’s consent? In the United States, Facebook announced that it was changing its policy of posting names without consent. In Germany, Google allowed individuals and businesses to opt out of Street View. Approximately 250,000 people have done so. These issues have not been, nor will they be, decided easily or quickly.

KEY TERMS

EXERCISES

For Exercises 1–15, mark the answers true or false as follows:

True
False

1. A text file stores binary data that is organized into groups of 8 or 16 bits that are interpreted as characters.
2. A program written in a high-level language is stored in a text file that is also called a source file.
3. The type of a file determines which kinds of operations can be performed on it.
4. The current file pointer indicates the end of a file.
5. Sequential access and direct access take about the same amount of time to retrieve data.
6. Some operating systems maintain a separate read pointer and write pointer for a file.
7. UNIX file permissions allow a group of users to access a file in various ways.
8. In most operating systems, a directory is represented as a file.
9. Two files in a directory system can have the same name if they are in different directories.
10. A relative path is relative to the root of the directory hierarchy.
11. An absolute path and a relative path will always be the same length.
12. An operating system is responsible for managing the access to a disk drive.
13. The seek time is the amount of time it takes for the heads of a disk to reach a particular cylinder.
14. The shortest-seek-time-first disk-scheduling algorithm moves the heads the minimum amount it can to satisfy a pending request.
15. The first-come, first-served disk-scheduling algorithm moves the heads the minimum amount it can to satisfy a pending request.

For Exercises 16–20, match the file extensions with the appropriate file.

txt
mp3, au, and wav
gif, tiff, and jpg
doc and wp3
java, c, and cpp

16. Audio file
17. Image file
18. Text data file
19. Program source file
20. Word processing file

For Exercises 21–23, match the symbol with its use.

21. Symbol used to separate the names in a path in a Windows environment
22. Symbol used to separate the names in a path in a UNIX environment
23. Symbol used to represent the parent directory in a relative path name

Exercises 24–57 are problems or short-answer questions.

24. What is a file?
25. Distinguish between a file and a directory.
26. Distinguish between a file and a file system.
27. Why is a file a generic concept and not a technical one?
28. Name and describe the two basic classifications of files.
29. Why is the term binary file a misnomer?
30. Distinguish between a file type and a file extension.
31. What would happen if you gave the name myFile.jpg to a text file?
32. How can an operating system make use of the file types that it recognizes?
33. How does an operating system keep track of secondary memory?
34. What does it mean to open and close a file?
35. What does it mean to truncate a file?
36. Compare and contrast sequential and direct file access.
37. File access is independent of any physical medium.
1. How could you implement sequential access on a disk?
2. How could you implement direct access on a magnetic tape?
38. What is a file protection mechanism?
39. How does UNIX implement file protection?
40. Given the following file permission, answer these questions.

        Read   Write/Delete   Execute
Owner   Yes    Yes            Yes
Group   Yes    Yes            No
World   Yes    No             No
1. Who can read the file?
2. Who can write or delete the file?
3. Who can execute the file?
4. What do you know about the content of the file?
41. What is the minimum amount of information a directory must contain about each file?
42. How do most operating systems represent a directory?
43. Answer the following questions about directories.
1. A directory that contains another directory is called what?
2. A directory contained within another directory is called what?
3. A directory that is not contained in any other directory is called what?
4. The structure showing the nested directory organization is called what?
5. Relate the structure in (4) to the binary tree data structure examined in Chapter 8.
44. What is the directory called in which you are working at any one moment?
45. What is a path?
46. Distinguish between an absolute path and a relative path.
47. Show the absolute path to each of the following files or directories using the directory tree shown in Figure 11.4:
1. QTEffects.qtx
2. brooks.mp3
3. Program Files
4. 3dMaze.scr
5. Powerpnt.exe
48. Show the absolute path to each of the following files or directories using the directory tree shown in Figure 11.5:
1. tar
2. access.old
3. named.conf
4. smith
5. week3.txt
6. printall
49. Assuming the current working directory is C:\WINDOWS\System, give the relative path name to the following files or directories using the directory tree shown in Figure 11.4:
1. QTImage.qtx
2. calc.exe
3. letters
4. proj3.java
5. adobep4.hlp
6. WinWord.exe
50. Show the relative path to each of the following files or directories using the directory tree shown in Figure 11.5:
1. localtime when working directory is the root directory
2. localtime when the working directory is etc
3. printall when the working directory is utilities
4. week1.txt when the working directory is man2
51. What is the worst bottleneck in a computer system?
52. Why is disk scheduling concerned more with cylinders than with tracks and sectors?
53. Name and describe three disk scheduling algorithms.

Use the following list of cylinder requests in Exercises 54–56. They are listed in the order in which they were received.

40, 12, 22, 66, 67, 33, 80

54. List the order in which these requests are handled if the FCFS algorithm is used. Assume that the disk is positioned at cylinder 50.
55. List the order in which these requests are handled if the SSTF algorithm is used. Assume that the disk is positioned at cylinder 50.
56. List the order in which these requests are handled if the SCAN algorithm is used. Assume that the disk is positioned at cylinder 50 and the read/write heads are moving toward the higher cylinder numbers.
57. Explain the concept of starvation.

THOUGHT QUESTIONS

1. The concept of a file permeates computing. Would the computer be useful if there were no secondary memory on which to store files?
2. The disk scheduling algorithms examined in this chapter sound familiar. In what other context have we discussed similar algorithms? How are these similar and how are they different?
3. Are there any analogies between files and directories and file folders and filing cabinets? Clearly, the term “file” came from this concept. Where does this analogy hold true and where does it not?
4. Spamming is the Internet equivalent of unsolicited telephone sales pitches. There are laws now that allow a telephone user to request that his or her name be removed from the solicitor’s calling list. Should there be similar laws relating to spamming?
5. In your opinion, is spamming a reasonable business strategy, like regular direct or “junk” mail, or is it a form of electronic harassment? Why or why not?
6. Which approach is better, opt-in or opt-out?
