Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 10. Backing Out of a Blind Alley

	Its not as if there is some killer technology at the protocol or network level that we somehow failed to include. We need to take all the technologies we already know and fit them together differently. This is not about building a technology innovation that changes the world but about architecture....
	--Dave Clark, MIT, 2005

Introduction

The Preface noted seven fundamental questions uncovered in the early forays into networking. Interestingly, these questions were lost in the ensuing conflict over protocols and remain unanswered 30 years later. What happened? As the 20^th century drew to a close, the excitement over the Internet had begun to wane, with prominent Internet researchers finally beginning to realize that research was stagnating. In 2001, a study into the state of network research by the National Research Council (2001) made a jarring and embarrassing observation:

A reviewer of an early draft of this report observed that this proposed framework—measure, develop theory, prototype new ideas—looks a lot like Research 101.... From the perspective of the outsiders, the insiders had not shown that they had managed to exercise the usual elements of a successful research program, so a back-to-basics message was fitting.

An entire field’s research program being so strongly criticized for losing sight of the fundamentals? Outrageous! But worse, it was becoming more and more apparent that the current Internet architecture was running out of steam. There were severe problems with scaling, security, addressing, routing, and so on; and although there were endless proposals for fixes to specific problems, the fixes were just that: more band-aids, not fundamental new insights. They tended to further complicate the structure. There was a crisis in fundamentals. DARPA funded MIT and ISI to convene the NEWARCH project, consisting of prominent researchers from those two centers and elsewhere. After two years, their final report (Clark et al., 2003) essentially came up dry.^[1] Sessions on new architecture have been held at various conferences, but the results were always the same. It became so bad that proposing new architectures became the subject of satire. Then with great fanfare, NSF discovered the issue. A grand effort to build an infrastructure (how does this differ from the rationale for Internet2?) to investigate the major questions plaguing the Internet today (a variation on trying all kinds of cannons?) was initiated, along with a new initiative to find a new Internet design. “Clean-slate” has become the new hot topic. But if their announcements are any indication, the slates haven’t been erased well, let alone washed. Few of the problems it sees as important seem to be any of our seven fundamental questions. In the one or two cases, where they have come close to a key issue, they have failed to recognize it as having fundamental implications. There seems to be an inherent assumption that the foundations are firm (leaving the slate less than clean). It is not clear whether this is a blind spot or the old guard protecting its legacy. But so far, the grand effort seems to be recycling old wine in new bottles. Perhaps, it has become just another source of funding. A new take on Alyosha’s dream: We can’t actually solve the problem or the funding will dry up. But there is more going on here. We must look at the socioeconomic forces that got us here. So, where do we stand with our fundamental questions?

Consolidation and the Next Generation

Going back to the chronology in the Preface, the first thing that hits one is that none of the outstanding issues have been resolved, with the exception of replacing NCP with TCP, and that is now clearly showing its age. We can see now that TCP was designed to address the immediate problems of the day and had little ability to adapt to the future,^[2] as indicated by the proliferation of transport protocols within the IETF. The Internet is at its core an unfinished demo.

Then why is the Internet seen as such a success? Basically five reasons: abstraction, insensitivity, Moore’s law, military spending, and hard work.

The layer abstraction hides most of the problems from those who use the Net. This was true of the ARPANET, too, and contributed strongly to its success. Only a relatively few people have to deal with the underlying issues; the vast majority are given the illusion that it simply works.
For better or worse, one property of computer systems is that they are relatively insensitive. As some software manufacturers have demonstrated very successfully, you can make almost any piece of sh*@ er, sorry, software work, no matter how bad. And a prime contributor to this insensitivity is....
Moore’s law. Moore’s law has provided the cheap computing power to keep the problems at bay. This is especially true for the routing and addressing problems we faced.
ARPA willingly spent money overprovisioning the early Net. In the early 1970s, 56K lines were enormously fast and expensive. No other experimental network built at the time used such high-speed, long-distance lines (most were 9.6K). Although tests could stress the network, normal usage never came close to a significant load. One cannot overstress the role that this overprovisioning played in the perception of that early success and in masking unsolved problems.
And, the intense nonstop efforts of thousands of engineers to keep it going.

Many people point to this success as proof that the Internet suite of protocols is a paragon of technical achievement and that there is little fundamentally wrong with the Internet. This is the same argument that apologists for DOS used to justify its technical excellence. To quote Stan Lee, “‘nuff said.”

If one looks carefully over this chronology, one is inexorably drawn to the conclusion that the Internet did not begin to stagnate at the turn of the century, but in the late 1970s. It is here that conservative behavior begins to take hold. It is here that each juncture is not seen as an opportunity to fulfill the earlier vision or to synthesize new insights with previous achievements to create a new innovative direction, but more the minimal change is made to solve the immediate problem and go no further. It begins to seem that people are the keepers of some flame and dare not tamper with what has been entrusted to them, the classic behavior of a “second generation” i.e., not the founding generation. Some will argue that this was the policy of small incremental change that has been a stalwart of Internet development. And they are right; caution in maintaining a production system is a necessity but a slow death for research and the research was far from done. But isn’t that just what one would expect from the inheritors of the flame? Is that just a rationalization of a much deeper malaise?

We can now see that after 1975 we were entering a period of consolidation. We were implementing TCP,^[3] the network was expanding, but no efforts were creating new requirements on the Net, efforts that would have forced us to look deeper at some of the things we hadn’t done in that first push.^[4] All the while, Moore’s law was at work pushing the edges further and further out, making problems less and less pressing. In addition, there was never a clean split between operations and research. In some sense, the researchers never let go of their network, but at the same time it was operational supporting more and more people who were not network researchers who could live with the Net being a little flaky on a Tuesday morning. So researchers had to make incremental changes. There was no place to try out major new ideas. Although a period of consolidation was probably necessary and unavoidable, without new developments in other areas the Internet was essentially marking time. Each change becomes permanent. Any attempt to addresses shortcomings of a previous change must be built on top of it. There is no replacement of modules by new solutions as with most products. This has more in common with biological evolution than incremental engineering. We know that with biological evolution, the vast majority of trials (species) reach a dead-end and become extinct. For nature, this is not a problem; it does millions if not billions of trials. We don’t have that luxury.

This mode of operation was reinforced by newcomers to the Net who were mostly nonresearchers and lacked the broad background of the initial developers and who finding a working system (and few alternatives to look at) saw the existing structure as the only way to build a network (early imprinting works again). Myths grew up as to how various stopgaps, kludges, and shortcuts were wonderful pieces of forethought and design.^[5] These myths became “design principles,” making them virtually impossible to overcome and return to the original vision. More than once, the decisions of the early Net have been described as if “handed down on stone tablets.” This was further reinforced by textbooks, which described what existed as the best or nearly best solution instead of doing the extra work of describing the principles along with particular implementations.

Where Are the Ivory Tower Academics?

I am afraid that much of the fault here lies with academia for not distilling principles and insisting on intellectual discipline. Engineers building product and keeping a network running can’t be expected to be working out fundamental principles. But those teaching should be. They are really the only ones in a position to do it. They should ensure that students have a deeper understanding of what is going on, of the principles that can be derived from our successes and our failures! (The failures are more important than the successes.) It wasn’t something that everyone had to do, but it also couldn’t be no one. For example, the fact that not a single networking textbook discusses what must be named, only the format of names, is unconscionable and is a major contributor to our current crisis. I have met many professors, “experts” in networking, who were unaware that an IP address named the interface, not the node.

Reflecting on the events of the past 35 years, I have come to suspect that not pursuing the upper layers, the squelching of USING (the Users Interest Group, see Chapter 4, “Stalking the Upper-Layer Architecture”) may have been a crucial juncture. I had always thought the upper layers should have been explored in the mid-1970s, mostly because there were all of these “neat” things we could use the network for! (The engineer in me.) It was the next obvious thing to look at. But more recently, I have begun to suspect that it was more than just doing the next neat thing. That not pursuing the upper layers removed the driver that would have forced us to complete the architecture; forced a complete addressing architecture; forced greater attention to security, more requirements on the lower layers, and probably many other things. Growth alone was not enough with the effects of Moore’s Law. With no impetus, there was no reason to change, especially given the “second-generation” effects.

In addition to the internal social forces, there were external forces the intensity of which no other effort in computer science (or perhaps any science) had ever been subjected to. Starting in 1974 and 1975, we became aware of the PTTs’ effort to standardize packet-switched networking with X.25. This was our first foray into the world of electro-political engineering and power politics. We were rank novices compared with European researchers, but we learned quickly. Unlike today, CCITT meetings were open only to telephone companies and countries.^[6] For any U.S. manufacturer (other than Western Electric) to attend, they had to be accredited by the Department of State. This began the ongoing war between the “beads-on-a-string” and layered approaches to networking and the incredibly intense connection/connectionless debate.^[7] There was much more at stake for the Europeans. With no Carterphone decision in Europe, the European PTTs could claim that the only equipment that could be attached to their data networks would be theirs. In other words, you had to lease their computers, just as you did their telephones. Research was being pushed into a world of power politics at a very early stage. And not just the power politics of standards; there was also immense pressure to commercialize packet switching and, very soon after, LANs.

Packet-switching networks had always been the underdog, both scorned and feared by the phone companies, beginning with the ARPANET demo at the ICCC conference in 1972. Scorned because the bellheads had been building networks for a 100 years, and the Net flew in the face of everything they knew. Feared by the engineers because computers were at the heart of it, and they didn’t really understand this new world. Feared by management because the layered architecture, and the transport layer in particular, relegated the phone companies to what was clearly a commodity business (unless, of course, they wanted to leave their cushy monopoly for the rough-and-tumble world of competition). In 1975, they saw no reason to. It would be easier to squash the upstarts, or just let it die out when it was realized that it didn’t work very well.

The tensions between the connection and connectionless camps began to heat up. From the beginning, these were some of the most emotional arguments I have ever witnessed or participated in. The first attempt to affect the outcome of X.25 had only modest results.^[8] The connection model held out a possible escape from the commodity trap with so-called value-added networks. The PTTs believed that connectionless was an academic infatuation that would fade away (or at least they hoped). Several PTTs were already well on their way to building networks and were not going to change very much. Because the PTTs were monopolies in most countries, the attitude that customers had to buy what they decided to offer was common.

If communications was going to use computers, it was a new market for the computer industry, and they weren’t going to miss it. The computer industry recognized that they would never have a voice in the CCITT. So, they attempted an end run by creating a network standards effort in ISO, a nongovernmental voluntary standards body. OSI was formed in 1978 over the objections of the traditional ISO data-comm committee.^[9] While OSI was initiated by the proponents of connectionless, the European computer companies wanted and needed rapprochement with the PTTs. Very quickly, with the aid of AT&T, the CCITT effort in this area was made a joint effort between ISO and CCITT. This was unavoidable and at the same time sealed the doom of OSI. On the one hand, a competitive effort in CCITT would have severely weakened any computer industry effort, especially with no inkling of deregulation on the horizon. At the same time, the difference in paradigms was irreconcilable, at least given our knowledge at the time. No one could see a synthesis of the two, and “professional” standards committees did not do the kind of elegant synthesis seen in the early APRANET, but just jammed two (or more) contending approaches together and called them options.

Beginning in 1974 as the International Network Working Group (INWG) and later becoming IFIP WG6.1, INWG broadened the scope of similar groups within the ARPANET, NPLNet, and CYCLADES, along with other researchers. 6.1 had been working on an international transport protocol as well as doing work on a VTP that was a very elegant example of extensibility and the beginnings of the application of FDTs to protocols. Just before OSI started (March 1978), INWG chose a revised CYCLADES TS as the international transport protocol over the NPLnet protocol and TCP (Cerf, et al. 1978). This output was carried into the OSI process as an IFIP liaison contribution and became the basis for the OSI Transport Protocol Class 4 (TP4). Soon after this DARPA researchers ceased participating in OSI. Over time, this separation led to the Internet perception of the OSI connetionless work as a rival rather than an ally against the phone companies.

Those Crazy Americans!

A story has circulated in the past few years that the DoD offered TCP to OSI in the 1980s and they turned them down. This is offered in the vein of a “missed opportunity.” The story is true, but not complete. From the OSI point of view, they had already accepted Vint Cerf’s and the DoD’s recommendation years before as the output of the open IFIP WG6.1 process. (Cerf and McKenzie were the first two of the four authors of the report.) Years later, when the DoD brought TCP to OSI, the Europeans chalked it up to yet another case of the American’s right hand not knowing what the left hand was doing.

The resulting separation of the Internet participants from OSI led to essentially a three-way competition between the connection-oriented PTT proponents, the Internet, and the OSI connectionless proponents who saw their effort as essentially the next step in developing connectionless networking. It was clear by the late 1970s that DARPA no longer had an interest in continuing the revolution it had begun. If any of the open issues were going to be resolved, it would have to be elsewhere. OSI offered the only forum where there was a chance to resolve some of the issues the ARPANET/Internet had not had a chance to do.^[10] However, with the confusion created by the warring connection/connectionless factions, it didn’t matter what OSI was trying to do; whatever advances it made were completely obscured by the internal conflicts. The strife within OSI was not limited to connection versus connectionless, but also Japan versus Europe versus the United States, and the computer industry versus the phone companies and everyone against IBM.^[11] The push for OSI was signaled in early 1982 when IBM changed the diagrams in its advertising of SNA from five layers to seven.

After IBM’s endorsement,^[12] the desire to lead the commercialization of OSI yielded many interests who all had their own agendas, fighting over who would get a piece of the pie, who would essentially control its implementation: NBS/NIST, MAP/TOP, EMCA, COS, DEC, IBM, and others. IBM played them like a violin. It was truly magical to watch.^[13] IBM played them not by direct action or pushing specific proposals but by creating an environment that would maximize confusion and complicate whatever the result. IBM played on the egos of a few key people to create new initiatives, separating the designers from the implementers (a standard organizational tactic to kill a project) by encouraging the formation of implementer’s workshops distinct from the standards organizations.^[14] As it usually turns out, the designers had a much better idea of how to implement than the implementers did.

The implementation profiles were supposed to be a means around the connection/connectionless dichotomy. Different environments would use different profiles. It merely served, as predicted, as a forum to rehash the same arguments in different groups, adding to the confusion. The dichotomy in OSI immensely complicated its output. Because vendors were unwilling to bet on the outcome, they felt they had to implement the whole thing. As with all organizations, it is the internal problems that are the most deadly. There were just too many disparate camps under one umbrella, too many wanting to “lead” OSI, and no common direction for OSI to have ever succeeded.^[15] And while there were elements buried in OSI that would have advanced the state of the Internet, it would have been impossible to extract them in the political climate, and there were still many things that were not “right.” The lesson to take away from the OSI experience is plain: No matter how reasonable it may appear, collaborating with the legacy will lead to failure. If the new model does not have enough advantage to stand on its own, it does not have enough to stand with the old guard either.

Probably the greatest beneficiary of all this turmoil was the Internet, which for the most part was left out of this multisided conflict. The Internet had been left to its own devices and was in the best position to push the technology ahead while everyone else was distracted. But, this did not happen. At the same time, the isolation had its own effects when the IPng effort brought them together again. By this time the Internet had rewritten its history to reflect its crowning achievement, having lost sight of the list of open issues and that the Net was really an unfinished demo that had never been “productized.”

What was required was a true synthesis of connections and connectionless, like the syntheses that the early ARPANET had been so good at pulling off. However, the political situation was so intense and the problem so much more difficult that it was hard to find the middle ground.^[16] Instead, a bunker mentality set in: each group unwilling to bend for fear of losing the whole thing. One cannot stress too much the intensity or affects of this bunker mentality on all three camps. A less-intense environment might have been able to find the synthesis, but it was impossible in the environment that existed.

It is also worthwhile to bring up another factor that is sensitive and embarrassing for many of us: immaturity. Although there were many “standards professionals” who knew the ropes and understood how to get things done, the vast majority of participants in all three camps were young, bright, hotshots new to the game. We had been weaned on the kind of raucous design sessions that were prevalent in the 1970s and 1980s, and I am afraid that our behavior often contributed poorly to the process. Often ridicule and loud abusive language was substituted for logic and reason and trying to understand the opposing view. The idea of winning the position at whatever cost overrode any need to “get it right.” Although it has moderated considerably, a strong current of intolerance and lack of listening still characterizes these discussions on all sides (often compared to religious fervor).

New Architecture and the IETF

Clearly, one contributor to these problems has been a misconception about what the IETF can accomplish. It is fundamental that all committees behave conservatively. (James Madison was one of the earliest to realize this property.) This is not where new understanding will come from, nor should it. The job of the IETF is to ensure the stability of the operational public Internet.

But for the problem we are concerned with, the situation is worse. It is very hard for organizations to change culture. The history of business has shown that the few companies that do succeed do so by doing what they were founded to do, but very few are able to adapt to do the “next thing.” The drag by the original proponents is generally sufficient to deflect or diffuse the new effort. It has always seemed that making such changes should be easier for companies being driven “top down,” but we know they aren’t. It is orders of magnitude more difficult, if not impossible, in standards organizations, that are directed bottom up. This alone makes it virtually impossible for a committee to radically change its culture or pursue a radically new direction. Too many participants have too much invested personally and commercially to want to see it displaced. This implies that should a new direction be found, it must be done by a different organization. There are too many interests in the status quo for such a change to succeed in the IETF. Trying to do a new architecture within the IETF would be worse than OSI bringing in the PTTs. It would make that look like a tea party. Anyone who demurs is either hopelessly naïve or represents such an interest.

So, is there a way forward for the IETF and for the Internet? There might be. However, most participants see the Internet as a string of crowning achievements, not as a list of missed opportunities. This has to change. Until a new crop of young Turks comes along who are willing to look at the efforts of their seniors and find them lacking, I am afraid we are stuck. Where is Oedipus when you need him?

If OSI buffered the Internet from the bulk of the turmoil with the PTTs and the connection/connectionless debate, how does any of this affect the Internet? This is one of the oddest effects in this whole story, and one I still don’t understand. The Internet participants developed a deep visceral aversion for anything associated with OSI, which exists to this day. Much of what is believed about OSI is not based on firsthand knowledge and is often simply wrong. It is an odd reaction for something that was doomed to fail on its own.

But as I noted in Chapter 5, “Naming and Addressing,” the oddest reaction is that if OSI did something, then the Internet wouldn’t. The reaction wasn’t, “They didn’t understand it; let us show you how to do it right.” The reaction was to do anything else but (hardly the behavior of professional scientists or engineers and often resulted in “cutting off their nose to spite their face”). We saw this with the decision to continue to name the interface in IPng. Something we had known was wrong since 1972 in the ARPANET. And something like this may have been at work in the rejection of an object-oriented approach for network management that chose SNMP over HEMS.^[17] As discussed in Chapter 4, SNMP was a step backward from even the rudimentary IEEE 802.1 protocol that preceded it.

We also see it with respect to the concept of layers. Internet proponents try very hard to avoid layers, primarily because it is strongly associated with OSI, although it is doubtful any would admit it. However, they can’t completely forsake them. To do so would put them squarely in the beads-on-a-string camp. Recently, however, it does seem that some prominent Internet researchers have lost faith with the connectionless model, just when a renaissance is in the offing. Many papers have been written trying to get rid of layers. Most still assume that a layered architecture implies a particular implementation strategy. And they assume that the only possible layers are the ones arrived at in the early 1970s. Both of which we have seen in this book are not the case. They have de-emphasized interfaces to the point that the lack of clean interfaces was a major source of the work in doing IPv6, and the interfaces provide little or no abstraction. They have tried to concentrate everything in the network and transport layers to the point that the Internet architecture is more beads with stripes than layered, with TCP/IP rather than X.25/LAPB.

It is easy to see how in all the turmoil, research has suffered.

From reading the popular press, one has the impression that the Internet is an amazing intellectual achievement, a triumph of American innovation. But a close look reveals a quite different picture. There are only two major innovations: the central one, Pouzin’s connectionless insight for a French network;^[18] and Saltzer’s application of operating system naming to networks. Technologically, the Internet has been languishing since the mid-1970s, creating one band-aid after another as it rode Moore’s law around the hard problems facing them. When OSI self-destructed from its internal contradictions and the corporate squabbling over whom would set its agenda, the Internet, much to its own surprise, was the only thing left. And the Internet was only standing because of the efforts of Al Gore and others to make it freely available in the United States. So in the end, it was more politics and economics that determined the success, not the technological innovations.

Maybe Al Gore did invent the Internet!^[19]

Maybe it is time we quit loafing and get back to doing some serious intellectual work, if we still know how.

How Did This Happen?

The short answer is a combination of effects. The push to commercialize networking came on very fast, before the science was really understood. The business world and the research community to a large extent assumed that the “productization” of networking would occur in commercial products. And academia was intimidated from exercising its responsibility to distill fundamentals by the intense flame wars that erupted when pointed questions were asked. “Facts” were often articles of faith, and desires were couched as principles, and wonderful technical doubletalk was created to obscure the lack of hard evidence. By the time it became clear that the Internet was to be the focus of those products, no one in the Internet remembered (or ever knew) that it had not been finished.

The effect of business cannot be underestimated. The press to commercialize networking had a tendency to freeze solutions much too early. But by the same token, the public success of this commercialization led to research taking up the pattern of venture capitalists looking for a quick ROI. Each agency or program director measured by how much they are on the “bleeding edge.” This has led to research programs that are not so much concerned with solving problems as funding for two or three years, declaring victory and moving on to the next big problem. (Clearly, the “bleeding edge” can’t be the same as three years ago.^[20]) Meanwhile, researchers must adjust their research program to the new fads to keep the dollars on which they are measured rolling in. No one is measured on solving problems and producing science. This leads to a proliferation of technique, with no theory to distill them and to hold them together. And with no theory, there is no way to determine how good the results are.^[21] Note that in other sciences, the fact that they are trying to understand an external object (e.g., nature) to a large extent keeps this from happening. It is only in computer science, where we don’t have nature, that this seems to occur. We have nothing to keep us close to Clauswitz’s “proper soil.” It seems that it is as important to keep practice close to theory as it is to keep theory close to practice! As in any science, it is the boundary conditions that are the most enlightening. Computer science provides an interesting boundary condition to the “philosophy” of science.

Much more damaging was the political milieu that networking found itself thrust into. Almost immediately (within just a few years of its first proposal), the concept of the connectionless network was under relentless attack by the PTTs. This led not only to a bunker mentality, but also to a feeling of being a “chosen” elite initially, because the Net was so much more advanced than anything the PTTs could do, and later with the exponential success of the Web. In the 1970s, this was the camaraderie of excitement about new ideas. In less than two decades, this perception had become institutionalized, so that today the mindset of the Internet bears a stronger resemblance to the mindset of the PTTs of the early 1970s than to the ARPANET of the early 1970s. Our textbooks present the Internet as the only network. (Just as 40 years earlier, the study of telecommunications was the study of how the telephone system worked.) Rationales are invented to portray what was done 30 years ago as near perfect, rather than our partial understanding at the time. Even when it is recognized that a “Copernican revolution” in networking is required, it is with the caveat not to change what we have. What sort of revolution is that?

There are uncanny parallels elsewhere in science. In Lee Smolin’s recent book, The Trouble with Physics (2006), he describes the “seven unusual aspects of the string theory community:”^[22]

Tremendous self-confidence, leading to a sense of entitlement and of belonging to an elite community of experts.
An unusually monolithic community, with a strong sense of consensus, whether driven by the evidence or not, and an unusual uniformity of views of open questions. These views seem related to the existence of a hierarchical structure in which the ideas of a few leaders dictate the viewpoint, strategy, and direction of the field.
In some cases, a sense of identification with the group, akin to identification with a religious faith or political platform.
A strong sense of the boundary between the group and other experts.
A disregard for and disinterest in the ideas, opinions, and work of experts who are not part of the group, and a preference for talking only with other members of the community.
A tendency to interpret evidence optimistically, to believe exaggerated or incorrect statements of results, and to disregard the possibility that the theory might be wrong. This is coupled with the tendency to believe results are true because they are “widely believed,” even if one has not checked (or even seen) the proof oneself.
A lack of appreciation for the extent to which a research program ought to involve risk.

This is a striking list. Smolin could be describing networking rather than theoretical physics. He notes that sociologists recognize these characteristics as groupthink and quotes Irving Janis of Yale as defining groupthink as “a mode of thinking that people engage in when they are deeply involved in a cohesive in-group, when the members’ strivings for unanimity override their motivation to realistically appraise alternative courses of action.” He goes on to quote a description of groupthink excerpted from an Oregon State University Web site dealing with communication:

Groupthink members are themselves as part of an in-group working against an outgroup opposed to their goals. You can tell if a group suffers from groupthink if it

Overestimates its invulnerability or high moral stance
Collectively rationalizes the decisions it makes
Demonizes or stereotypes outgroups and their leaders
Has a culture of uniformity where individuals censor themselves and others so that the façade of group unanimity is maintained
Contains members who take it upon themselves to protect the group leader by keeping information, theirs or other group members’ from the leader

As Smolin says about theoretical physics, this doesn’t match up one for one with what we see in networking, but it is close enough to be worrying! And as Smolin points out, the in-crowd of networking will have no problem countering these critiques. But all one has to do is look at the contributions surrounding the current NSF initiative to see this groupthink at work. They are all remarkably similar, and yet none get at the core of the problem or strike out in a new direction. It would appear that the groupthink arose as part of the bunker mentality created during the “protocol wars” of the late 1970s and 1980s, when the connection versus connectionless battles were most intense. The attitude has persisted. Note how some commentators still drag out the OSI bogeyman even though it is long dead. This is reminiscent of a tactic used by countries who create an incident with a traditional rival to direct attention away from their own internal problems. Part of the problem in networking is that we now have generations of engineers and professors who have never seen anything else and don’t seem to be able to imagine another way to do things; reinforced by luminaries saying there is no point in changing the underpinnings, even though the underpinnings are deeply flawed (our seven unanswered questions). To not question the underpinnings is not only intellectually dishonest, contrary to the scientific tradition, but is building on sand!

Smolin goes on to attribute this to the increased professionalism of the field, and the same thing seems to apply to networking. He distinguishes between master craftsmen and seers. Smolin characterizes them as follows:

Master craftsmen go into science because, they are good at it. They are usually the best students in their math and physics classes from junior high school all the way up to graduate school, where they finally meet their peers. They have always been able to solve math problems faster and more accurately than their classmates, so problem solving is what they tend to value in other scientists.
Seers are different. They are dreamers. They go into science because they have questions about the nature of existence that schoolbooks don’t answer.

I would refer to Smolin’s seers as “theoreticians.” These are people who need to get to the bottom of things to understand the deep structure and assure themselves that everything is based on something firm, to find the similarities, the invariances. A recent article (Kramer, 2007) in CACM touched on this in lamenting the decline of abstraction in computers science. We can see how networking (and to some degree computer science) has been selecting for the excellent programmer (master craftsman). But reading between the lines of Kramer’s paper, one realizes that the characteristics of the super-coder are essentially antithetical to the characteristics of someone good at abstraction and theory.^[23]

Looking back over the events chronicled in the Preface, we see the work of master craftsmen. At each juncture, a stopgap patch is found. There is no consideration of how it fits into a theory or whether it does. No consideration is given to how this sets things up for the next step because there is no vision of the goal. Only once does a theoretician insert an idea (Saltzer), and it is lauded but studiously ignored.^[24] Smolin traces this distinction back to Kuhn’s Structure of Scientific Revolutions (1968) and points out that master craftsmen are precisely what is required during periods of “normal” science: the period in which a new paradigm is established and is being explored and exploited. But when the new paradigm becomes the old paradigm and is showing its age, it becomes clear that a new paradigm is necessary; a different perspective is necessary.

It is when a new paradigm is needed that a period of revolutionary science has been entered, when Smolin’s seers are needed. To find a new paradigm requires seeing new distinctions, questioning everything, looking carefully at what is really going on. Master craftsmen are not suited to finding the new paradigm. Their characterization of the problems tends to look at the edges, at the surface. Their reaction tends to be to start building things, anything, in the hope that it will be better. It is disconcerting to realize that a paradigm one learned as a student and that has always appeared hugely successful and that the world has heralded as a wonderful achievement (perhaps that one has built a career on knowing its ins and outs) will not meet fundamental needs and must be replaced. After all, the paradigms’ success was based not on its own merits but elsewhere (Moore’s law and politics). But this is where we find ourselves.

That Blinding Flash

Many times over the past 35 years, I have seen a new insight put forward only to watch it be greeted with “Oh, yeah, now I get it!” but of course they didn’t get it, and then watch them run off to do what they thought they heard, only to get it wrong. Then, of course, it is the idea that is bad, not that they lacked the subtlety to get it right. There are a million ways to get a concept wrong but only a few ways to get it right. There are hints that possibly this was at work with connectionless, too. We saw its immediate elegance, but we failed to dig further to gain a better understanding of it, inhibited in part by the attacks of the beads-on-a-string forces and the need made unnecessary by Moore’s law.

In our case, however, it appears the craftsman took over before the new paradigm shift was complete. In 1974, we were just beginning to get an understanding. Saltzer tried to jump-start a look at theory, but we lacked a driver. It would seem that we have a case of arrested development. It is interesting to note how various stalwarts of the Internet—small incremental change, the emphasis on running code, the rejection of technical sophistication—are all characteristics of an artisan tradition rather than a scientific one. There is science, actually more engineering, in specific topics, but overall the tradition has become artisan, dedicated to the development of technique, not to comprehensive theory.

We have an added problem. Hundreds of millions of people rely on our unfinished demo. But they are just a drop in the bucket of what is to come. There is no way that the current model will provide what is needed for the next 50 years or more.^[25] The change has to come, and the sooner the better.

In most fields, the fundamental science comes first, at least a basic understanding. First, we understand how nature works and predict what it will do, and then we figure out how to build things that exploit that knowledge, i.e., engineering. It seemed that in networking we were doing the engineering but not the science. But before condemning too quickly, it is good to remember that in computer science, we build what we measure. Not only does this make it difficult to separate artifact from principle, but it also means we must build something (engineer) before we can measure (science).^[26]

Networks present a much different problem than our experience with, for example, operating systems. With operating systems, it is relatively easy for many different approaches to be tried. All one needs is a machine. For a network, one needs considerably more. By the time Multics and VMS were designed (usually considered the canonical bases for most modern operating systems), there had been at least 20 or 30 different operating system designs tried and explored. We might say that Multics and VMS are roughly v30 of operating system designs. With networks, it is much more difficult to try different approaches. At most, we had XNS, CYCLADES, NPL, ARPANET, and the few X.25 networks to draw on, and they were all small networks. This is not to say that we still would not have done something fairly close to what we have, but we had much less experience with the problem space when we settled on what we do have. And it was much more difficult to try other forms. Even doing just one new protocol was (and is) a major effort, let alone a new architecture.

When we step back and look at what we have, given what we have seen here, it is apparent that what we have is closer to DOS, what we need is Multics, but we would probably settle for UNIX.

Very quickly, research on networks became more “engineering the Internet.” How many good ideas have been squashed by “But how would it work in the Internet” or “You could never do that in the Internet.” When did good research start to be predicated on whether it could be deployed in a single product? The telephone network prior to the 1970s? With that attitude there would never have been an Internet. Little work has been done to understand general properties of networks, to understand the deep structure and how architectures other than the one we have may work. Basically, we have been engineering and not doing much, if any, science. There has been too much emphasis on the next neat whiz-bang thing, too much emphasis on the latest hot topic, too much emphasis on academic research as angel funding, and not enough emphasis on science on what these new hot topics tell us about general principles.

The difference between science and engineering is crucial. When most people think of “doing science,” their first thought is of experimental method, as we saw in the NRC report cited at the beginning of this chapter. But experimental method is a key element of both good science and good engineering. This is not what distinguishes science and engineering. The crux of the difference between science and engineering is that science is concerned with understanding, whereas engineering is concerned with applying that understanding to real-world situations. One might say that engineers are fascinated by exploiting the differences, whereas scientists are fascinated by the similarities, with finding the invariances. The Holy Grail of every scientist is to do for his field what Euclid did for geometry. What Newton did for mechanics. What Maxwell did for classical electricity and magnetism. What Mendeleev did for chemistry. Accomplishing the goal matters less than striving for it. We have lost sight of that in our corner of computer science.

Previously in computer science, this kind of abstraction and theory was common. Everyone knew the basic results of automata theory, for example. Today one would be hard pressed to find a computer science graduate who had ever heard of the halting problem, let alone knew what it means. Perhaps this is because everyone has been trained in a field where theory and abstraction is part of the tradition. But some bad experiences with theory run rampant, and a continuing push for applicability, not to mention visions of dollar signs, has seen a waning of theory (as well as appeals to futility, as in “X is 80% of the market. Why worry about knowing what is the right way?”). It has been decades since theory has been seen in networking. Maybe I don’t read the right journals. But if it exists, it isn’t in the mainstream, and it needs to be. Everyone needs to be aware of how what he is doing fits into the bigger picture (or doesn’t).

The Importance of Theory

We have seen another example in the history of science where a rich and successful scientific tradition stagnated and lost information. For millennia, China had such a successful scientific tradition that China was ahead of Europe, sometimes by centuries. The Chinese developed Pascal’s triangle 300 years before Pascal, and Western medicine did not surpass the cure rate of Chinese medicine until the beginning of the 20th century. These are just two examples of the sophistication of the tradition. But ultimately, Chinese science stagnated. In his multivolume magnum opus, Science and Civilization in China (1965), Joseph Needham concludes that the reason for stagnation was that the merchant class was low in the social hierarchy. Without merchants to create demand for new innovations, the need for progress declined when the power structure lived as well as it thought it could. Half-jokingly, one could turn Needham’s argument around to say that it was a strong reliance on government funding that caused science to stagnate in early China. China was a hegemony, a very strong one. If you were in favor with the emperor, everything was golden, if not.... This was in contrast to Europe, where there was no hegemony. When Galileo first got into trouble with the Pope, he went to teach at Padua, then under the control of Venice, a major power that could afford to be independent. There was little the Pope could do. In Europe, if new ideas caused political problems and made things a bit untenable, there was always someplace to go.^[27] My enemy’s enemy is my friend.

But there is more to the stagnation of science in China. It is sometimes difficult to compare Western science with what one finds in China. Much of what is in Needham’s comprehensive survey (seven volumes, some volumes consisting of multiple books, all quite copious) is more technology than science. Many of the advances came out of an artisan tradition, as they did in the West during this period. But unlike Western science, they stayed there. This belies what is perhaps the greatest difference between Western and Chinese science. Although Needham recognized the lack of theory, neither he nor other historians have assigned much importance to it as a factor in maintaining a vibrant scientific tradition. The most significant difference is that more than a lack of theory it is that

China had no Euclid.^[28]

There was essentially no tradition of theory in Chinese science, and certainly not axiomatic theory, of systematizing the results. Needham points out that the body of knowledge represented by Chinese science was more a set of individual techniques than an organized corpus of knowledge.^[29] There was no attempt to develop the kind of overarching theory to integrate a set of results into a comprehensive whole that has characterized Western science. It is hard to find much evidence of attempts (outside of astronomy) to develop predictive theories, or anything analogous to the discipline of proof found in Western science.^[30] The pressure of commerce alone would generate new results but not the impetus for consolidation that comes from theory. As we have said, the Holy Grail of every scientist is to do for his field what Euclid did for geometry: reduce it to a small number of concepts from which every thing can be derived.^[31] Early China is not the only example. There is definite evidence that artisan or craft traditions have a tendency to become insular and protective—more concerned with perpetuating the guild than invalidating its raison d’être. In other words, they stagnate.

Working toward a comprehensive theory, even if one is not found, is beneficial. We seem to have fallen into a bad habit that our only criteria for a solution to a problem is whether the code can be made to work in a reasonable amount of space.^[32] Every time we solve a problem, we should consider how it fits the current theory. If it doesn’t, we need to ask which is wrong: the solution or the theory. What is it we don’t understand? As different models are proposed and tested, a deeper understanding of the problem domain is achieved, even if the proposed model is wrong. It will get better. A good theory is, at the very least, a good mnemonic. There is less to remember. One only need remember the central concepts, a few intermediate principles, and roughly how things relate, and everything else can be derived. Many major results of Chinese science were forgotten, some because they weren’t needed very often. For example, when Matteo Ricci first entered China at the end of the 16th century, he was convinced that he had brought the knowledge that the Earth was round (Day, 1995). As it turns out, the Chinese had determined this 300 years earlier, but the knowledge had been lost.^[33] We are seeing similar loss of knowledge today in computer science as the same solutions are published as new, as the technological pendulum swings back and forth.

Without a unifying theory to simplify knowledge, the amount of information was eventually overwhelming. But theory is much more than just a mnemonic. A theory, even a partial one, leads to deeper understanding of the phenomena being studied and to further insights. Proposals of a unifying theory (even if unsuccessful) raise questions relating disparate phenomena that would not otherwise arise. When there is a theoretical framework, results can be derived that were far from obvious before. Theory not only provides a simpler, more logical explanation, but it also has a tendency to simplify individual techniques (making them easier to understand and apply). Many techniques coalesce to become degenerate cases of a more general method. To see this, one only need read accounts of electricity and magnetism before Maxwell, chemistry before the periodic chart, or geology before plate tectonics.

It is the individual discoveries combined with the search for an encompassing or simpler theory that is the essential tension that gives science direction and allays stagnation. Theory questions the meaning of the observations and techniques that make it up. Theory points at experiments that test its veracity and attempt to invalidate it. Theory points at its own limits. Theory becomes a map of our knowledge and thus pushes us toward better theories.

The processes that have been operating on our science are creating a situation similar to the one found in China of the late Ming Dynasty. Network research has come to this juncture by a different route. Hopefully, it will not have the same end. While we have innovation being driven by the merchant class, they are looking for a quick ROI, i.e., technique, not great leaps forward. To compound the problem, research funding is flitting from fad to fad every few years, generating techniques but giving theory little time to gestate. This, in turn, has a strong effect on researchers. Ross Ashby (1956) noted that the longer a state machine operated, the output became independent of the input and began to reflect the structure of the machine itself.^[34] We are conditioning researchers to produce techniques, not science. There is no theory.^[35] It isn’t that the basics of Research 101 are unfamiliar in our field. What is unfamiliar is that our research must contribute to building a theory. The problem is that the bulk of our researchers have not had training in doing good theory. In mathematics, yes, but not in science.

As anyone who has taken a science course knows, the first step in science is not to measure as the NRC study says. The first step is to state a hypothesis. To state a hypothesis, one must start with a theory to be invalidated.^[36] As Einstein is often quoted, “It is the theory that determines the data.” Without theory, you don’t know what questions to ask, and you don’t know what data is relevant or how to measure it.

I once asked a good friend and Nobel laureate^[37] in physics about Galileo as a theorist and got a strong reaction, “No! Galileo was the great experimentalist!” True, Galileo’s experimental method was exemplary and important to the development of science. But his method would have meant nothing had he not had the insight to see the model that was at the core of the problem. Galileo had thought long and hard before those famous experiments were done. Galileo’s brilliance was in picking the right model on which to base his experiments: falling bodies in relatively controlled conditions. If Galileo had attacked a problem of practical value such as predicting where cannonballs would land (and I am sure that APRAF^[38] would have funded him), he would have found his equations useless. In fact, people did try to apply his results to cannonballs and found his equations were off by as much as 50%.^[39] But Galileo did not start by trying to find equations that would take into account wind drift, air resistance, or that cannonballs were not perfect spheres. The solution to where a cannonball lands requires far more complex equations, at least a system of three-dimensional second-order partial differential equations.^[40] Whereas Galileo’s model could be described with a simple polynomial and confirmed by relatively simple, well-controlled experiments. After the basics of the theory had been worked out, the results for cannonballs could be reasonably achieved by enhancements to Galileo’s equations; but almost impossible if you were to try to start it from the start with a clean sheet of paper. Galileo had the insight to pick an initial model that had a small number of independent variables, could be tested experimentally, and could form the basis to solve more complex problems. But then (and I hesitate to say this), Galileo and his colleagues had the faith to believe in the elegance of the solution even when it contradicted experience, the great Aristotle, and important real-world data. (This is not uncommon in the annals of science [Fleck, 1981].)

Needham recognized that a strong central government as the only patron of science could lead to stagnation. Not surprisingly, he did not foresee that the short ROI demanded by a successful merchant class could have the same effect. It would seem that we have created the conditions for stagnation. Depending on your degree of pessimism, it is now up to us to avoid it or get ourselves out of it.

Finding a New Path

Avoiding the proliferation of technique or, more positively, encouraging the consolidation of technique by theory is difficult. Although, the private sector is more interested in technique, or as they call it product differentiation, theory is not just the responsibility of the research community. Such consolidation is just as important to the private sector for decreasing cost, improving performance, and creating new opportunities. Furthermore it occurs at many levels, some only the purview of those building product. At the same time, we must keep theory from its flights of fancy and close to “its proper soil,” the practical.

There are many platitudes for encouraging theory and good research. As previously mentioned, it is exceedingly hard to rush theory but it is easy to discourage it. It often seems that all one can do is to create a fertile environment and hope for the best. But there are some things that will help create fertile ground. One can argue that research programs and funding should do more to stress theory. But how many researchers have a sufficiently broad exposure to multiple paradigms, the history of science, and to some extent philosophy to be able to think about it from the outside? And how quickly will it be viewed as just another means to exploit the funding process? This could harbor a lot of less than serious work, and it would be very hard to distinguish between that with real potential and that that had done. Clearly, keeping an eye on the fundamentals is important, always holding proposals up to those fundamentals. A better understanding of the fundamentals is never a waste of time or money. On the other hand, this book demonstrates that new insights are more a product of hard thought than hard cash.

But what are the fundamental principles that can be taken from theory in computer science? Fundamental principles are relations that are invariant across important problem domains. The more fundamental, the greater the scope of their use. To have principles, we need good theory. But developing theory in computer science is much more difficult than in any other field because as I have said many times before,

we build what we measure.^[41]

Because there are few principles to rely on, we often see new efforts going all the way back to the beginning. Hence, many different decisions are made in each new effort, which in turn makes it more difficult to compare different approaches to the same problem, which further complicates our ability to discern principle and artifact.

But we can leverage this unique nature of computer science to get our arms around the problem of separating theory from artifact. There are essentially two parts to computer science: a part that is mathematical, and a part that is scientific. Mathematics is not a science. In mathematics, the only requirement is that a theory be logically consistent. In science, a theory must be logically consistent and fit the data. Many aspects of computer science are purely mathematical: automata theory, complexity theory, algorithms, even to a large extent programming languages, and so forth. Although they are rooted in mathematics, it is the “systems” disciplines of computer science that are the more scientific: computer system design, operating systems, networks, database systems, and so on.^[42] For theory to consolidate knowledge, it must find models that emphasize the invariants in the logical structure. This is much of what we have tried to do in this book.

Because mathematics is independent of the data, this can provide us with a means to develop theory in the systems disciplines of computer science. In a real sense, there are principles in these fields that are independent of technology and independent of the data. Principles follow from the logical constraints (axioms) that form the basis of the class of systems.^[43] This is the architecture.^[44] Within this structure, there are additional principles, which are dependent on the data and independent of the technology or implementation. Specific choices in this space yield specific designs. These two should form the content of university-level texts in a field. And finally, there are the “principles” or relations that are dependent on the technology. These form the basis of product manuals and technical books.

The first two are pure theory. Here is where the pressure to emulate Euclid is most felt. One wants to find the minimal set of principles needed for a model that yields the greatest results with the simplest concepts. The general principles derived from the empirical will often take the form of a trade-off or principles that operate within certain bounds or relations between certain measures. Goodput in networking is a good example of this. We have seldom distinguished these three forms of knowledge. We have not subjected ourselves to the same discipline that other sciences follow. This tendency contributes to the proliferation of techniques and ultimately to the stagnation of our field. This is hard work. It requires disciplined experiments. But it is possible. And it beats working in a stagnant field.

As a start in extracting us from this malaise, this book has attempted to show how we have come across more of the principles than we may have thought. To lay the groundwork for a general theory of networking. To tease out the fundamental structure of a complete network architecture, not just an unfinished demo. So, let’s review the high points of what has been covered here and consider their implications.

The High Points

In the first five chapters of this book, we looked at major aspects of networking and what we had learned along the way. We considered the patterns that fell out when we extracted the invariances from protocols, and how we might create a synthesis for the connectionless/connection dichotomy. And while we found that there was no upper-layer architecture, we did find concepts that would prove crucial to understanding the structure. We dug into naming and addressing and found much good work had been done that was still waiting to be applied, which when extended to accommodate new conditions presaged the structure we found when we came to sorting out layers. Sorting out layers proved to be a little embarrassing; the answer had been staring us in the face all along, being precisely what we had always said it was: IPC! We had been skipping steps and getting sloppy! But after we assembled it, there was a lot to reflect on:

First and foremost, there is one layer that provides interprocess communication and it repeats. All layers have the same complement of functions (in particular instances, some functions may be null) but are configured (with policies) for different ranges of the problem.
The IPC model makes it clear that the network or Internet layer was the last vestige of the “beads-on-a-string” model. It disappears. The data link layer developers got it right, but only because we only gave them one layer to work with! Networking is very much a discipline of computing systems. Telecommunications is dead or applies only to the physical media. There are implications for operating systems here that are yet to be fully explored.
This model confirms our understanding that security functions belong in the application. Even IPC security is just a specific form of application security, because IPC processes are applications.
The distributed IPC model is inherently more secure in that it minimizes information available to attackers, requires minimal information from applications, makes it possible to put requirements on its members, and requires minimal trust in supporting services. When combined with current security techniques, it can create a network secured to the degree required by its developers. The enrollment phase establishes the level of trust in the DIF.
Layers (also called distributed IPC facilities) have different scope (or if the same scope, are of different ranks) and serve significantly different ranges of effective bandwidth.
- The decomposition of the structure into three loci of processing with different cycle times—ranging from the very simple and fast for data transfer; to somewhat slower, somewhat more complex control; to slower, yet more complex management—all decoupled from each other through a RIB/state vector is quite elegant and satisfying.
- Our thinking about EFCP-like protocols and the fact that we weren’t clear about the bounds on what went into them had to some degree obscured this structure. When we understand that this is the per-instance IPC channel, limit what should be in them, and remove the invariances (that is, separate mechanism and policy), we see that UDP and TCP are actually part of the same protocol. This then leads to a somewhat unexpected result: There are only two protocols in networking.
What About the End-to-End Principle?
Nearly every architecture paper in the field cites it, but I have a simple question: Why not? The astute reader will have already realized the answer: It simply never came up. And, in a recursive architecture, the concept of “end” is a non sequitor.
1. An error- and flow-control protocol to create the equivalent of an IPC channel. It consists of a single Transfer PDU and optional Control PDUs to support loosely coupled mechanisms. Common PCI is added under two conditions:
  - To carry internal addressing information, if the IPC facility has more than two members
  - For PDU protection (that is, CRC, time to live, encryption)
2. An application protocol that provides information exchange for the distributed IPC, including the following:
  - An IPC access for implementing search rules and access control.
  - A Resource Information Exchange for distributing near-real-time resource information necessary for managing the IPC facility. Connection setup for this protocol provides the means to authenticate new members and assign addresses.
  - A DIF management that performs the role of a traditional network management.
delta-t turns out not just to be a good example of an EFCP but fits into the structure as if it were made for it. I had always known that it was probably the intellectual paragon in protocol design but had not realized how much better it fit the context. Early on, I realized that for an EFCP to support a wide range of services, as it must in the higher layers (transport), separate PDUs for loosely coupled mechanisms would make degenerate cases fall out more simply and make it easier for the implementation to exploit the natural separation of control and data. That implied that protocols such as SeqPkt or TP4 might be a better choice than TCP.^[45] But after the rest of the IPC structure was in place, it was clear that delta-t fit like a hand in a glove. This was a surprise. It allowed for an elegance in the structure that had not been foreseen. It also emphasized the importance of the decoupling.
Decoupling application requests from the protocol action as a key element of the structure. Determining what to do with a user request for communication resources isn’t just invoking the EFCP protocol machine, as we have all seen it for the past 35 years, but involves invoking the DIF’s management. This clarifies all sorts of aspects of resource management.
The application protocol is probably best served by a generalized and updated version of HEMS, as discussed in Chapter 4.
As noted in Chapter 4, OSI figured out that there were no general-purpose upper layers. There might be common functions that some applications used, and specific application domains might have additional layers (that is, relaying and error control with greater scope). OSI might have made more progress on the implications of that had the PTTs not stolen the session layer. The design sensibilities of the telecom tradition were misguided and continue to be as we have seen with WAP, Bluetooth, and now IMS. OSI’s other contribution was working out the relation between the application process and application protocol. This not only makes logical sense, but it has also proved to have wider implications, clearly explaining constructs that were unforeseen to be related. What was surprising was that this relation became key to understanding the structure of the “lower layers” (IPC).
The “session layer,” whatever it was to be, turns out to be part of IPC management. A case where two kludges (the theft of the session layer in OSI and DNS as not quite a directory in the Internet) made it hard to see the answer. But the natural way that it fits—corresponding to operating system search rules and access control and how the case of “out of date” DNS/directory information (also known as a mobile application) is a degenerate case of normal operation—indicates that this is clearly “right.”
We also found that protocol-ids are unnecessary. At first it looked like protocol-ids identified syntax, and they did. But after the IPC model was understood, it was clear that they were completely unnecessary. If there is a protocol-id in a protocol design, the layer design is faulty; either the layer boundary is in the wrong place or there is no EFCP or no enrollment or both.
By the same token, the flow-id in IPv6 would be unnecessary if the existing flow-id hadn’t been used for something else. One kludge leads to another. Not having the equivalent of the IPC access protocol leads to well-known ports and leaves no flow-id available.
Connectionless turns out to be maximal state, not minimal state. This was a rude turn of events and completely upset everything I had thought (and claimed loudly) for 35 years! As it turns out, we had been drawing the boundaries too tightly and comparing apples and oranges. Like trying to do a conservation of energy problem with the wrong bounds on the system! Connection-oriented concentrated on congestion control and put routing at the edge, while connectionless concentrated on routing and did nothing about congestion until it hit us over the head. When it is put this way, it makes a lot of sense and points the way to some interesting, new directions to providing connections with connectionless properties and to providing QoS without having to resort to strong connections as in MPLS. It appears that the dumb network wasn’t so dumb after all! When we compare apples and apples, the amount of state appears to be about the same. The difference is in where it is. This therefore argues that connectionless is characterized by maximal distributed state, whereas connection-oriented is characterized by concentrating state, primarily at the ends.
Layers recurse. We should have seen this coming. Dijkstra and the 1960s were so resource constrained that it focused our attention on differences rather than similarities. And although there are allusions to recursion in the original catenet paper (Cerf, 1978), it seems to have been forgotten. The lesson of large bridged LANs should have told us to expect the same for large routed networks.
When the layers have a common structure, the implementation is streamlined and does not incur the overhead of traditional layers. In effect, it organizes the processing and doesn’t get in the way, which makes implementation more compact and effective. This undoubtedly has significant implications for moving functionality into silicon.
We had always assumed that addresses were names of protocol machines as application names were the names of applications. So, it was quite surprising to realize that in the context of the distributed IPC model, addresses are identifiers internal to the DIF to coordinate its operation. The IPC process is an application process and as such has an application name. This is what is used to initially establish communication with it.
Saltzer’s addressing model from the 1980s requires only slight generalization to include a case that didn’t exist when Saltzer wrote and provides an elegant example of “throwing away the ladder.” It presages the recursive model, and then Saltzer’s distinction of node and point of attachment turn out to be relative and, when combined with the last bullet, a natural consequence of the IPC structure. We needed Saltzer’s result to see the answer, but after we had the answer we no longer needed the result!

But we also found a nice parallel progression in the roles that names and addresses have in a network. The architecture of external application names is essentially a projection onto internal DIF addresses:

Application names:		location-independent	system-independent
Application instance-names:		location-independent	system-dependent
Sentential names:		location-independent	system-independent
Sentential identifiers:	route-independent	location-independent	system-independent
Node addresses:	route-independent	location-dependent	system-dependent
Point-of-attachment addresses:	route-dependent	location-dependent	system-dependent

“Private” addresses turn out to be the general case, and “public” addresses are a specific kind of private address. This was totally unexpected, and I am sure is not going to be popular among those whose utopian desires outstrip their scientific discipline. But this is consistent with the world we now find ourselves, where potentially everyone owns their own network, not just large corporations or governments. It appears now that the idea of the global public address that everyone has to have was an artifact of artificial constraints of early technology that are no longer necessary. One can break out of the tyranny of being accessible within a single global address space: one more blow for freedom, privacy, and security, another blow against the Orwellian tyranny of having to be always connected to the public network.
The layer as distributed IPC reveals minimal information and allows strict control over the information made available to the application. This provides a much more robust structure within which to provide security services.
And last but not least, this model seems to do an admirable job of meeting Newton’s guidelines. We will be hard pressed to find a simpler model that solves as many problems that isn’t isomorphic to this one. In other words, this isn’t just a nice model, it is the basis for a general theory of networking.

Reflecting on this model and looking at what we have been working with, it is clear that what we have today is more analogous to a rudimentary operating system like DOS than to a full-fledged operating system like Multics or VMS. Like DOS, all the pieces are there to call it an operating system, but there are a few things missing that give it everything we think of when we think of a “real” operating system. There are many reasons why this was the case and we have discussed many of them. But still, it should serve as a lesson for the future. We should have seen this sooner (me included).

I must admit this model was constructed not so much looking to solve specific problems, but carefully analyzing interprocess communication between distinct systems with an eye to extracting invariances and minimizing discontinuities. Trying to see what was really going on. It would not be going too far astray to say that finding an elegant, simple structure was the primary overriding concern.^[46] But, of course, this pursuit of elegance was not without a clear knowledge of the problems it needs to solve. Even so, on numerous occasions it became clear only long after arriving at the structure that the structure solved a particular problem, not something I had consciously set out to solve. With this structure, we find that it has a number of interesting properties:

Most important, the model scales over any range of bandwidth, number of elements, distance, and so on.
This allows a complexity implosion that eliminates the need for hundreds of specialized standards, makes networks much easier to manage, and allows more effort to be brought to bear on doing more sophisticated services with the network rather wasting it on twisting an incomplete structure to new ends.
Initializing a DIF is just a case of setting up an application connection. The application authenticates its peer. In this case, that authentication corresponds to ensuring that the process is an acceptable member of the DIF. All that the DIF can guarantee is that as far as it can tell this is what the source asked for. IPC can’t not guarantee it.
This model provides a strong foundation for doing science and for teaching university-level networking, not vo-tec. This model allows us to eliminate variables so that we may better compare results of experiments.
Multihoming is inherent in the structure.
Mobility is dynamic multihoming, a consequence of simply configuring the structure appropriately. All forms of mobility (mobile subnets, systems, or applications) are easily accommodated without special cases.
The model simplifies multicast by more closely integrating it into routing and similarly makes anycast generally useful.
Then multicast turns out to facilitate aspects of mobility.
The structure can be secured to whatever degree necessary. The repeating structure also means that it is less work and that if bugs are found they only have to be fixed once.
Among the things that scale is router table size. With topological addresses and recursion, the number of routes that any one router has to store can be bounded. In the worst case, the total number of routes in a network might be the same, but no router would have to store anything close to all of them.
Congestion control can be utilized in many places either with or without the cooperation of applications. (Far beyond the scope of this book, there are early indications that this structure makes much better congestion control and QoS possible, which then provides much finer control over a network.)
With a full naming and addressing complement, sophisticated distributed applications are possible without additional protocols. (Voice call management is a fairly simple use of these capabilities.)
Other application structures turn out to be cases of IPC—mail relaying (or any application relaying), aspects of transaction processing, and so-called peer-to-peer protocols. This also has the odd implication of removing the “transport layer” barrier from the network providers that started the protocol wars in the first place. Network equipment vendors and network providers, it could easily be argued, are in the IPC business. IPC doesn’t stop at the network but continues up into what has traditionally been seen as applications.
But probably most important, a common repeating structure will have a profound affect on network management. Rather than layers crafted with all sorts of bells and whistles, all with delusions of being the answer to everything, layers (DIFs) are configured to address well-defined ranges of bandwidth and QoS with protocols that are configured to behave in similar and complementary ways. This will promote scalability, repeatability, orthogonality, and commonality, greatly simplifying and improving the effectiveness of network management.

Does this throw out much of the work of the past 20 years? Not really. As with any shift in thinking, there will be a few topics that were once popular that will be mentioned much less frequently. However, because much of the research of the past 20 years has been on technique, much of it can be translated into what we would call policy; this work can be carried over. In fact, more research may find application since there is less requirement that all equipment do the same thing and hence greater diversity in the networks and layers that may occur. This model will make it easier to evaluate when which policies should be used and the boundary conditions within which well-behaved policies should be constructed.

Where do we go from here? This book is an attempt to bridge the gulf between what we have seen in our experience with networks over the past nearly 40 years and to get us out of the blind alley we seem to have marched into. With any luck, this book will help to restart the revolution begun by the ARPANET, CYCLADES, NPLnet, and all of those other “old guys” but was derailed by two decades of conflict. With even more luck, this book will serve as an inspiration for a new generation of young Turks^[47] to have the excitement that we had and to make the kind of mark we made. I hope this book helps rally the seers still left in computer science to throw off the corruption of Moore’s law and return to distilling principles and abstractions and maybe even theory, making the advances that have been lying in wait all these years. Rescue us from the stagnation of mounting technique.

By the time this book publishes, there should be specifications available for a DIF that can be debated and implemented and for which policies can be written. Discussion, experimentation, and exploration can begin—working out the policies for different forms of DIFs from what used to be called the data link layer through the resource-allocation layers and into what we called applications. I hope this book opens up new ideas about providing QoS and routing using addresses. I intend to also begin assembling a textbook that uses this model and its results as its organizing principle (a book based on invariances and principles, a true university-level textbook).

What do I expect the reaction to this book to be? Some are going to be intrigued and probably enjoy the read (I hope). I would hope that people will read this, consider its foundation in IPC, and realize that although it doesn’t agree with much accepted wisdom, it is well founded and might just represent the basis for a new generation to have their day in the sun. Clearly an approach to networking like this benefits the owners of networks more than the vendors. Major vendors will hate it because it represents further commodization and simplification. Engineers will hate it, because it requires many fewer engineers to develop and configure policies than developing whole new protocols and will come up with all sorts of excuses to continue their expensive proliferation of special cases and unnecessary optimizations. The greatest expense in networking today is staff, both operations and engineering. This direction requires much less of both.

But the bulk of the reaction? That is easy. In Hafner’s Where Wizard’s Stay Up Late, Bob Metcalfe describes the reaction of the AT&T engineers to the ARPANET demo at ICCC in 1972. Pretty much the same response is likely here. I hope not, but I won’t be surprised. Do we care? They are the new bellheads and already well past their prime. The future belongs to IPC!

No, technology does not change the world. Imagination does.

^[1]For those who may dispute that characterization, I can only offer that if NEWARCH did not come up dry, why is FIND necessary?

^[2]But, it has worked well for 30 years! Yes, but there were no new applications for 20 of those years.

^[3]Which was no small task. A TCP implementation from scratch took 18 man-months and was pretty much a one-person effort. And as one would expect, the first one was a throwaway, as we learned that the specification was not a good guide for implementation.

^[4]As noted previously, there were some large networking projects done by various ARPA contractors, but they only involved those groups and did not really affect the general use of the Net.

^[5]The human species’ ability to ascribe meaning to things that was never intended is truly amazing.

^[6]Which in most cases were one and the same, except in the United States.

^[7]It is interesting at this time to consider how symbiotic the PTT architecture and the IBM SNA architecture were. Each very carefully avoided the other’s turf.

^[8]LAPB was aligned with HDLC, and a datagram packet type was added, but no PTT had any intention of implementing it.

^[9]This committee, essentially controlled by IBM, represented the current status quo with the phone company. It didn’t want to see its lucrative boat rocked.

^[10]Things such as a full addressing architecture, à la Saltzer, and developing the upper layers. (As we saw in Chapter 4, OSI still got it wrong, partly because there was more to understand and partly because of action of the PTT faction. But it was a step along the way.)

^[11]Which held the head of delegation position for most European countries and the United States.

^[12]Well, not entirely. IBM endorsed the seven-layer model but contended that OSI didn’t do management. (Sound familiar?) IBM already knew that the symmetrical layered model was completely incompatible with its flagship SNA, a hierarchical architecture. You can always subset a symmetrical model to be hierarchical, but you can’t go the other way. IBM didn’t want this to happen in a major way.

^[13]It truly was. I think I could write a “Discourses on Livy” for standards committees.

^[14]IBM was in a multifront war during this period. IBM always sold to the person who signed the check. In the 1960s and 1970s, the person who could sign a check for millions of dollars of computer equipment had little or no computer expertise. As we moved into the 1980s, that was changing and check signers were more willing to consider minicomputers and PCs. IBM was probably more successful delaying the assault on networking than they were at delaying the assault on the mainframe.

^[15]This has necessarily been a very high-level account of these events touching on the first-order effectors. There is fertile ground here for those interested in the process of and sociology of science and technology.

^[16]At the time, they were characterized as extremes of shared state, which it is. But this characterization does not lead to a useful middle ground.

^[17]Although that may have had more Luddite, anti-intellectual overtones, as in “real programmers don’t use high-level languages.”

^[18]There is a lesson here for both the Americans and the French. The Americans have not been the innovator they thought they were but were able to capitalize on the ideas. The French, although creative, were not able to capitalize on the new idea because it threatened entrenched institutions. One must admit that this is typical of both.

^[19]And now he champions action on the global warming crisis and goes down in history as the most influential U.S. politician to never be president? How strange that would be.

^[20]It isn’t the same attitude that all problems can be solved in a one-hour TV show, but it is close.

^[21]However, there is some question whether anyone wants to make or can afford to make such a determination. To do so would jeopardize their funding.

^[22]Quoted in their entirety both to emphasize the similarity and to assure the reader I have not been selective in quoting them.

^[23]There are intersections, but they are exceedingly rare.

^[24]Today you are told “we have moved beyond that” when it is not at all clear that it was ever really understood.

^[25]And don’t even mention IPv6. It is less than too little too late. A waste of time.

^[26]One might argue that this is true of any experimental apparatus in any science, but I contend there is a big difference between building an inclined plane to study the motion of balls in a constant gravitational field and building gravity to measure its properties!

^[27]Galileo’s problems started when he got homesick and returned to Florence to teach at Pisa.

^[28]Ricci translated Euclid for the Chinese in the early 17th century (Spence, 1984).

^[29]This is consistent with the strong artisan aspect of the tradition.

^[30]Expecting something like scientific method or predictive theory may be expecting too much. Scientific method, even in a rudimentary form, did not arise in the West until science had already begun to stagnate in China. It is possible that a critical mass of results was necessary for “science” to arise. One could argue, however, that China should have achieved that critical mass earlier. Another factor is that while China changed dynasties several times between the Han and Ming (~ 2,000 years), they didn’t change bureaucracies. Just as, according to de Tocqueville, the French didn’t change bureaucracies between the ancien regime and the Republic.

^[31]Well, okay, nearly everything, thanks to Gôdel.

^[32]Not unusual for craftsmen.

^[33]Probably lost due to lack of use. The fact that the earth is round is a prominent issue for a culture focused on maritime ventures as Europe was, but since most Chinese trade was overland, the knowledge was seldom needed.

^[34]His favorite example of this principle was that when you met someone in the UK, you could immediately tell what public school they had gone to.

^[35]It must be admitted that theory as sometimes practiced by academia and even corporate research labs has sometimes given theory a bad name. Big investments made in too many big claims have failed to live up their promise leading to more demand for early practical results.

^[36]As Popper showed, you can never prove a theory correct; you can only invalidate it.

^[37]Who was more an experimenter than a theorist.

^[38]“Agenzia di Progetti di Ricerche Avanzate di Firenze.”

^[39]Today you could not get funding for the equivalent of rolling balls down an inclined plane or dropping them off a tower; it has to have practical application. Is it any wonder our field is in stagnation?

^[40]Not to mention that calculus had not yet been invented and to some extent required these results to create the impetus for its invention.

^[41]I know I keep repeating this, but it is that important.

^[42]I do not propose that to draw this distinction too finely, but see it as a useful way to think about our discipline. And in fact, some curricula make this distinction.

^[43]One should not assume that there is a single set of “axioms” for operating systems or networks any more than there is a single set of axioms for geometry. The set of axioms merely serves to create a footing on which to build a theory to test.

^[44]Our field also has a tendency toward “term inflation.” Architecture is used to be synonymous with hardware; architect with chief engineer, topology when we mean graph, and so on. We need to be more careful.

^[45]I had always been uncomfortable with the fact that TCP didn’t separate control and data (something that seemed fairly iron-clad in protocol design, but it had seemed like a good idea at the time).

^[46]I must admit that I did not follow the steps that any system design book tells you to follow: requirements analysis and all that. And I believe that if I had, this structure would not have resulted.

^[47]That’s what they called us.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 10. Backing Out of a Blind Alley

Create new playlist

Sign In

Sign Up