Project 100 had a simple goal: to see one hundred progressive women serving in Congress by 2020. To get there, its founders built a website (project100.org) to showcase the positions and values of every progressive woman running for office.
And with a record number of women running for office in 2018, Project 100 had no shortage of content challenges. Condensing nuanced platform details for each candidate to just a few bullet points was difficult enough—but they still needed to be arranged on the site in a clear, balanced way.
Project 100 cofounder Eduardo Ortiz started first with an algorithm to elevate “trending” candidates, enabling those with strong name recognition to appear at the top of the page. However, Ortiz said, “we wanted people to understand that there are more qualified candidates than the ones who make it to the mainstream or get reported by the media.” To keep lesser-known candidates from disappearing behind a curtain, he designed without pagination:
If we were to paginate the candidates, those who have fewer resources would be displayed on later pages, continuing that disparity…Paginating would have been an arbitrary decision for organizing the candidates. Not paginating was an idealistic approach to giving everyone a level playing field.
This simple but well-considered design decision ensured that all candidates could easily be brought to users’ attention. (Coincidentally, the 2018 elections met Project 100’s goal two years early. Maybe it was the information architecture, maybe it was the power of democracy.)
Pagination would have changed the way candidate information could be discovered and understood—as well as the structure of the site. The meaning and impact of the content tells us not only how to display it, categorize it, and label it, but also how to connect it into a functional, overarching system.
Just as we need to understand our content before we can recategorize it, we need to understand the system before we try to rebuild it.
Enter the structural audit: a review of the site focused solely on its menus, links, flows, and hierarchies. I know you thought we were done with audits back in Chapter 2, but hear me out! Structural audits have an important and singular purpose: to help us build a new sitemap.
This isn’t about recreating the intended sitemap—no, this is about experiencing the site the way users experience it. This audit is meant to track and record the structure of the site as it really works.
First, we’re gonna need another spreadsheet. (Look, it is not my fault that spreadsheets are the perfect system for recording audit data. I don’t make the rules.)
Because this involves building a spreadsheet from scratch, I keep a “template” at the top of my audit files—rows that I can copy and paste into each new audit (Fig 4.1). It’s a color-coded outline key that helps me track my page hierarchy and my place in the auditing process. When auditing thousands of pages, it’s easy to get dizzyingly lost, particularly when coming back into the sheet after a break; the key helps me stay oriented, no matter how deep the rabbit hole.
Color is the easiest, quickest way to convey page depth at a glance. The repetition of black text, white cells, and gray lines can have a numbing effect—too many rows of sameness, and your eyes glaze over. My coloring may result in a spreadsheet that looks like a twee box of macarons, but at least I know, instantly, where I am.
The exact colors don’t really matter, but I find that the familiar mental model of a rainbow helps with recognition—the cooler the row color, the deeper into the site I know I must be.
The nested rainbow of pages is great when you’re auditing neatly nested pages—but most websites color outside the lines (pun extremely intended) with their structure. I leave my orderly rainbow behind to capture duplicate pages, circular links, external navigation, and other inconsistencies like:
Note that coloring every row (and indenting, as you’ll see in a moment) can be a tedious process—unless you rely on Excel’s formatting brush. That tool applies all the right styles in just two quick clicks.
Color-coding is half of my template; the other half is the outline, which is how I keep track of the structure itself. (No big deal, just the entire point of the spreadsheet.)
Every page in the site gets assigned an ID. You are assigning this number; it doesn’t correspond to anything but your own perception of the navigation. This number does three things for you:
Let me be completely honest: things might get goofy sometimes with the decimal outline. There will come a day when you’ll find yourself casually typing out “1.2.1.2.1.1.1,” and at that moment, a fellow auditor somewhere in the universe will ring a tiny gong for you.
In addition to the IDs, I indent each level, which reinforces both the numbers and the colors. Each level down—each digit in the ID, each change in color—gets one indentation.
I identify top-level pages with a single number: 1.0, 2.0, 3.0, etc. The next page level in the first section would be 1.1, 1.2, 1.3, and so on. I mark the homepage as 0.0, which is mildly controversial—the homepage is technically a level above—but, look: I’ve got a lot of numbers to write, and I don’t need those numbers to tell me they’re under the homepage, so this is my system. Feel free to use the numbering system that work best for you.
So we’ve got some secret codes for tracking hierarchy and depth, but what about other structural criteria? What are our spreadsheet columns (Fig 4.2)? In addition to a column for Page ID, here’s what I cover:
Depending on your project needs, there may be other columns, too. If, in addition to using this spreadsheet for your new sitemap, you want to use it in migration planning or template mapping, you may want columns for new URLs, or template types.
You can get your own copy of my template as a downloadable Excel file (http://bkaprt.com/eia/04-01/, XLSX). Feel free to tweak it to suit your style and needs; I know I always do. As long as your spreadsheet helps you understand the hierarchy and structure of your website, you’re good to go.
Setting up the template is one thing—actually filling it out is, admittedly, another. So how do we go from a shiny, new, naive spreadsheet to a complete, jaded, seen-some-stuff spreadsheet? I always liked Erin Kissane’s description of the process, from The Elements of Content Strategy:
Big inventories involve a lot of black coffee, a few late nights, and a playlist of questionable but cheering music prominently featuring the soundtrack of object-collecting video game Katamari Damacy. It takes quite a while to exhaustively inventory a large site, but it’s the only way to really understand what you have to work with.
We’re not talking about the same kind of exhaustive inventory she was describing (though I am recommending Katamari music). But even our less intensive approach is going to require your butt in a seat, your eyes on a screen, and a certain amount of patience and focus. You’re about to walk, with your fingers, through most of a website.
Start on the homepage. (We know that not all users start there, but we’ve got to have some kind of order to this process or we’ll never get through it.) Explore the main navigation before moving on to secondary navigation structures. Move left to right, top to bottom (assuming that is your language direction) over each page, looking for the links. You want to record every page you can reasonably access on the site, noting navigational and structural considerations as you go.
My advice as you work:
Knowing which links to follow, which to record, and how best to untangle structural confusion—that improves with time and experience. Performing structural audits will not only teach you about your current site, but will help you develop fluency in systems thinking—a boon when it comes time to document the new site.
The structural audit paints a picture of how the current system is put together, which should inform your new system. You don’t want to replicate it—you want to understand its weaknesses, so you don’t recreate them; and understand its strengths, so you don’t leave them behind. You’ve learned how the current pages work together experientially; now you can dream up more effective page relationships across the site.
Your sitemap is where you document the dreaming. It’s an artifact for communicating the hierarchy of pages in your site. It records your categories and labels, and maps out how content on the new site will be organized within them. It is your system’s north star.
First, let’s consider the direct relationships to be documented: parent-child relationships between pages at different levels in the hierarchy, and sibling relationships between pages at the same level in the hierarchy (Fig 4.3).
“But,” you might be saying, “my web pages have much more complicated connections, well beyond parent-child and sibling relationships!” Well—good! I would hope your site provides plenty of fluid paths for users to find the content they need as quickly as possible. But sitemaps are not meant to capture every possible path; indirect relationships between pages, and creative ways of surfacing content, won’t necessarily appear here.
Even if you’re not thinking in a strictly page-like way about your designs—I know, I know, we’ve broken the boundaries of the page, and we’re supposed to be designing experiences, not boxes—there are still plenty of benefits to articulating the structure of your site through a sitemap:
Admittedly, I’m subtweeting a bit to all the times I’ve heard someone say, “But do we really need a sitemap?” But the way a site is put together is a design choice, and documentation helps us conceptualize and communicate our choices.
The sitemap is an artifact—it needs to be articulated textually or visually (or both), not only to record your structural decisions, but to share the context for your work with colleagues and stakeholders.
The best documentation is the one that works for you, so consider how the sitemap can assist with the next steps in your project. Will it be used to garner stakeholder approval? To demonstrate progress in the project plan? To prepare for a CMS migration? To identify pages for wireframing?
All of those are valid uses for a sitemap (and your sitemap may meet multiple needs at once). But different styles of documentation have different levels of detail, and so are suited for different purposes.
When people (especially stakeholders) hear “sitemap,” they often think of a visual diagram: a series of boxes arranged in a tree-like structure, connected by lines denoting hierarchy (Fig 4.4).
This visual approach is excellent for presenting high-level structure, particularly to stakeholders who don’t want to get caught up in details. On the flip side, because these diagrams don’t have much room for detail, they tend to work best for smaller sites, or overviews of larger sites.
My sitemap diagrams span multiple pages: first, a page that shows all navigation structures, one level deep (Fig 4.4), then subsequent pages that show the details within each navigation structure (Fig 4.5). (If I find that this isn’t enough to convey the structure, that’s an indication that I should try a text-based sitemap format instead.)
Visually, sitemap diagrams are entirely up to you. Some people prefer horizontal arrangements to vertical ones; some avoid color. Colors, connectors, borders, shading—it’s your call. Just make sure whatever signifiers you apply to your diagram contribute meaningful information. This is particularly important when sharing the sitemap with others; consider including a legend if your system isn’t readily apparent to those who need to approve or act on it.
For a more textual approach, try an outline—yep, just a plain ol’ outline, in a plain ol’ document (Fig 4.6). Outlines are the epitome of hierarchical documentation—that’s literally why they exist—so you can see how that format works nicely for recording a hierarchy of pages.
Outlines are a good choice for completionist scenarios. If you need to show every page at every level, across multiple nested levels, an outline is a much more effective record than a box-and-arrow diagram.
On the other hand, outlines can become quite long, text-heavy, and wavy with indentations and decimals. Stakeholders who were expecting an easy-on-the-eyes diagram may not interpret this as a “sitemap”—so, once again, consider the needs of your project at this stage.
If you need to record more than just page hierarchy, you’ll need to turn to our friend the spreadsheet (again: not my fault). Much like our structural audit, a sitemap spreadsheet creates room for a wealth of data—such as identifying each page’s source content, new URL strategies, content revision status, page ownership, deadlines associated with migration, and more (Fig 4.7).
If you conducted a structural audit, you’re ahead of the game here: you can use that spreadsheet as a springboard for the new sitemap. (Save a copy!) It’s an excellent way to track the progression of content from the old site to the new.
It’s also ideal for collaborating on wireframing, templating, migration, and other development efforts, which may need more metadata than a diagram or outline can provide.
Depending on the format you use for your sitemap documentation, you’ll need to include different kinds of data and different levels of granularity. But, generally speaking, these are my recommendations for building a well-documented sitemap:
If you’ve made it this far, then congratulations: you’ve got a sitemap (plus my respect and admiration)!
But—no matter how rigorous your decision-making, how justified your choices, how solid your data—you’re probably about to make some changes.
Maybe you’ve received feedback from stakeholders. Maybe you’ve been told about unexpected content changes that undermine your category labels (oops). Maybe you’ve done some testing (see the Resources section), and users seem confused by your language. Or maybe you’re still iterating, and the sitemap just hasn’t gelled yet.
Whatever your situation, there comes a point (er, probably multiple points) where you’ll have to revise your site structure. Revisions are usually motivated by one of two things: purity (i.e. the structure doesn’t feel balanced, clear, consistent, etc.) or politics (i.e. a stakeholder’s favorite page is buried, a label doesn’t sound important enough, etc.).
When adjusting the sitemap (whether to please your stakeholders or yourself), you have a few levers to manipulate:
I wish I could give you a formula to follow—some tried-and-true equation where x + y = a scientifically correct sitemap, one so airtight that all content is perfectly exposed and no stakeholder can question the category labels.
But this practice is closer to art than science. We blend a little bit of instinct, a little bit of nerves, and as much data as we can gather to make decisions. No site launches with a perfect sitemap; no sitemap is free from politics and biases and rushed timelines.
The sitemap is the heart and soul of your site structure. But it’s also just one artifact—and a limited one, at that. To ensure a usable structure, we must also illuminate the paths through our content, and help users stay oriented on their journey, wherever it may take them.
18.227.190.93