Chapter 8. The Web-Analytics Industry

Now that you've been given the grand tour of Google Analytics, let's take a step back before we take a giant step forward to understand the industry that is web analytics.

There are a lot of moving parts, definitions, explanations, and rationales in this chapter. I'll talk about how web analytics works, define some metrics, and debunk some common misconceptions. I'll talk about the tools that you'll need to understand to become a great web analyst. It's not the most exciting and entertaining subject matter you've ever read, but if you're into being successful online, you don't want to skip this chapter.

As an industry, web analytics is young. It's extremely young. It's so young that only now are online business owners coming to the realization that they need to be able to measure what's happening with their initiatives. I strongly believe that web analytics is the next big thing. And as in the mid-'90s, everybody will be jumping on board.

How Google (Web) Analytics Works

When you launch a browser and visit a website of your choice, a lot of things happen behind the scenes within a few seconds. After you connect with the Internet (using Internet Explorer, Firefox, Google Chrome, Safari, Opera, or some other browser), your home screen or homepage will load, along with browser preferences, plug-ins, settings, and privacy options. Then you can type the URL of a website into the browser's address bar or access it from a bookmark. Sometimes you'll use a search engine to find a site, or you'll click a link from another site to go to your site of choice. Once you choose to visit a site and click the link (or click Enter or Go), the magic happens behind the scenes.

Note

I spoke of both home screen and homepage earlier, and there is a slight difference in this case. With Internet Explorer you set your default page (homepage). With Google Chrome you have a "home screen" with the eight favorite websites that you visit.

Before the website loads, or even as it's loading, a piece of JavaScript track-ing code that the website's owner installed on every page of the site instructs it to look for browser cookies. Browser cookies are very small text files that are set by almost every website on the Internet today — some websites won't even work properly if you don't have cookies enabled on your browser. These very small text files can contain sensitive data about the websites you visit, like the time of entry, time of departure, referring website, and saved login information.

If the site cannot find the appropriate cookies, then because of the JavaScript tracking code it will install at least four of these cookies on your machine (keep reading for what information Google Analytics collects). If the site does find cookies, meaning you've visited before, then it will simply update the cookies to indicate that you've visited the site one more time, and to note whether or not you've used different referral information to access the site this time.

After this happens and as the site either is still loading or has just finished, data about your visit is sent to Google Analytics servers for processing. The same raw data that's stored in your browser's cookies is sent over the Internet in a neat, tiny package called a utm.gif hit, sometimes also referred to as a 1x1 pixel request, which is a small file that is sent to Google Analytics. This tiny image is captured by Google Analytics and its data is processed and recorded. As the visitor moves about the site, additional data is sent with each page view, which will be associated with the visitor's current visit.

When Google Analytics detects that there hasn't been any more activity by the user after roughly 30 minutes, the session will close. After a few hours (if that), Google begins the process of converting this raw utm.gif session data into the reports that I showcased in Chapter 7. The data is cleaned up, calculated, and run through any settings and filters that may be applied to your Google Analytics profile. It then is reported in your profile and is guaranteed by Google to remain there for two years. This data can then be segmented, exported, e-mailed, and everything else we just covered in the previous chapter.

All this happens for each visitor, every single time. Multiply that by the thousands upon thousands of visitors your site receives daily, and the tens of thousands of websites large and small that have Google Analytics tracking codes installed, in 30-plus different languages, with an unimaginable number of settings and filters and profiles. It's amazing how smoothly and rather unno-ticeably to the everyday visitor this all happens.

Figure 8-1 shows a set of cookies that were installed on my Firefox browser from the website www.metallica.com. You can see the four __utm cookies, which are the Google Analytics cookies. The __utma cookie stores the time stamps and counts of visits that each visitor performs on a website. The __utmb and __utmc cookies are temporary and are used to calculate session lengths. The __utmz cookie stores the referral information, such as the keyword, search engine, referring site, and medium.

Google Analytics cookies

Figure 8-1. Google Analytics cookies

Tag-based web-analytics solutions like Omniture SiteCatalyst (http://www.omniture.com/) and VisiStat (http://www.visistat.com/) collect data and use cookies in a similar way, each using its own language and data-collection methods. Server-based web-analytics platforms like Urchin Software (http://www.google.com/urchin/) from Google and CoreMetrics (http://www.core-metrics.com/) parse log files and clean up the data before showing it to their customers.

What Information Does Google Analytics Collect?

Google Analytics collects information about your website visitors that pertains to their experiences on the website. This information includes:

  • Where a visitor came from (Google, Yahoo!, etc.)

  • The means by which the visitor accessed the site (CPC, CPM, etc.)

  • The website(s) visited

  • The entry page (landing page)

  • The exit page

  • The pages viewed

  • The time stamps of each page view

  • The number of visits to the site

  • Any search queries performed on the site

  • Any events performed on Flash- or AJAX-based applications (event tracking)

  • Geo-location information (continent, country, state, city)

  • Browser language options

  • The browser (Internet Explorer, Firefox, etc.)

  • The operating system (Windows, Macintosh, etc.)

  • Screen colors

  • Screen resolution

  • The version of Flash

  • Java (from Sun Microsystems)

  • Service provider (ISP)

  • Internet connection speeds

  • Mobile carrier and device (if applicable)

  • Custom variables (if applicable)

  • The visitor's IP address (not displayed in reports)

  • Activity pertaining to Google AdWords (if applicable)

  • Activity pertaining to Google AdSense (if applicable)

  • E-commerce transaction and product information (if applicable)

None of this information, with the glaring exception of the visitor's IP address, can be directly tied back to an individual user. Even the custom variables are collected and displayed anonymously and aggregately, so that any individual user's information cannot be found in Google Analytics. The IP address that's collected is merely used for processing and used against any filters that a profile owner may have set up — you won't find any IP addresses in Google Analytics.

Depending on the web-analytics platform, you may see slightly different information in the reports. For example, in Yahoo! Web Analytics you can see demographic data such as the visitor's age range and household income. Other platforms do show you the visitor's IP address, but have clear statements in their terms of service that prohibit users from taking those IP addresses and using them for their own efforts.

Note

Actually, you will find non-individual IP addresses listed in Google Analytics, within the Hostnames report found within the Network Properties subsection in the Visitors report section. These IP addresses are servers and in some cases will be the IP addresses of websites where your tracking code is installed. But they are not individual visitor IP addresses.

Personally Identifiable Information (PII)

Google Analytics does not collect any personally identifiable information, and does not display any personally identifiable information in reports. Google Analytics also does not allow anyone to rig the Google Analytics data collection process in any way to collect information that may be submitted on a website via a form or another method. Since this is such a sensitive topic in web analytics, I encourage you to at least be aware of the terms and conditions, and even the privacy policy, of your web-analytics vendor. If that vendor is Google Analytics, you can visit http://www.google.com/intl/en/analytics/tos.html and be completely caught up.

Google Analytics does not collect or display in reports any of the following information:

  • Visitor IP address (except for the purpose of processing data; it is not displayed)

  • Personal first and last names

  • Street addresses or post office box numbers

  • ZIP codes

  • Salary information

  • E-mail addresses

  • Phone numbers

  • Credit card information

Most if not all web-analytics vendors follow the same guidelines. When integra-tion with a customer relationship management (CRM) software like SalesForce is present, this type of data is still not available from Google Analytics — only in SalesForce or your CRM tool of choice.

When you read http://www.google.com/intl/en/analytics/tos.html, you'll come across point number 7, which deals with privacy. This single para-graph outlines everything I just talked about under this subheading, but it also includes an interesting final sentence: "You must post a privacy policy and that policy must provide notice of your use of a cookie that collected anonymous traffic data."

Some websites do this already, but most don't inform visitors about what web-analytics programs they're using or what tracking methodologies they employ. I recommend finding a way to inform your users that you are collecting their usage statistics (but not their personally identifiable information).

First-Party vs. Third-Party Cookies

I mentioned in the beginning of this chapter that Google Analytics sets first-party cookies on a visitor's computer. It uses only first-party cookies, and no third-party cookies.

A first-party cookie is a cookie that can be read only by the website that sets it. If you visit metallica.com as I did back in Figure 8-1, and Google Analytics is installed there, then only metallica.com can open up those cookies, read them, and update them when necessary. First-party cookies are safe and secure, and cannot be sold or referred to other websites for any purpose. Most web-analytics vendors use them.

A third-party cookie is a cookie set by a website other than the one you're visit-ing. Let's say that you visit aol.com and there is an image ad running on that site for brownshoes.com. Let's also say that aol.com allows third-party cook-ies to be set. The brownshoes.com site is therefore able to set and read cookies on your site, even though you didn't visit brownshoes.com. This means that aol.com is allowing your sensitive web information to be read and accessed by another website. For security reasons alone, this is a dangerous practice that can open visitors up to all sorts of privacy problems. Sites can use third-party cookies to do things like use behavioral advertising (targeting), which allows sites to advertise to you based upon the history of your online activities. Some programs still allow third-party cookies. So be aware of all the facts before you decide to allow third-party cookies to be set on your computer.

Today there are also Flash cookies that can be set to track users, and in some cases to reset deleted cookies. A lot of Flash applications use cookies to remem-ber your audio preferences, video preferences, and other settings. However, some also use them for nefarious reasons. You can manage your Flash player settings by visiting http://www.macromedia.com/support/documentation/en/flashplayer/help/settings_manager.html, clicking Global Privacy Settings, and then clicking the Website Privacy Settings Panel link at the very bottom of that section. Figure 8-2 shows what a settings panel looks like, and you can manage your Flash cookie settings from here.

Google Analytics does not use Flash cookies either, only first-party cookies.

Adobe Flash cookie/privacy settings

Figure 8-2. Adobe Flash cookie/privacy settings

Limitations of Google Analytics

As great as Google Analytics is and as much as I'm a fan of it, it's not perfect. In its defense, it's not designed to be perfect. In some respects it's only as good as the data that it can collect from its users, which is something that every user is in full control over.

Some of Google Analytics' limitations are:

  • Robots/search engine spiders: Google Analytics is a tag-based web-analytics solution using JavaScript and cookies. Search engine spiders do not have the ability to execute JavaScript; therefore, Google Analytics doesn't have the ability to collect and display that type of information. Only log-file parsing web-analytics solutions like WebTrends can report on search engine spider activity (for now).

  • Non-"Web 2.0" mobile phones: A lot of standard mobile phones, like some BlackBerrys, the T-Mobile Sidekick, and the formerly popular Motorola Razr flip phone, do not have a web browser that can execute JavaScript. Therefore, Google Analytics cannot track individuals accessing a site from one of these types of non-"Web 2.0" mobile phones.

  • Cookies/JavaScript/images: If a user has any one of these three disabled, Google Analytics cannot collect its data and cannot display the data in reports. The natural follow-up question to this is almost always, "How many users block cookies/JavaScript/images?" The answer is not simple. Even though many different reports and studies have been performed, there isn't anywhere near a consensus on the percentage of online visitors with any one of these three disabled. I've read one report that fewer than 2 percent of all online users have disabled JavaScript, cookies, or images, but I've read another saying that the figure is close to 25 percent! As you'll read about later in this chapter, it's unwise and a waste of energy to concentrate on what's not being collected. By no means am I making excuses for web analytics, but in today's world, it's just not possible to be 100 percent accurate.

  • 500 rows of data at a time: The maximum number of rows that you can view at any one time in Google Analytics is 500. If you need to view the rows of data between 501 and 1,000, you'll have to hit the right arrow on the very bottom right of reports.

  • Five million page views for non-AdWords advertisers: Each month you're allowed to collect up to 5 million page views if you're not synced with a Google AdWords account in good standing. If you are synced, then there is no page-view limit.

  • Data sampling: Data sampling occurs when you segment a large volume of data. It sometimes happens if your date range is very large. Basically, data sampling means that only partial data can be shown to you when you have your segment applied. Plus-minus figures will appear next to each data point, informing you of the margin of error that Google Analytics is currently showing on your data. This is done in an effort to reduce the load that Google Analytics servers are placed under when large segments, sometimes paired with large date ranges, are requested. When you see your report with yellow-colored boxes labeled with plus-minus signs, as in Figure 8-3, try shortening the date range or removing the segment to undo data sampling.

  • Fifty profiles per Google Analytics account: Within each Google Analytics account, you're allowed a maximum of 50 profiles. If you need to create more than that, you will have to create a new Google Analytics account (I'll discuss this more in Chapter 10).

  • Profiles cannot be moved from one account to another: Because the pro-file is tied to the UA account number, and for workload reasons, profiles cannot be moved around.

  • Historical data: Historical data cannot be filtered or modified in any way after data processing occurs. What you see is what you'll always get.

  • Account recovery: Once an account or a profile is deleted, it cannot be brought back to life. You really should use extreme caution with your profiles and Google Analytics account to prevent an accidental account or profile deletion from occurring.

  • Cost data from non-AdWords sources: Cost-data cannot be imported from Yahoo! Search Marketing, MSN AdCenter, or any other pay-per-click advertising platforms at this time.

Data sampling in Google Analytics

Figure 8-3. Data sampling in Google Analytics

The limitations of other web-analytics solutions are as varied as you can imagine. Some limitations of Google Analytics are duplicated in Omniture SiteCatalyst, and some WebTrends limitations are default features of Google Analytics. For example, consider the "bounce rate" metric. This is not a supported metric in the latest version of WebTrends. But it's standard in Google Analytics. Lyris HQ (formerly ClickTracks; http://www.lyris.com/solutions/lyris-hq/web-analytics/) and TeaLeaf (http://www.tealeaf.com/) have features that no other platform offers. Only Google Analytics has native Facebook Markup Language (FBML) integration, while platforms like Klout and Twitalyzer are the only ones that measure your Twitter influence. As you can see, different platforms report and behave differently (which is a lead-in to one of the most common web-analytics misconceptions, discussed later in this chapter).

Web-Analytics Metrics

It's about time that I defined which metrics are used in web analytics. You've seen them throughout this book and you've also seen plenty of images of them in Chapter 7. Each metric has a purpose and a definition, and there's one organization out there that is helping every web-analytics vendor come up with consistent definitions and calculations for the wide variety of metrics they all report on.

The Web-Analytics Association

This not-for-profit organization is the de facto governing body of the web-analytics industry. It has published research documents and metric standards with the ultimate goal of influencing every web-analytics vendor to adopt consistent, standard naming conventions and metrics definitions across the board.

The group's 2008 Standards Definitions Volume I can be downloaded via this URL:

http://www.webanalyticsassociation.org/resource/resmgr/pdf_standards/webanalyticsdefinitionsvol1.pdf

When you have some time, read up on how this body classifies metrics into four categories and how it defines every one of the standard metrics.

Web Metrics and Dimension Definitions

Everything that Google Analytics collects is bucketed into one of two categories. It's either a metric or a dimension. A metric is a count of something, like a page view or a goal completion. A dimension is an informational text string, like a search engine, page title, or keyword.

Let's define the metrics first and then the dimensions.

Metrics

  • Visit: A visit to your website

  • Page view: A single page view of any page on your website in a visit

  • Unique page view: A unique, unduplicated page view of a single page within a visit

  • Pages/visit: The average pages viewed in each visit

  • Visitor: An actual website visitor (person)

  • Unique visitor: A unique, unduplicated individual who visits your website

  • New visits: The count of first-time visitors to your website

  • % new visits: The percentage of new visits in comparison to all site visits

  • Time on page: The amount of time spent on a page by a visitor

  • Time on site: The amount of time spent on the entire site by a visitor

  • Entrances: The number of entries into a particular page of your site

  • Exits: The number of exits from a particular page of your site

  • Bounces: The number of single-page visits by your visitors

  • Bounce rate: The percentage of single-page visits by all visitors to your site

  • Goal starts: The number of times a goal is started but not necessarily completed

  • Goal completions: The number of unduplicated times a visitor reaches a goal page (converts)

  • Goal value: The value of an individual goal, based upon goal completions

  • Per-visit goal value: The average value of each visit to your website, based upon goal completions

  • $ index: The average value of each page on your site, based upon goal completions and e-commerce revenue

Dimensions

  • Visitor type: The type of visitor (new or returning)

  • Hour of the day: The hours of each day (12:00, 1:00, etc.) at which a visitor visits your site

  • Page depth: The number of pages viewed in a single visit

  • Days since last visit: The number of days since a visitor's last visit to the site

  • Visit duration: The length of each visitor's visit

  • Language: The visitor's language browser preference

  • City: The visitor's originating city

  • Region: The visitor's originating region or state

  • Country/territory: The visitor's originating country

  • Sub-continent region: The visitor's sub-continent region (Western Europe, Central America, etc.)

  • Continent: The visitor's originating continent

  • Custom variable: The custom variable set on a visitor's computer

  • Mobile: The mobile device information for a visitor

  • Campaign: The advertising campaign that led a visitor to your site

  • Ad group: The advertising ad group that led a visitor to your site

  • Keyword: The advertising keyword that led a visitor to your site

  • Ad content: The advertising ad that led a visitor to your site

  • Ad slot position: The actual location of the ad on the search engine results page that led a visitor to your site

  • Source: The name of the website or marketing initiative that led a visitor to your site

  • Medium: The means by which a visitor accessed your site

  • Referring site: The site on which a link to your site appears that led a visitor to your site

  • Referral path: The specific page on the referring site that led a visitor to your site

  • Page: The page viewed on your site

  • Page title: The page's title meta-tag that was viewed on your site

  • Hostname: The URL(s) of the site(s) visited by your visitors

  • Landing page: The page used as the entry point into your site by a visitor

  • Exit page: The page used as the exiting point from your site by a visitor

These are a lot of definitions, but even by eyeballing this list you should have seen the word visits and the word visitors quite often. It's important throughout your web-analytics adventures that you make a clear distinction between the two. A visitor can have multiple visits, and multiple visits can be made by one visitor. The visitor is the actual person on the other end of the keyboard and mouse; the visit is the act of visiting a website.

Unfortunate and Common Misconceptions

Any industry has its disagreements and misconceptions, its myths to be debunked, and its arguments, ranging from tussles about minutiae to deep philosophical debates. In web analytics, it's no different. There are a host of misconceptions and misunderstandings about our industry and especially Google Analytics. No one is to blame for this — it's simply the nature of the beast. Many misconcep-tions stem from a lack of understanding of the basics of how things work and what things were designed to do. In this section I hope to cover the important misconceptions about web analytics and also about Google Analytics to put to rest any fears you may have.

Misconception: Web Analytics Is Accounting Software

Remember that Google Analytics isn't QuickBooks. Many, many folks feel that Google Analytics was created to collect 100 percent of all web visitors and match up to the exact decimal point the number of seconds spent by all users on the site. They feel that keyword searches, goal completions, and e-commerce trans-action and revenue data should be exactly what they see in their server logs, in their version of QuickBooks, and in their CRM database. Even the slightest deviation is often unacceptable and is used as a basis for reasoning that Google Analytics or any other web-analytics platform is broken.

The thing these folks should realize is that web-analytics programs aren't designed to function as hit counters or as accounting software that requires absolutely precise totals and tabulations. Web analytics has its limitations, and therefore it will never be able to provide such a level of completeness that you can compare it side by side with more precise tools. If a user has cookies or JavaScript or images or anything else blocked, the chances that Google Analytics or any other tool will be able to collect that user's info become zero. If a user has an extremely slow Internet connection and is making moves on the website before it fully loads (and before the JavaScript tracking codes load), you can kiss accuracy goodbye. If tags are not installed correctly, if URLs aren't tagged properly, or if other factors intrude, it's so long data.

These sound like simple hurdles to get over, right? Not necessarily. Many of you readers will have bosses to answer to, or key stakeholders to present to. Bringing up the topic of total accuracy (or its lack) may ruffle feathers and poke giant holes in conventional wisdom, and could cause you to commit unintentional career suicide. Totally perfect data is fool's gold: Even if you get there one day, what are you going to do with it? Is having 5,423 goal conversions in a perfect data-collection world going to change your rationale and make you do things differently than if you had 5,419 goal conversions instead?

Web analytics is designed to analyze trends, provide insights, and high-light key data that you can use on your site and in your marketing initiatives. The sooner your boss or your colleagues understand this concept, the easier everyone's jobs and lives will be, and the sooner you can make progress and improvements online.

Misconception: Google Analytics Publicly Shares and Sells My Information

In high school I gave a senior-class presentation on the JFK assassination. I remember talking about how it was impossible for Lee Harvey Oswald to have been the lone assassin. Yes, I'll believe a lot of things, but even I can't buy the theory that Google publicly shares and sells your information.

In the data-sharing section within your Google Analytics account, you have the option to anonymously share your data with Google services and other products. The data is used to create tools like those in the benchmarking section. Other than that, there's no evidence other than accusatory blog posts that Google is just waiting for the right time to sell everyone out and rake in huge profits. If there were evidence, it would probably trigger lawsuits and congres-sional investigation. There isn't any such evidence. Google isn't selling your data, period.

Misconception: A $125,000 Tool Will Solve All My Problems

I think Omniture SiteCatalyst, WebTrends, and CoreMetrics are excellent web-analytics platforms. They have features beyond anything Google Analytics can currently do, and in my opinion they provide the type of competition required for Google to continue to improve its own product. However, these tools come with a pretty hefty price tag, going all the way up to the six-figure range, depend-ing on your package and service selections.

What's at work here is the idea that you get what you pay for. Google Analytics is free; therefore, it can't be all that good, right? However, big expensive programs should be able to answer all questions, solve all problems, and even iron your pants for you. Unfortunately, while they may be good, they aren't that good either, no matter how much money you paid.

The truth is that it still takes a human being to use the tool and to extract insights and data from that tool. The web-analytics platform is only as good as the person using it. Think about a first-time driver getting behind the wheel of a Formula One racing car — he'd crash it within seconds! But take Michael Schumacher and put him behind the wheel of a Fiat, and I'm sure he'll get the most out of that Italian compact.

Just because you pay a lot of money for a web-analytics tool doesn't mean that it will raise your conversion rates and bring you more revenue. You're still going to need a great web analyst and a great marketing team to help you along the way.

Misconception: One Tool Should Be Enough for Me

People who buy a big-dollar web-analytics platform also feel sometimes that it should be the only tool they'll need. That same feeling can extend to Google Analytics — those who use it can begin to see it as the only thing around, and be unaware of what other tools are out there.

Over time you'll want to develop a repertoire, a bag of tricks that includes diverse tools and services. Just as you'd want to diversify your 401(k) stock portfolio, you'll want to diversify your web-analytics strategy to cover the most possible ground.

You have Google Analytics/Yahoo! Web Analytics/Unica NetInsight? That's great. You also need a customer-feedback service like Kampyle or 4Q by iPerceptions. If you use social media — and who doesn't these days? — you'll need to make use of Facebook and YouTube Insights, Twitalyzer (http://www.twitalyzer.com/), Klout (http://www.klout.com/), or paid services like SAS (http://www.sas.com/) or Radian6 (http://www.radian6.com/).

Your arsenal won't be complete until you dive into tools like HitWise (http://www.hitwise.com/), comScore (http://www.comscore.com/), Google Insights for Search (http://www.google.com/insights/search/), or Google Trends for Websites (http://www.google.com/trends/).

That is, until you want to start doing mobile analytics with tools like PercentMobile (http://www.percentmobile.com/) or Bango Analytics (http://www.bango.com/), or you need to measure blog statistics and performance with FeedBurner (http://www.feedburner.com/) or Sentiment Metrics (http://www.sentimentmetrics.com/). Oh, and don't forget about tools like CrazyEgg (http://www.crazyegg.com/) and clickdensity (http://www.clickdensity.co.uk/), which allow you to view click data on top of your website's pages.

As you mature in web analytics, your current lone tool will not be able to answer every question or measure every statistic that you ask it to. It's not the tool's fault; it's just not designed for it. So you'll have to expand your horizons and get out of your comfort zone to find new data sources with interesting and useful information. I'll highlight some of these in Chapter 17, but for now, check out any of the tools that I named in the previous paragraph for a good head start in what will be discussed toward the end of the book.

Misconception: Web-Analytics Tool X Should be Exactly the Same as Web-Analytics Tool Y

I often come across a website with more than one web-analytics package. More often than you may think, a website may be using Google Analytics, Yahoo! Web Analytics, WebTrends, and Omniture SiteCatalyst all at the same time. (Okay, maybe not exactly that scenario, but it's quite common to find more than one web-analytics tool installed on a site — simply view the source code or install a tool like the Web-Analytics Solution Profiler (WASP) tool for Firefox, to view what web-analytics tools are setting cookies on your machine.)

People also tend to compare the statistics from one platform with those from another, and wonder why the two systems don't match. A common conversation can start out like this: "So I'm looking at WebTrends and I see 4,350 page views for the month of March. But then we installed Google Analytics and we see 3,780 page views for the same month. Why is this happening and which one is right?"

From reading up to this point you know why this happens — tools collect, process, and calculate differently from one another. Some tools (like WebTrends) count search engine spider activity, while other tools (like Google Analytics) do not. Also, some settings, like filters, are applied on one tool and are not applied (or aren't possible) on another. But you knew that already. The next question may be the tricky one to answer: "Which one is right?"

The answer is that both of them are right. This is a tough pill to swallow, especially if it's a client you're talking to. But assuming that both platforms are installed and configured correctly, the logical conclusion is that there are two correct answers, even though the answers themselves are different. The key is to not "use the bigger number" all the time (which is easy to do with things like e-commerce transactions or goal conversions). Remember that web-analytics platforms are designed to analyze trends and are focused on cultivating insights — they're not a replacement for server logs or accounting software. Train yourself and your clients to think this way and to stop compar-ing statistics from one tool against those from another.

Misconception: Google Analytics Can't Handle Large Volumes (Because It's Free)

To the contrary, Google Analytics can handle large volumes of data just fine. The root of this misconception is the five-million-page-view limit per month for non-AdWords advertisers. This is simply a limit for those web analytics accounts that are not synced to an active AdWords account in good standing. If you wished, you could open up an AdWords account, sync it to your Google Analytics account, and have as many page views as your website can handle.

At the time of this writing, both Facebook and Twitter — two of the Internet's largest-volume websites — use Google Analytics on their sites. They wouldn't be using it if it couldn't handle volume.

Misconception: The More Data My Dashboard Has, the Better!

In Google Analytics you can add as many reports to your dashboard as you like, which you already know from Chapter 7. When you make an executive dash-board in Excel, there are also no limits to what you can do with it. Sometimes the dashboard is dictated by your CEO or VP, and you basically just have to follow orders.

But it's high time that folks started creating dashboards that made sense. Far too many executive dashboards, high-level overviews, or top-tier presentations include pages upon pages of stuff with charts, graphs, metrics, columns, rows, and data, until the view gets dizzy. The entire purpose of a dashboard is to have a high-level review of what's going on — not to provide every single detail. If detail is requested, then that can be provided in a separate tab, worksheet, or file, but don't pollute your dashboard with too much data. Save the good stuff for later. Unfortunately, far too often I see dashboards that remind me of an episode of Hoarders, with so much data piled in one small place that I wonder what I'm supposed to do first.

When I create a dashboard I try to think like a salesperson, whose objective is to make that sale or renew that contract. When the salesperson is putting together a presentation or sales pitch, it usually has at most a summary of bul-let points, some high-level information, and possibly one or two paragraphs of explanation. In essence the salesperson is creating a dashboard that will be reviewed and looked at by potential clients. Try this approach the next time you're creating or working on your dashboard — remove the clutter, cut the fat, and don't become a dashboard hoarder.

Misconception: Too Many Visitors Are Listed on the Top Exit Pages Report

Another one of the most common web-analytics misconceptions concerns the Top Exit Pages report, found within the content section in Google Analytics.

Take a look at Figure 8-4, where I show the top half of the Top Exit Pages report, purposely without listing the page names. What insights, actions, or intelligence can you derive from this information?

The top half of the Top Exit Pages report in Google Analytics

Figure 8-4. The top half of the Top Exit Pages report in Google Analytics

As I discussed earlier, visitors have to leave a site eventually. They can't stay on your site forever. Looking at this report and trying to get anything out of it other than some fun facts will drive you insane. You will think your homepage is "leaking traffic," and will make some radical change that won't work and will wind up breaking your CMS System, while simultaneously taking your website offline. If that's going to happen to you, at least do it after looking at the Top Landing Pages report, which is far more useful than this report could ever be even on its best day.

If you have colleagues, customers, or clients who use this report, please attempt to discourage them from doing so. Offer metrics like bounce rate and reports like Top Landing Pages as alternatives.

Misconception: You Should Use Year-over-Year and Decade-over-Decade Comparisons

When a year-over-year comparison is performed, I bite my tongue really hard so as not to say anything. It's even harder when I see comparisons ranging back a couple of years, or (as has happened) a comparison between log-file web stat online data from 2000 and 2010. I added this final piece of this section to cover just this topic.

Online data is like a jar of mayonnaise — it's perishable and it has a shelf life. Your data, after a certain period, is just not usable anymore. It could very well be rotten and ready to be thrown down the garbage disposal. On the Internet, things change really fast. In 2006 Twitter had just been born, and the topic of social media was new. Our offline and online worlds have completely changed in a relatively short time. Comparing web data from 2005 to web data from 2010 is like comparing Nielsen television ratings from 1950 and 2010.

There are a few reasons why you might compare year-over-year data, but only a few. If you know people who habitually compare visits from 2005 to visits from 2010 or page views from 2003 to page views from 2010, you might try to educate them.

Becoming a Great Web Analyst

So you now know how Google Analytics works, you know the definitions of a lot of metrics and dimensions, and you have read through a lecture on the common misconceptions, all while obtaining some insight into this new and exciting industry that we know as web analytics. Now, it's time to learn the tools of the trade for becoming a great web analyst for yourself or your organization.

Part of achieving greatness is knowing the common misconceptions that I just discussed. You know what they are, you know how to spot one when you see one, and you know what to do about it. You're already more ahead of the game than most. Now, what else is there to learn about?

Failing

If you haven't failed yet at doing online marketing or web analytics, then you probably haven't been doing it very long. If you have been doing it a long time without a failure, consider yourself one of the very few extremely fortunate ones. It's virtually inevitable that you'll fail sometime at measuring, implementing, or doing web analytics (or online marketing). You'll make a mistake, and it may even cost you time, resources, money, or, worse, your data.

Just as in sports, you learn more from your failures than from your successes. What I've learned from my failures is that I don't know everything, and I don't like to consider myself an "expert" or a "guru" (I actually don't like either term). But early on I thought I knew everything, until I made that inevitable screw-up that cost a client some important data. I learned the hard way that I don't walk on water, and it humbled me. This knowledge was reinforced soon afterward when I struggled to answer a client's web-analytics business questions, despite my very nice summary of web data and stats, which were of no help to the client.

You will fail in web analytics. Don't fear it — simply be prepared when it happens. Try not to get mad or frustrated — learn from the experience, store it away in your bag of tricks, and move on to the next assignment. And don't call yourself an "expert" or a "wizard" or any other self-promoting noun — there is always someone out there who knows more than you, or is going to know more than you someday.

Segmentation

The key to success in web analytics is segmenting your data. Data at an aggregate level can take you only so far and can answer only a few questions. You need to know which keywords came from Google, and which visitor types landed on your homepage. You need to be able to customize your data segments, as I'll show in Chapter 9. Without the ability to slice and dice your data, you won't be able to obtain the deep insights that are required in order to truly understand your website. Figure 8-5 shows you an example of a segmented report in Google Analytics.

Let's say that you start at the very top with all visits, and see that there were 50,000 visits to your site during August. This is the highest-level, most unseg-mented data possible, and it doesn't tell you anything at all other than how many visits you received. What action can you take on your website, AdWords campaign, or e-mail marketing blasts from knowing that number? You can't take any action, and that's the whole point of segmentation with web analytics.

So you start drilling into the Traffic Sources Overview report and find that 60 percent of your site's traffic was from search engines, 20 percent was direct traffic, and the other 20 percent was from referring sites. Your large and costly e-mail campaign isn't even a blip on the radar, but you see as you segment one level deeper that almost all your search engine traffic came from your Google AdWords campaign. Now, because you segmented, you know that your web-site's traffic is highly dependent upon AdWords, which means that you have no organic visibility and not very many people are typing in your website's URL directly. You have SEO and branding work to do.

Segmenting in Google Analytics

Figure 8-5. Segmenting in Google Analytics

But then you go even deeper and see that there are only two campaigns that are converting out of the nine campaigns you have running in AdWords. You drill down into those two campaigns to find the ad groups containing the three converting keywords in each that are the sole contributors of business for your online initiatives. You then segment those keywords by geo-location to find that the conversion rates are the best in New York City, Chicago, and Los Angeles, and segment again to discover that almost all purchases happen in the afternoon Monday through Friday. These purchases from those very lucrative keywords are only by returning visitors with fast connection speeds using Firefox. They take six visits and three days to purchase on average.

You then backtrack to find out that the non-converting keywords are money-guzzlers in need of strong optimization efforts, because they are bouncing from your pay-per-click landing page. The keywords that do make it but don't convert are leaving on Step 3 of the goal conversion funnel, and visitors are spending less than four minutes on the site on average. They're also searching for product catalog IDs on your site's internal search function, which is not working properly. This could very well be another reason they're not converting.

Yes, I love segmenting. And so should you. Or you can continue to use aggre-gate metrics like 50,000 visits to make your decisions. I'll be seeing you at next year's holiday party with all your clients in my portfolio.

Comparisons

I don't like doing year-over-year comparisons, but to become a great web analyst you're going to have to perform at least some month-over-month ones. You need to have some frame of reference, like the most recent month, to be able to tell you how your efforts are progressing (or regressing) over a reasonable period.

With Google Analytics you can click the Compare to Past checkbox within the date-range tool to perform a comparison of data between one date range and a previous one. Figure 8-6 shows a 30-day date-range comparison by search engine, to see if my conversion rates have increased or decreased over time. In Figure 8-6, at least for Google, I am showing a very strong increase in the conversion rates across the board in this particular set of goals, which reaffirms to me that whatever my optimization efforts were, they are paying off big-time. The interesting thing to note is the very first column on the left-hand side. You can't see the column heading in Figure 8-6, but it's the number of visits within my specified date range. Notice that I'm getting many fewer visits in the most current time period than in the previous time period, yet my rates of conversion are much higher. This is another great indicator of success — I've refined my site and my efforts in such a way that the traffic I am now receiving is much more ready to convert. In essence I've been able to weed out the irrelevant traffic, which wasn't doing anything of value anyway, so it's not that important.

Comparisons in Google Analytics

Figure 8-6. Comparisons in Google Analytics

Executive Dashboards/Reports

Yes, your dashboard should never be a candidate to appear on the show Hoarders. It needs to be clean, crystal-clear, free of clutter, and easy to read, not just by your client or colleagues, but by you as well. After all, you are the keeper of the flame, the web-analytics practitioner who will give advice to your client, boss, or yourself if you own your own website.

Try to find a way to show what's really important on one page. If you abso-lutely need to have more data for a weekly meeting or client report, attach it in subpages, leaving you with one executive summary page. Use a font size that's easy to read and big enough that no one has to squint. Use simple, plain color schemes and don't get too creative with your background logos or Flash pieces. Organize your columns and rows neatly and with common sense. Of course, double-check the validity and accuracy of your data — nothing ruins a dashboard like data that's incorrect.

If you're planning on using the Google Analytics dashboard, make sure you add only the reports that you need. Feel free to reposition or remove the default dashboard reports. Limit the reports you add to your dashboard to six (any more than six will generate another page when downloaded). Remember that there's nothing you can do about the Site Usage window or the trending graph — those two will always be there on your dashboard and cannot be removed.

Take the feedback that you receive about your dashboard or executive sum-mary constructively, not as a personal attack. Remember that the data you're presenting has to satisfy your clients or stakeholders, so ultimately they're in charge. Temper their requests to fit the framework of what you know works best in a dashboard, and try your very best not to let it get out of control.

Competitive Intelligence

I mentioned this earlier in this chapter as well as in Chapters 3 and 6. You are not the only fish in the sea. There are competitors doing what you do and thinking of ways to do it better than you. So you're going to have to incorporate some competitive-intelligence insights into your framework to stay on top of your rivals, and keep your clients or customers right at home with you.

If you have the budget for it, try using HitWise's wide range of competitive-intelligence services (www.hitwise.com). Figure 8-7 shows HitWise's weekly retail intelligence report, which is freely available on its website. You can view industry rankings arranged by the percentage of visits, the fast-rising search terms, and the industry search terms. This is great data that you can have for your site if you can afford the paid version of HitWise.

If you don't have the budget for it, try Google Insights for Search and Google Trends for Websites, which I'll talk about in Chapter 17. You can view hot trends and fast-rising search terms, but unlike in HitWise's freely available report, you can modify date ranges and segment by country, state, or city without even logging in to your Google Account.

HitWise weekly retail re port

Figure 8-7. HitWise weekly retail re port

Understanding Your Website

In Chapter 1, I talked about asking yourself the question "What is the purpose of my website?" To be a great web analyst, you're going to have to understand your website like no other person on the planet. You're going to need not only to answer the "purpose" question, but also to know what the secondary purposes of your website are, what functionality the site provides, and where all the nooks and crannies are. You need to understand your website's target audience, the site's content, and what actual visitors are viewing your website. This is aside from knowing how to perform analytics and calculate metrics. You're going to have to become that word that I hate — an expert — on your own website in order to truly be a great analyst.

Now, I understand that those reading this book may include consultants, marketers, and analysts in charge of looking at multiple websites. If you don't build in some time to learn as much as you can about the business, the web-site, and anything else you can get your hands on, you will eventually fail at doing web analytics. Take as much time as you possibly can to test-drive the site you're analyzing. Ask questions. Fill out test form submissions. Download files and play with the Flash applets. Make test purchases on the shopping cart and go through the order process. These experiences will help you become intimate with the website, which is what you'll need to do to effectively speak the language that your client or website owner is speaking, and to pro-vide the insight and recommendations necessary to succeed and be a great web analyst.

Flip over to the next chapter to learn about some advanced tools that Google Analytics has to offer to make you even better at web analytics than you already are.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.138.134.102