Top

Measuring Traffic

December 9, 2004

As a website runs, a log file is generated on the web server. This log file contains one line for each and every thing sent to visitors of the site — pages, images, etc. In most cases, this line also indicates what kind of browser it was (“User Agent”), what operating system it ran on (Windows XP, Mac OS X, etc.), the address of the user (“Host Address”) and if the user clicked on a link to get to the page that was sent, it will also have the address of that page (“referer” and yes, it’s routinely misspelled this way.)

As you can imagine, this makes for a large and unwieldy file very quickly. This file is typically processed and used to generate one or more “statistics” pages. Your web host may offer one or several varieties of statistics pages — some may be free, and others may cost money. Alternatively, you might install (or have installed) a statistics package of your own choice, or you might even download your log files and use any of several popular programs on your own computer to generate statistical information. (There is also another alternative, involving placing counting codes on each page, but we’ll ignore that for the sake of this discussion)

What all of these mechanisms have in common is that they have to make some “guesses” as to how to interpret the logs. As far as the web server is concerned, if a user viewed one page at 10:00 AM and another at 10:01 AM, those two events have nothing to do with each other. The statistics packages make this determination themselves based on time between pages, etc.

For this reason, two different statistics programs looking at the same set of logs will often have somewhat different results. In general, it’s a bad practice to try to compare them — changes in traffic in a single statistics program over time will yield much more usable information.

Years ago, it used to be common to speak of web traffic in terms of “hits”. Since each image on a page is a “hit”, as are things like style sheets, external Javascript files, etc., a “hit” isn’t terribly useful. Since one page might generate fifty “hits” per view, and another five, it’s usually a lot more convenient to speak of “page views” (someone once suggested that “HITS” is an acronym for “How Idiots Track Statistics”, which is crude, but probably near the mark.)

Another useful figure to watch is “User Session”. This approximates (due to the limitations mentioned above) how many users visited the site during a period. If a given user showed up at the site at 9:00 AM, and looked at pages until 9:45 AM, and returned again for 10 minutes at 4:00 PM, this should be counted as two “User Sessions”.

In general, things your statistics program should be telling you include:

  • Page Views
  • User Sessions
  • Number of views per page
  • Number of times users entered the site on a page
  • Number of times users exited the site on a page
  • Referring pages, and how many users they sent
  • Number of hits per each search engine
  • Keywords and phrases used at search engines
  • Amount of data transfered (this is particularly important if you pay for your hosting by transfer volume)

Regardless of the statistics package you use, whether it’s provided by your web host or one you download and run on your own computer, it’s vitally important that you learn how to read your site statistics, and do it regularly. Otherwise, you have no tools to gauge how well your site is performing, and how well you are promoting your site — whether links from other sites are actually sending you traffic, whether search engines are finding your most important pages, etc.

Rate this:
2.8
Like this article? Share it!
  • Digg
  • Sphinn
  • del.icio.us
  • Facebook
  • Mixx
  • Google
  • Reddit
  • StumbleUpon

Comments

Got something to say? [privacy policy]





Possibly Related


Traffic and the Average Website How can we categorize traffic for the average small business web site? It's pretty simple, actually -- the average small business web site doesn't get any traffic to speak of. Looking at the statistics for November, 2004 for a typical site, we find that it got 774 page views -- that's about 26 per day. Of those 774, 92 were from “robots” -- automated processes run by search engines to update their index, and by spammers looking for e-mail addresses (which is why you never put a “live” e-mail address on a web site). Of the remaining 682 page views, 119 belonged to the site owner. That leaves 563, or fewer than 19 a day. 80 of these page...


Comparing Site StatisticsI recently had a site owner who was puzzled over the different numbers he was receiving from two different statistics packages. It's no wonder he was confused -- the fact is that no stats software can be compared with any other to any significant degree of accuracy. He was looking at numbers for both "hits", and "sessions". Hits are the number of elements the web server served; if a page has a style sheet and four graphic images on it, someone loading that page will generate 1 (HTML) + 1 (Style Sheet) + 4 (images) = 6 hits. If it has a Javascript included file, then it will generate an additional hit -- but only if the user has Javascript...


Types of Pages - User’s ViewA typical well-designed small business web site will tend to have pages falling into a handful of categories, each with its own particular function. There are of course two fairly different views of this, the “user oriented view” and the “search engine oriented view”.  Having a successful site requires considering both, as one brings you traffic, and the other retains that traffic and converts it to sales.  This article focuses on the “user oriented view.” Home Page The home page is essentially the “main navigation” page of your web site.  It's where people who type in your domain name, or follow a link from another site will typically arrive first.  Most search engine users will arrive on a page somewhere...


Recommended Site LayoutThe following diagram shows a typical business site layout, and how it interacts with various sources of traffic.  This layout is designed to take maximum advantage of adding content to increase user value, retention and search engine traffic, while providing opportunities to convert that traffic to prospects and customers. Pages on your site Home Page The home page will typically be the entry point for type-in traffic (people who type in your domain name) and bookmarkers.  Links from other sites (such as directories) may also target the home page.  Users entering at other points in the site will often go to the home page to locate other features of interest.  The predominant use of the home page is as an...


What is SSL (the “little padlock”)? SSL (“Secured Socket Layer”) is a protocol used to encrypt the communication between the user's browser and the web server. When SSL is active, a “little padlock” appears on the user's browser, usually in the status line at the bottom (at the top for Mac/Safari users.) This assures the user that sensitive data (such as credit card numbers) can't be viewed by anyone “sniffing” the network connection (which is an increasing risk as more people use wireless networking). Common web site owner questions about SSL: How do I get the little padlock on my site? To get the little padlock, your site must have an SSL Certificate from a Certificate Authority. Once an SSL Certificate has been purchased and...

Bottom