Cool Custom Report – “True Referrers Of Interest”
A quick look at this custom report shows me whether somebody, somewhere, has suddenly linked to my site. This question came up on the Webtrends Forum this week and I thought it would make a good post. It is a report that I have found to be very valuable.
Basically, it’s a report that shows referring sites (or, if you wish, domains or even referring pages) … but leaves out the overwhelming fog of search engines, paid search hits, sites displaying your ads, email traffic, and more. I just want to see bloggers or news articles or other meaty, and probably transient, mentions.
Here’s how to make the report.
As with all Webtrends custom reports, there are three components – the row labels (a.k.a. the “Dimension”), the column headings (a.k.a. “Measures”), and the filters (usually, stuff you don’t want to be in the rows).
The Dimension
I like to have two versions of this report. One uses the dimension “Referring Sites” (which is more useful than “Referring Domains”). But I always do one with “Referring URL”. Why? Because then I usually can click on the URL and instantly see the page that had the link that produced the traffic.
(Note: some referring URLs will have the important page-specific address in the query parameters, i.e. after the “?”. Webtrends strips the parameters out by default in the Referring URLs report. If you want your Referring URLs report to display those parameters when they occur, you need to see this post about referring page parameters. )
The Measures
I like to use both visits and page views, plus a custom calculated measure that is the ratio of the two (pages per average visit). That way, I can see the amount of traffic and also have an idea of the length of those visits.
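If you want to see the arithmetic behind that calculated measure, here is a minimal Python sketch. The site names and numbers are made up for illustration, and Webtrends does this calculation for you once the measure is defined; this only shows what the ratio means.

```python
def pages_per_visit(page_views: int, visits: int) -> float:
    """Average page views per visit; 0.0 when there were no visits."""
    return page_views / visits if visits else 0.0


# Hypothetical report rows: (referring site, visits, page views)
rows = [
    ("someblog.example.com", 40, 180),
    ("news.example.org", 12, 18),
]

for site, visits, views in rows:
    ratio = pages_per_visit(views, visits)
    print(f"{site}: {visits} visits, {views} page views, {ratio:.2f} pages/visit")
```

A referrer with lots of visits but a ratio near 1 is probably sending people who bounce; a high ratio suggests the mention brought in readers who stuck around.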
The Filters
Back to the purpose of this report. What I’m after, overall, is a look at the “leftover” referring pages – what’s left after removing search engines, paid search hits, sites where somebody clicked on one of your ads, and email traffic. Also, maybe, the sites of my client’s dealers and distributors. Whatever.
There are two kinds of filters used here: those based on the referrer, and those based on the entry page URL. You’ll probably need both.
They are all VISIT filters, not hit filters (you choose the filter type on the first screen of the filter configuration). And they are all applied to the report as EXCLUDES, of course.
You can make a lot of individual filters or, if you wish, one or more big filters that have many individual components under the “Filter must match one or more criteria” button. It doesn’t matter which way you do it. But stay away from the “Filter must match ALL criteria” button.
Filters Based on the Referrer
- Filtering out search engines: In the “Filter On” field, choose “Search Engine”. (This will rely on Webtrends’ collection of search engine identifiers, which is pretty good, and it will exclude a lot of organic search as well as most (but probably not all) of your paid search.)
- Filtering out search engines that Webtrends may have missed: Filter on “Referring Domain” or “Referring Site” then enter a list of the [search engine related] domains you’ve seen in your reports that you want kept out of this report. But before going to a lot of trouble with this one, see the next bullet below.
- Filtering out miscellaneous search subdomains on many non-search sites: Filter on “Referring Site” (not Referring Domain) then enter strings like “search”. This will get rid of search.ask.com, search.sweetim.com, www.delta-search.com and many others. Most of them put “search” into their site name, making them easy to identify.
- Filtering out other domains that you don’t care about: Same as the above, but you’ll be entering whatever domains you’ve seen in your reports that you don’t want. As an example, one of my clients filters out their global affiliates, i.e. MySite.mx or espanol.mysite.com. Another example is their dealers and distributors: DealerSiteWhatever.com. Another possibility is Google Maps, which is maps.google.com.
- Filtering out visits from social media: Filter on “Referring Site” or “Referring Domain” with the strings Twitter.com or Facebook.com, and so forth.
- Filtering out most (not all) traffic from links in emails: Filter on “Referring Site” (not Referring Domain) then enter strings like “mail.” and others that indicate web-based mail servers, such as Yahoo Mail which comes from many servers with site names like “usmg6.mail.yahoo.com”. You’ll have to do a little research to optimize this, where “research” means looking at the report for ones to cull then adding them to your filter definitions. (Note that people using email clients such as MS Outlook will not be seen by this filter … but, on the other hand, Outlook traffic has a blank referrer anyway, so it’s not a problem in the first place.)
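To make the logic of the referrer-based filters concrete, here is a rough Python sketch of the same idea applied to a list of referrer URLs. It is not how Webtrends implements filters, and it does not reproduce Webtrends’ built-in “Search Engine” list. The substring and domain lists are illustrative assumptions taken from the examples above; you would build your own from what shows up in your reports.

```python
from urllib.parse import urlsplit

# Illustrative lists only (assumptions); extend them as junk creeps in.
SITE_SUBSTRINGS = ("search", "mail.")  # matched anywhere in the full site name
EXCLUDED_DOMAINS = ("twitter.com", "facebook.com", "maps.google.com")


def referring_site(referrer_url: str) -> str:
    """Return the lowercased host name of a referrer URL, or '' if none."""
    return (urlsplit(referrer_url).hostname or "").lower()


def is_excluded(referrer_url: str) -> bool:
    """True if this referrer should be kept out of the report."""
    site = referring_site(referrer_url)
    if not site:
        return True  # no referring site at all, so nothing to show
    if any(text in site for text in SITE_SUBSTRINGS):
        return True
    return any(site == d or site.endswith("." + d) for d in EXCLUDED_DOMAINS)


referrers = [
    "http://search.ask.com/web?q=widgets",
    "http://usmg6.mail.yahoo.com/neo/launch",
    "https://www.facebook.com/somepage",
    "http://someblog.example.com/2013/06/nice-widgets",
]
for url in referrers:
    print("exclude" if is_excluded(url) else "KEEP   ", url)
```

Matching on the full site name rather than the domain is what lets a single string like “search” or “mail.” catch a whole family of subdomains.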
Filters Based on the Entry Page URL
Big note: These are available to you only if you put marker parameters in the landing page URLs of your marketing traffic. What, you don’t? You should.
- For example, if your email blasts contain the parameter “WT.mc_id=email”, you can remove all email traffic from this report with one filter based on “Entry Page” where the Page is “*” (NOT regular expression match) and the URL parameter is “WT.mc_id” equal to “email” (regular expression match would probably be best).
- It’s the same for banner traffic, which you might have marked with “WT.mc_id=Bannerxxxxxx”.
- Or your affiliate traffic, which you might have marked with “Affiliate=nameofaffiliate”.
- Or your paid search traffic, which might be marked with “WT.srch=1” or “gclid=<something>” or “WT.mc_id=PPC”.
- Filtering on “WT.mc_id” values of “*” might do the trick for all your marketing channels – email, banners, paid search, affiliates – if that’s how you have it set up.
- I’m constantly adding to the exclusion filters as junk creeps into the report. It takes a couple minutes a week.
- Most importantly, I get to see mentions almost immediately and maybe can reply to them or thank the person for the mention in a timely way.
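The entry-page filters boil down to “does the landing URL carry one of my marketing markers?” Here is a small Python sketch of that test, outside of Webtrends. The parameter names come from the examples above; the www.mysite.com URLs are made up, and your own tagging scheme may well differ.

```python
from urllib.parse import parse_qs, urlsplit


def is_marketing_entry(entry_url: str) -> bool:
    """True if the entry page URL carries one of the marker parameters."""
    params = parse_qs(urlsplit(entry_url).query, keep_blank_values=True)
    if "WT.mc_id" in params:  # any campaign ID: email, banners, PPC, ...
        return True
    if "Affiliate" in params or "gclid" in params:
        return True
    return params.get("WT.srch") == ["1"]  # Webtrends paid search marker


entry_pages = [
    "http://www.mysite.com/landing?WT.mc_id=email",
    "http://www.mysite.com/?gclid=abc123",
    "http://www.mysite.com/products?color=red",
]
for url in entry_pages:
    print("exclude" if is_marketing_entry(url) else "KEEP   ", url)
```

Testing only for the presence of “WT.mc_id” mirrors the catch-all “*” filter: one rule covers every channel, provided every channel is tagged that way.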
Tags
custom report, referrers, Webtrends
June 2, 2013 No Comments
If you have server logs, use Excel for a quick look at details of individual visits
This question just came up on the Webtrends forum and I thought I’d share a quick way to look at individual visits, using Excel plus SDC logs.
Note that I said “using logs.” This is the raw data that Webtrends processes when it does the aggregation that it was designed for. If you use OnDemand, you have to ask Webtrends for the SDC logs. There’s a fee. If you use OnPremises, you have server logs (IIS or Apache logs) or you have SDC logs. Somewhere.
These instructions work for SDC logs. These instructions also assume the logs contain no more than one day of data (if more, you’ll have to change the sorting to keep the days from mingling). And I’m also assuming your logs aren’t too huge. Excel can choke.
So … here are instructions for a quick and dirty sorting into visits. If anybody finds it useful, let me know and I’ll come back and add some screen shots.
It seems endless, but that’s because I’m listing every little step. It really will take only about a minute of actual work, not counting waiting time while Excel works or the time spent reading the instructions.
- Open Excel
- File >> Open
- Browse to your log file and select it
- The Text Import Wizard will open:
- Choose Delimited, then Next
- Choose Space; leave everything else as is, then Next, then Finish
- Wait while Excel reads and opens the log. If your log is too big for Excel, it will tell you so (but will load the log partially)
- I strongly recommend that you keep only the first few hundred thousand rows or so and delete the rest. You’ll reduce the chances of crashing Excel.
- Boom! About 20 columns of stuff will appear when Excel is done.
- Delete the first four rows; they will be #Remark, #Software, #Version, and #Date. Keep the #Fields row.
- Delete the cell that says “#Fields:” and shift the rest of the row to the left. Right click on A1, choose Delete, then Shift Cells Left. This will put the right headings over the right columns.
- Make a new column that contains, for each row, both the IP address and the User Agent string, as follows:
- Go to row 1 in column S (the first empty column) and enter this formula: “=concatenate(C1,L1)” and hit Enter.
- The cell should now contain this: “c-ipcs(User-Agent)”
- Click on that cell to highlight it, then hover over the lower right corner of the highlighted cell where there is a very small black square. The cursor will change to a +. Double-click.
- Column S should now auto-fill downwards with a concatenation of the IP and user agent string. Don’t worry about reading and understanding these. The point is that these are probably (but not 100% of the time) each unique to an individual visitor, which is what you are after.
- Highlight all columns (A-S), then open the Data tab and choose Sort. The Sort dialog box will open.
- Make sure the box at the upper right is checked (“My data has headers”)
- Click on the “Sort by” dropdown and choose the very last item, which will be “c-ipcs(User-Agent)”
- Click on Add Level to get another sort config line
- Click on the “Then by” dropdown and choose “Time”
- Click on OK and wait for the sorting.
- You now have your visits, with all the hits in each visit sorted by time from first to last.
- You’ll want to make it a little more intelligible by putting a divider line between individual visits, as follows:
- Highlight your block. I do it by clicking on A1, then Ctrl-Shift-DownArrow, then Ctrl-Shift-RightArrow.
- Assuming you are still in the Data tab, select Subtotal
- The Subtotal dialog box will open. Fill it in as follows:
- “At each change in:” open the dropdown and choose the very last item, which will be “c-ipcs(User-Agent)”
- “Use function:” Count
- “Add subtotal to:” check the box for c-ipcs(User-Agent) which, again, will be at the very end
- Leave Replace current subtotals and Summary below data checked
- Click on OK and wait. There’s a progress bar in the status line at the bottom if you feel panicky.
- Excel has inserted a new row between “visits” and also added some controls over in the left margin, labeled 1,2,3. Click on 2.
- Excel is now showing you only the inserted rows. The block should still be highlighted. Don’t worry, the rest of the data is still there. Look at the row numbers over on the left if you’re nervous; you’ll see lots of missing numbers.
- Press Alt-semicolon (Alt-;) and the screen will change subtly. You’ve just selected only the visible cells.
- Click on the Paint Can to color all these rows, for example black
- Go back to the controls “1,2,3” on the left and click on 3.
- Excel is now showing you all rows, with a colored divider row between “visits.”
- At this point, I usually move everything to a new sheet to get rid of the underlying mechanics, as follows:
- Highlight the block
- Copy to Clipboard (I always just use Ctrl-C)
- Open a new sheet, select A1, then paste (Ctrl-V)
- Delete column S if you like
- Go back and delete the sheet you were just on
Now you’re ready to look at individual visits line by line, in the order in which they occurred in the visit. You’ll probably want to concentrate on columns G and H, which are the URL stem and the URL query parameters for each hit. Have fun, or good luck … whichever applies.
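If Excel chokes on your log, the same grouping can be sketched in a few lines of Python. This is a rough equivalent of the concatenate-and-sort steps above, not a Webtrends tool. It assumes the log is space-delimited, has a “#Fields:” header line, and that the field names include “time”, “c-ip” and “cs(User-Agent)”; check the #Fields line of your own log. The sample lines are invented.

```python
from collections import defaultdict


def read_sdc_log(lines):
    """Yield one dict per hit, using the #Fields line for column names."""
    fields = []
    for line in lines:
        line = line.strip()
        if line.startswith("#Fields:"):
            fields = line.split()[1:]  # drop the "#Fields:" label itself
        elif line and not line.startswith("#"):
            yield dict(zip(fields, line.split()))


def group_hits(lines):
    """Group hits by IP + user agent, each group sorted by time."""
    groups = defaultdict(list)
    for hit in read_sdc_log(lines):
        groups[hit["c-ip"] + hit["cs(User-Agent)"]].append(hit)
    for hits in groups.values():
        hits.sort(key=lambda h: h["time"])  # hh:mm:ss sorts correctly as text
    return dict(groups)


# Invented sample with a shortened field list, for illustration only.
sample = """#Fields: date time c-ip cs-uri-stem cs-uri-query cs(User-Agent)
2013-03-24 10:05:00 10.0.0.1 /b.html - Mozilla/5.0+(Windows)
2013-03-24 10:01:00 10.0.0.2 /a.html - Opera/9.80
2013-03-24 10:02:00 10.0.0.1 /a.html - Mozilla/5.0+(Windows)
""".splitlines()

for key, hits in group_hits(sample).items():
    print(key)
    for hit in hits:
        print("   ", hit["time"], hit["cs-uri-stem"], hit["cs-uri-query"])
```

To run it on a real file, pass an open file object to `group_hits` instead of the sample list. Like the Excel version, this assumes one day of data per log.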
A few notes:
- If the same “person” had two visits, they will be jammed together in chronological order. Keep an eye on the Time column. If there’s more than 30 minutes between one hit and the next, Webtrends is probably seeing this activity as two visits.
- You’ll see a lot of visits with nothing in the Cookie field on the first hit. These are first-time visits. Don’t worry, Webtrends sorts it out just fine when it analyzes. Using the IP-User Agent method is a quick-and-dirty way to get around the complexity of cookies (which are, in fact, in the Query Parameters and not in the so-called cs(Cookie) field of the logs).
- Sometimes SDC hits are virtually simultaneous, i.e. two hits happen in the same second of time. It’s possible that Excel will sort those two hits into the wrong order (it doesn’t know any better; hh:mm:ss is all it has to work with).
- If you want to get fancy with logs, download Microsoft Log Parser and learn how to use it.
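The 30-minute rule in the first note is easy to apply by eye, but here is a minimal Python sketch of it for anyone who wants to automate the split. It assumes you already have one visitor’s hit times, sorted, as hh:mm:ss strings from a single day. The 30-minute threshold is the one mentioned above; how Webtrends actually sessionizes may differ in the details.

```python
from datetime import datetime, timedelta

VISIT_TIMEOUT = timedelta(minutes=30)


def split_into_visits(times, timeout=VISIT_TIMEOUT):
    """Split one visitor's sorted hh:mm:ss hit times into separate visits."""
    visits = []
    previous = None
    for text in times:
        current = datetime.strptime(text, "%H:%M:%S")
        # Start a new visit on the first hit or after a gap longer than timeout.
        if previous is None or current - previous > timeout:
            visits.append([])
        visits[-1].append(text)
        previous = current
    return visits


hit_times = ["10:00:00", "10:10:00", "10:45:00", "10:50:00"]
for number, visit in enumerate(split_into_visits(hit_times), start=1):
    print(f"visit {number}: {visit}")
```

Here the 35-minute gap between 10:10 and 10:45 starts a second visit, so the four hits come out as two visits of two hits each.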
Tags
log files, SDC & data collection, sessionizing, Webtrends
March 24, 2013 15 Comments
Cool Custom Dimension: Height of the visitor’s browser window, in pixels
If you’re concerned about part of your page being below the fold, why not check on your suspicions using your site data?
In other words, look for a correlation between your KPIs and the height of the window in which your site is being viewed.
This post is about how to create a custom dimension for the height of the viewing window being used by your visitor. Note: we’re not talking about the full screen height — we’re speaking of the much more useful height of the viewable window area.
The Webtrends SDC tag collects this information in the parameter WT.bs (bs stands for something like Browserwindow Size, I suppose). There is a small problem with WT.bs — values are recorded as a combination of browser width and height, for example “1200x700.”
If we’re interested in only the height, we need just the second part, after the “x“. We can pull out just the height using one of the advanced features of dimension configuration that allows us to extract the part after the x.
Here’s how.
Create a new custom dimension definition.
In the General tab:
- Give it a name (Browser Window Height? Viewport Height?)
- Give the report column a name (Height? Browser Window Height? Height in pixels?)
In the Based On tab:
- Value to Base On is “Query Parameter”
- Parameter Name is “WT.bs”
- Click on the “Advanced” button
- Select the Regular Expression radio button and enter this in the regex field. This is how the part after the “x” is pulled out and separated:
([0-9]+)$
In the “When to Collect Data” tab:
- Choose either “First Occurrence in Visit” or “Last Occurrence in Visit” depending on your instincts. (I use First Occurrence.)
Save.
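If you want to convince yourself of what that regular expression does before waiting for an analysis to run, here is a quick check in Python. Python’s regex engine is a stand-in here, on the assumption that Webtrends treats this simple pattern the same way; the sample values are made up.

```python
import re

# Same pattern as in the dimension definition: the digits at the end of the value.
HEIGHT_PATTERN = re.compile(r"([0-9]+)$")


def window_height(wt_bs: str):
    """Return the height from a WT.bs value like '1200x700', or None."""
    match = HEIGHT_PATTERN.search(wt_bs)
    return int(match.group(1)) if match else None


for value in ["1200x700", "1920x955", ""]:
    print(repr(value), "->", window_height(value))
```

The `$` anchors the match to the end of the value, so only the digits after the “x” are captured and the width is ignored.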
There’s your custom dimension. Use it in a report that has your favorite measures – conversions, or perhaps a measure of success in those items that you think are usually below the fold.
For best results in this kind of report, it’s probably a good idea to filter it to remove mobile devices. Or to include only mobile devices, if that’s your interest. Mobile users seem to have different expectations about the fold and seem to be a lot more likely to scroll down. At least, that’s what my instance of this report seems to tell me. Removing mobile users gave me a clearer pattern.
Tags
browser height, pixels, viewing height
January 31, 2013 4 Comments
The new Ad-Hoc Data Re-Analysis feature
For Webtrends OnDemand users, this is maybe the best thing since Dashboards.
For Webtrends OnPremises users, this might remove one of your reasons for not switching to OnDemand.
So …
About a month ago, Webtrends quietly added the ability to kick off short-term rollbacks of your profiles in the UI. If you add a new custom report or content group, or correct a filter, you get to re-analyze a bit of your data with the new settings.
And, the short-term rollbacks (up to three days) are FREE and UNLIMITED. I am so amazed at that. Thank you to whoever at Webtrends was smart enough to realize how very attractive that is.
This capability is called “Ad Hoc Analysis” and seemed to just suddenly appear one day as a new choice in the profile-level dropdown menus. Personally, I would have called it “Ad Hoc Re-Analysis,” but I’m not complaining.
Here’s how it appears in the profile dropdown:
If you click on it, you’ll see up to five possible time-period rollback choices:
- Those above the red line (red line added by me) are FREE: today (which means yesterday’s data, which would have been analyzed in the wee hours of today), yesterday (yesterday’s analysis, which covers the day before), and yesterday-plus-the-day-before. (Note the time specs: they are UTC and you have to do a little mental time-shifting.)
- Those below the red line cost “credits”. Every OnDemand account is given a few free credits for the year, and others can be bought. Generally, a month of rollback costs one credit. Technically, you can roll back to month-before-last, but for that period you have to go through technical support.
- Rollback and re-analyses start as soon as you click the “Analyze” button. You can cancel a re-analysis any time up until it’s finished.
- After your re-analysis finishes, you can kick off another one for the same profile, if you want.
- If you enter an email address, the program will send you an email when it’s done.
- The program gives you a pretty good estimate of how long it will take. We’ve had some go considerably faster than the estimate, and only a few have taken longer.
- Note: While a re-analysis is happening, you can’t change anything in the configuration of the profile.
There are still a couple of, um, unpredictabilities in this very young feature. The time period choices aren’t always consistent and they especially get a little wonky at the end of a calendar month. The interaction with the normal analysis cycle of your profile is not predictable, or at least not understood by us Outsiders. Once in a while one of the day-rollbacks will want to charge you a credit for the privilege, but that corrects itself eventually and it goes back to being free. And, sometimes there’s a choice of killing all previous data, and sometimes you are not shown that choice. (Hint – even if it’s offered, you probably don’t want to do it!)
… Why is this wonderful?
Two things: 1) testing new configurations quickly, 2) fixing screwups, especially the kinds of screwups you notice after somebody else has changed a campaign or site page and didn’t bother to tell you right away.
Regarding testing, if you are an OnDemand user you probably already know that if you add a new custom table, change a filter, add a content group, and so on, you have to wait until the next morning to see if it worked as expected. And if it didn’t, you get another 24 hours to change it … meanwhile you have hours or days of missed analysis. No longer. With Ad Hoc Analysis, you can see if something worked in just an hour or two. You’ll look more agile and smarter to your end users.
Regarding screwups, if you learn about something within a couple of days of it happening, you can correct the configuration or add the new campaign filter or whatever it is you need, and then re-analyze everything back to and before the screwup. Or, if something spiked and you want a closer look at the spike with an extremely specific new table, you have a couple of days to get that together. Again, you’ll look smarter to your end users, or at least you’ll avoid a few episodes of looking dumb.
Tags
Re-analysis, rollback
October 6, 2012 3 Comments
Hey, you changed MY report! Hey!
When several different people have editing privileges for configurations in an analytics tool, disaster awaits if there aren’t ground rules.
Here are some tried and true ground rules we have published among ourselves to avoid being shorthanded due to one of the analysts being in jail on grounds of assault on another analyst.
Analyst Ground Rules
- Create a sandbox profile for each analytics team member. Each analyst can use theirs (and ONLY theirs) to create and test new or modified configurations.
- Team members should put their initials somewhere in the “Name” field of all custom items they create. That means custom reports, dimensions, filters, measures, content groups, path analysis definitions, URL search & replace rules, URL rebuilding rules, templates, dashboards, or profiles. We Outsiders happen to like having the initials at the beginning of the name for alphabetization purposes, but there are other approaches.
- In addition, it might be a good idea to create a Category for each analyst. Finding one’s own items then becomes easy when looking at long lists of configs – just sort the list by Category.
- Nobody is allowed to alter anything that has another analyst’s initials, unless there is a damn good reason, some communication happens, and the reason is documented somewhere.
- The Change Comment box that pops up whenever something is saved is not optional. Comments should be done every time, and should be complete and clear enough to be understandable six months from now. This is part of the ground rules among analysts, but it’s also an important practice even if you are the only analyst around.
- Anybody on the team can, of course, freely re-use something with another analyst’s initials on it, by applying those items as-is (no changes!) to one’s own reports or profiles. However, everybody should remember that the owner of a configuration might change it, so be prepared for that.
- There are many out-of-the-box reports that don’t have any initials. They appear in config lists as plain black type, no underlines or hyperlinks. Luckily, they cannot be edited. So, if someone wants to modify an out-of-the-box report, they’ll have to copy it, add their initials, then do their modifications.
- There are two, exactly two, out-of-the-box items that CAN be edited. Be ultra-careful with these: Paths from Entry and Content Paths from Entry. It’s possible to edit the number of steps in these. There is only one of each allowed in all of your Webtrends installation; they cannot be cloned. So, be aware that if somebody changes the number of steps in the one-and-only master configuration, they will be erasing all already-analyzed data in all instances of those from-entry reports in all profiles. Get together, decide as a group on the number of steps you want in all future uses of these reports, and keep the above in mind.
- Finally, remember the license constraints if you are using OnDemand. Your Webtrends OnDemand license allows you a certain number of profiles and a certain number of “custom report tokens,” which are the sum of all custom reports that are assigned to the sum of all profiles. Get to know the rules of how they are tallied and manage accordingly.
Let us know if you have improvements or alternatives. And if you want to wing it without implementing ground rules like the above, don’t contact us to bail you out of jail!
Tags
management, naming, paths from entry
October 6, 2012 2 Comments

