Skip to: Content
Skip to: Site Navigation
Skip to: Search



Why is Google's Market Share in Hitwise Data So Much Higher than in comScore's?

By Andrew Lipsman - January 19, 2009

I occasionally get asked this question, and I usually answer that different methodologies may produce different estimates. But I know that doesn’t really get at the heart of the matter. After all, given how many searches are conducted across the U.S. each month, shouldn’t the search share number from the various measurement services be pretty close to each other?

Let’s examine the Google search share figures in November 2008 for the three largest online measurement services, in order: comScore, Nielsen and Hitwise.

share-of-search1.gif

comScore and Nielsen are within about half a percentage point of each other, while Hitwise shows Google at a substantially higher 72%. Now, why is it that Hitwise consistently shows a search share about 8 percentage points higher than comScore and Nielsen? A behavior as ubiquitous as Internet search yields extremely robust sample sizes, which means that there should be little variation between the services if each was sufficiently representative of the U.S. Internet audience.

Let’s investigate why that may not be the case…

Both comScore and Nielsen build their Internet panels by recruiting consumers directly and then installing patented software on the panelists’ computers to collect their online behavioral data. comScore is the only service to also include complete AOL proprietary activity. This ensures that all Internet Service Providers (ISPs) are correctly represented in the comScore database. However, Hitwise obtains its clickstream data by purchasing it directly from those ISPs that are willing to sell it to third parties. It’s unclear exactly which ISPs or how many provide their data to Hitwise, since the company does not disclose that information. However, what is publicly known is that, mainly because of privacy concerns, the vast majority of ISPs are, today, unwilling to sell their data to third parties. The most prominent one – and likely only one of the major ISPs – that is willing to sell its data is United Online/NetZero. Interestingly, an analysis of Google’s market share across ISPs (using comScore’s data which represents all ISPs) suggests that the Hitwise data may be mainly reflecting Google’s performance at United Online/NetZero, an ISP composed primarily of dial-up users.

The analysis below shows how Google search share varies across ISPs in the comScore panel:

share-of-search2.gif

Here, we can see that Google’s search share peaks among United Online/NetZero users at about 73% – a number that is strikingly similar to what Hitwise reported as Google’s overall U.S. share in November. The other ISPs shown above are all major broadband providers, who do not sell their data to third parties. Google’s market share in each of them is significantly lower than within United Online/NetZero. Note, especially, that Google’s share is lowest among AOL ISP users, an ISP that Hitwise does not measure.

So, is it possible that Google’s high share as reported by Hitwise overstates reality and is simply a result of the fact that Hitwise’s estimates are predominately based on ISP data from United Online/NetZero, the ISP where Google’s market share happens to be highest?

What do you think?

Comments (4)

Very interesting... Would be useful to know if there is any similar insights outside the US.

I'm specifically interested in the UK (where ISPs seem to be much keener to sell on their user data, if recent developments with Phorm are anything to go by) but I'm sure that other markets would be equally interested in their own regions!

Cheers,

Don Patti:

There are lies, damned lies, and HitWise. Nice analysis!

How many people do you and Neilsen survey, and what do you do to ensure they are representative of the general population's?

And why do you have such a hard Captcha?

Andrew Lipsman, comScore:

@ Peter,

I can't speak for Nielsen but comScore observes the online behavior of ~1 million U.S. panelists. We recruit broadly, across a wide array of websites to ensure robust sample from all demographic segments. Then we statistically weight by these demographics to represent the total U.S. Internet population. We also calibrate our broader sample to a smaller, randomly recruited sample to account for any potential selection bias that is not smoothed out by the weighting/projection.

How do we know that our data are representative? We have several independent validations, but perhaps most notable are our quarterly e-commerce estimates, which have aligned very closely (within ~2% each quarter) with the Dept. of Commerce numbers over several years. We release our data each quarter weeks in advance of the Dept. of Commerce, and if our data were not representative of the total U.S. online population they simply would not correspond as closely with the DoC as they do.

You can see this data reconciliation at the following link: http://www.comscore.com/press/release.asp?press=2242

Post a comment