boakes.org

Understanding Google’s Bigdaddy Rollout

Several news and technical commentary sites have been pronouncing the arrival of Google’s Bigdaddy datacentre over the last few weeks, and looking at the stats for this site over the last few days I might be seeing it’s effect – hits from Google are up. After a little digging, I reckon the rollout is 50% complete.

Bigdaddy’s Calling Card

What these two pie chars show is that during a nine week period covering December and January, 22% of the readers who arrive at this site do so because they’ve searched in Google and seen a result that’s looked like it’s worth clicking on. On the right hand side, we see that during this last week a statistically significant change can be observed, with 62% of people arriving here from a Google search. That’s a big change, and the overall site traffic has increased as a result.

The effect of Big Daddy on the statistics of this site.

So how can I be sure this change is it’s related to the Bigdaddy infrastructure? Well, obviously I can’t be 100% positive, but here are the clues:

  1. The increase was first noticable during the first weekend in February, approximately when Bigdaddy was scheduled to bubble through to the public.
  2. I’m seeing the IP addresses of those data centres as “hostnames” in my google analytics stats, which means that my content is cached in those datacentres.

What's very interesting (for me) is Matt Cutts comment about the switchover.

I’d expect a new data center to be converted to Bigdaddy roughly every 10 days or so. The more data centers there are using Bigdaddy, the odds of you hitting a Bigdaddy data center in the normal rotation go up.

Supposedly the switchover started in January, so this is an ongoing process, but just how near to completion is it?

Investigating Bigdaddy

The IP addresses from the logs gave me a starting point and there’s a fair bit of information about Google’s datacentres on the web, so for each datacentre I that discovered, I looked at one machine to see if the it was running the new infrastructure (based on the “sf giants” query, which returns sfgiants.com as the top result on Big Daddy infrastructures, but mlb.com on the old system). From that I deduced whether the data centre in question was running the old or Big Daddy infrastructure.

ID IP checked Feb
24th
Mar
1st
July
5th
Location
OD 64.233.161.104 old old BD Mt. View, California
RN 64.233.171.104 old old BD Mt. View, California
HS 64.233.179.104 old old BD Mt. View, California
__ 64.233.189.104 old old BD Hong Kong, China *
__ 66.249.93.104 old (!) BD BD Paris, Illinois *
VA 216.239.37.104 old old BD Herndon, Virginia
DC 216.239.39.104 old old BD Washington, DC
CW 216.239.57.104 old BD BD Palo Alto, California
PY 64.233.167.104 BD BD BD Mt. View, California
MC 66.102.7.104 BD BD BD Santa Clara, California
__ 66.249.87.104 BD BD Munich, Germany *
LM 66.102.9.104 BD BD BD Dublin, ireland
KR 66.102.11.104 BD BD BD Dublin, Ireland
__ 72.14.207.104 BD BD BD Mt. View, California
IN 216.239.53.104 BD BD BD Santa Clara, California
GV 216.239.59.104 BD BD BD Dublin, Ireland
__ 216.239.63.104 BD BD BD Mt. View, California

* Physical locations (where they were not stated by their sources) were guessed using HostIP.info.

Understanding Bigdaddy Progress

So, nine machines appear to be using the big daddy infrastructure and eight are using the old system. Looks like they’re about 50% of the way through. Theoretically then, if the surge in readers is based on the Big Daddy chanegover, I’m going to see an even larger increase in traffic during the next two months as the rest of Google switches over. Of course, there are mitigating factors such as those potential users who don’t speak English, so who won’t find anything I write remotely comprehensible, then there’s the fact that the stuff that gets on this site is culturally rather “western/european” so it may have already had it’s boost, but so far, Big Daddy seems to have had a positive effect on traffic.

Bigdaddy’s Legacy

Hopefully these new visitors will find the content here useful, ’cause if they don’t it’s bandwidth and time on both sides that’s being wasted. Only time (and Google Analytics) will tell if Bigdaddy is as good at quality as it is on quantity.

Published: February 20th, 2006

Tags: