Dan Frost's blog

Crawl of the week - Google vs Yahoo! vs Live Search - battle of the blogs

Posted by Dan Frost on Wed, 12/03/2008 - 09:18

Every week or so I'll be posting "celebrity site crawls" and talking about the results.

So this week in celebrity site crawls it's the battle of the giants.

In the corner with the bean bags and M&Ms dispenser, hailing from Mountain View, doing no evil whatsoever but still weighing in with a market capitalization of $137b, is the King of spin, the geek with a mean streak, Gooooooogle.

Google sitelinks and site search in results

Posted by Dan Frost on Tue, 11/03/2008 - 19:59

This post is now here.

The need to structure web pages

Posted by Dan Frost on Fri, 07/03/2008 - 10:13

Most people in SEO have heard the term "semantic search": the idea that a search engine is built for a specific niche, with the aim of giving you more relevant results for your searches.

Let's say you search on Google for "Derby" - you could mean the Kentucky Derby, a Derby hat, Derby in England, hotels in Derby and so on.

A search engine bot in pictures - too much speculation

Posted by Dan Frost on Thu, 06/03/2008 - 15:05

I've been mulling over this article for some time so that I could draw some sensible conclusions from it, but I've decided against doing so because there are far too many variables in there.

Crawler behaviour in pictures

Posted by Dan Frost on Mon, 03/03/2008 - 09:12

I recently stumbled upon a very interesting research project about how Googlebot, Slurp and MSNbot crawled pages on a very large site (billions of pages).

[Figures: Googlebot crawling tree; Googlebot activity; Yahoo! Slurp crawling tree]

How to make your website more crawlable

Posted by Dan Frost on Tue, 26/02/2008 - 18:17

I've read many articles on this subject, but since we've actually written a web crawler I think we're pretty well placed to give good advice on how a website should be built to be easily crawled :)

1. Well-structured HTML

Optimising a Drupal site for SEO - part 2

Posted by Dan Frost on Tue, 26/02/2008 - 12:14

The blog post I had in mind for this has been very, very well covered in Wim Leers' post on improving Drupal's page loading performance.

What I will do is blog about the specific issues I found with crawlscore.com and how I addressed them.

Optimising a Drupal site for SEO - part 1

Posted by Dan Frost on Mon, 25/02/2008 - 16:17

Because crawlscore.com has been through a couple of re-designs, and also because I've made some mistakes when putting the site together, it needed some maintenance to ensure we make the most of each crawler's visit.

Collaborative search engine crawler study

Posted by Dan Frost on Fri, 22/02/2008 - 13:36

We're considering setting up an open, collaborative study into how often web crawlers visit websites.

The idea is that with minor PHP/Apache mods you can turn robots.txt into an executable PHP file and basically submit the fact it has been requested to a central resource.
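To make the idea concrete, here is a minimal sketch of the "report every robots.txt request" step. It is written in Python rather than the PHP/Apache setup mentioned above, purely as an illustration. The names `ROBOTS_BODY`, `build_crawl_report` and `handle_robots_request`, the report fields, and the `submit` callable standing in for the central resource are all assumptions, not part of any existing study or API.

```python
import json
from datetime import datetime, timezone

# Hypothetical robots.txt content served to every requester.
ROBOTS_BODY = "User-agent: *\nDisallow:\n"


def build_crawl_report(site, user_agent, when=None):
    """Describe one robots.txt request as a JSON-serialisable dict.

    The field names are illustrative assumptions, not an agreed format.
    """
    when = when or datetime.now(timezone.utc)
    return {
        "site": site,
        "user_agent": user_agent,
        "requested_at": when.isoformat(),
    }


def handle_robots_request(site, user_agent, submit):
    """Return the robots.txt body and report the request via `submit`.

    `submit` is any callable that takes a JSON string; in a real
    deployment it might POST to the central resource (assumed here).
    """
    report = build_crawl_report(site, user_agent)
    try:
        submit(json.dumps(report))
    except Exception:
        # Reporting must never stop the crawler from getting robots.txt.
        pass
    return ROBOTS_BODY
```

Calling `handle_robots_request("example.com", "Googlebot/2.1", sent.append)` with an empty list `sent` returns the robots.txt body and leaves one JSON report in the list. Passing the submit step in as a callable keeps the sketch free of network code, and swallowing its errors means a crawler still gets robots.txt if the central resource is down.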

What is a large website?

Posted by Dan Frost on Fri, 15/02/2008 - 00:34

We've had some feedback from customers asking for a version of Crawl Score to crawl their 10-30 page websites, saying that our starter package with its 100-page limit is too much, and it got us thinking.