Hi there,
Greg R. Notess is a search engine expert who has been keeping track of
search engine index sizes for three years now. Not only does he list
the sizes reported by the engines in their press releases, but he also
uses his own techniques to verify the sizes.
Millions of web pages indexed
----------------------------
Google 3,033
AlltheWeb 2,106
AltaVista 1,689
WiseNut 1,453
Hotbot 1,147
MSN Search 1,018
Teoma 1,015
The above numbers are from Dec 31, 2002
http://www.searchengineshowdown.com/stats/sizeest.shtml
I pay close attention to such numbers, as I run my own search engine
site and like to keep the facts and figures up to date. The numbers
are still accurate for today. The emphasis on size has waned, with
engines now aiming to mimic Google's good points - a clean,
uncluttered interface and relevant results.
Although Google has the most pages indexed, it only indexes the first
100K of each page. AlltheWeb indexes complete pages - which means in
terms of information indexed their sizes could be quite similar.
A good way to test AlltheWeb is by searching for url.all:http:+, which
today brings up 1961 million pages:
http://www.alltheweb.com/search?cat=web&cs=utf-8&l=any&q=url.all%3Ahttp%3A%2B
Other search engines don't make it so easy to check their claims.
Hotbot has changed recently into an interface for 4 different searche
engines - the figure given is for the default Inktomi search. MSN also
uses Inktomi, however each have a slightly different version of it,
which explains the difference in sizes.
Search strategy: personal bookmarks
Best wishes,
robertskelton-ga |