Google Answers Logo
View Question
 
Q: website statistics ( Answered 4 out of 5 stars,   0 Comments )
Question  
Subject: website statistics
Category: Miscellaneous
Asked by: frediii-ga
List Price: $50.00
Posted: 18 Dec 2005 07:37 PST
Expires: 17 Jan 2006 07:37 PST
Question ID: 607069
How many active websites are there in the world?
How many total web pages?
How many active web sites have 10 pages or less?
How many active web sites have 11-50 pages?
How many active web sites have 51-100 pages?
How many active web sites have over 100 pages?
Answer  
Subject: Re: website statistics
Answered By: welte-ga on 21 Dec 2005 10:52 PST
Rated:4 out of 5 stars
 
Hi frediii-ga, and thanks for your question.

One of the best estimates of the number of total websites comes from a
recent Netcraft survey:

http://news.netcraft.com/archives/2005/06/01/june_2005_web_server_survey.html

As of June, 2005, there were approximately 64,808,485 sites.  The
November, 2005 estimate was 74,572,794.  The above report also gives
quite a bit of additional information, such as growth rates, market
share for various web server software (Apache is the big leader as you
might guess), operating systems, etc.


Here is their most recent web server survey:
http://news.netcraft.com/archives/web_server_survey.html


You can find also find information, for example, not only about how
prevalent Linux servers are, but also which distributions are the most
popular and which are gaining ground the fastest:
http://news.netcraft.com/archives/around_the_net.html

______________

In terms of the number of pages per website, Hewlett-Packard studied
this question in 2001.  The authors of the article below found a power
law distribution of pages per site, the number of users who visit a
given site, and the number of links point to and from a given site.   
For the fraction of sites with a given number of pages, HP includes
data from two sources, infoseek.com and archive.org (see Figure 2A in
the article below).

http://www.hpl.hp.com/research/papers/weborder.pdf

Adamic LA, Huberman BA.  The Web's Hidden Order. Hewlett-Packard Labs,
Palo Alto, CA 94304.
ladamic@hpl.hp.com
huberman@hpl.hp.com


To get totals, one must rebin the data that's presented in the graph
in the above paper.  I did this by extracting the data using
GraphClick and rebinning using Excel.  Here are the results for the
ranges you specify:
http://www.arizona-software.ch/applications/graphclick/en/


Based on 74,572,794 total sites, we get the following approximate values:

Web sites have 10 pages or less: 73,950,191
Web sites have 11-50 pages: 591,183
Web sites have 51-100 pages: 26,600
Web sites have over 100 pages: 4,790


http://www.hpl.hp.com/research/papers/weborder.pdf

=================================================

I hope this information is useful.  Please feel free to request
clarification prior to rating.

       -welte-ga

Request for Answer Clarification by frediii-ga on 21 Dec 2005 12:31 PST
Hey...thanks for the answer....but the netcraft graph shows about 38m
active websites.....and 74m domain names...check me out?  Fred

Clarification of Answer by welte-ga on 21 Dec 2005 15:13 PST
Hi again Fred,

You are correct.  The November, 2005 graph shows the number of sites
(or domain names) and the number with active (live) sites:

http://news.netcraft.com/archives/2005/11/index.html

I based the numbers I gave you on the total sites (domain names).  If
you are interested in the proportions of active sites in the ranges
you specified, I can calculated those as well.  Because the underlying
proportions (from the HP article) would be the same, one can redo the
same analysis for the number of active (live) sites:

http://news.netcraft.com/archives/2005/11/index.html

Based on this, there are about 34 million active sites as of November, 2005.


Based on 34 million active (live) sites, we get the following approximate values:

Web sites have 10 pages or less: 33,716,137
Web sites have 11-50 pages: 269,538
Web sites have 51-100 pages: 12,100
Web sites have over 100 pages: 2,180


       -welte-ga

Request for Answer Clarification by frediii-ga on 29 Dec 2005 10:23 PST
thanks for the data confirmation....one last request...I had also
asked for ....total number of website pages on the web....you guys
used to post that you indexed 8 billion.....would that be a good
number?   thanks Fred

Clarification of Answer by welte-ga on 29 Dec 2005 18:01 PST
Hi again, 

The answer to this part of your question depends a little on how you
define it.  The "surface web," that part of the web that's easily
indexed by search engines, tends to be more static.  The so-called
"deep web" consists of databases, dynamic web pages, etc., and is much
harder to index by search engines.  There is considerable ongoing
research on this topic.  Here is one useful source:

http://www.deepwebresearch.info/

"The Deep Web covers somewhere in the vicinity of 600 billion pages of
information located through the world wide web in various files and
formats that the current search engines on the Internet either cannot
find or have difficulty accessing. The current search engines find
about 8 billion pages at the present time of this writing. "

        -welte-ga
frediii-ga rated this answer:4 out of 5 stars

Comments  
There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy