Hi JTFBus!
Initial investigations into www.jtfbus.com yielded no insight into why
it should have been removed from the Google database. Indeed, the site
appears to be well designed, with useful content and not too much
unnecessary code. This led me to look for causes that lay not in
www.jtfbus.com itself but in the effects other sites were having upon
it. At this point the cause of your problem became clear: the use of
multiple duplicate sites is a tactic which Google regards as a form of
spamming, and is likely to punish with exclusion from its listings.
Performing a search for 'jtfbus' enabled me to identify the following
sites, which have substantially duplicated content from
www.jtfbus.com:
http://canon-fax-machines-direct.com/canon-fax-machines/default.cfm
http://ricoh-fax-machines.com/RICOH-FAX-MACHINES/default.cfm
http://www.sharp-copiers-toner-drum-copier-supplies.com/SHARP-DIGITAL-COPIERS/default.cfm
http://www.iobs2.com/jtf/
http://www.faxsupersale.com/
http://www.sharp-fax-machines-direct.com/sharp-copiers/default.cfm
http://canon-copier-toner-printer-toners-supplies.com/canon-laser-class-fax-machines/default.cfm
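If you would like to gauge for yourself how closely two pages overlap, the comparison can be roughly sketched in Python. This is only an illustration: the function names and the 0.8 threshold are my own assumptions, and it is not how Google detects duplicates. It compares the word sequences of two pieces of page text that you have already saved or pasted in.

```python
from difflib import SequenceMatcher


def similarity(text_a: str, text_b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the word sequences are identical."""
    # Compare word by word, ignoring case and differences in whitespace.
    words_a = text_a.lower().split()
    words_b = text_b.lower().split()
    # autojunk=False stops difflib from ignoring very common words on long pages.
    return SequenceMatcher(None, words_a, words_b, autojunk=False).ratio()


def is_substantial_duplicate(text_a: str, text_b: str, threshold: float = 0.8) -> bool:
    """Flag two texts as near-duplicates. The threshold is an arbitrary assumption."""
    return similarity(text_a, text_b) >= threshold
```

A score close to 1.0 between the text of www.jtfbus.com and one of the sites above would suggest that the content is substantially the same.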
Referring to Google's webmaster guidelines, the penultimate bullet
point in the quote below covers our area of discussion.
Google Information for Webmasters
http://www.google.com/webmasters/guidelines.html
"
Quality Guidelines - Specific recommendations:
* Avoid hidden text or hidden links.
* Don't employ cloaking or sneaky redirects.
* Don't send automated queries to Google.
* Don't load pages with irrelevant words.
* Don't create multiple pages, subdomains, or domains with
substantially duplicate content.
* Avoid "doorway" pages created just for search engines, or other
"cookie cutter" approaches such as affiliate programs with little or
no original content.
"
So it would seem that Google have detected a breach of their
guidelines, and have therefore removed www.jtfbus.com from the search
database. You may be wondering why this has happened now and not
before. This is most likely related to the increase in the use
of site duplication over the last couple of years; the following quote
makes the point concisely, if somewhat bluntly:
"the more that the search engine databases get compromised by
duplicate content, the more the engines will fight against it"
Search Engine Guide: Doorways, Mirrors, and Duplicate Sites
http://www.searchengineguide.com/whalen/2002/0607_jw1.html
Thus far, my answer may have seemed a little negative, but hopefully I
can remedy this now by explaining how you could resolve the problem.
There are essentially two ways to start making www.jtfbus.com more
'Google-friendly' again. These involve either removing all the
duplicate sites, or making the duplicate sites invisible to search
engines. If a substantial quantity of your business comes from the
duplicates, it may be desirable to leave them in place and use the
latter of the above options.
When Google, or any other search engine, records information from the
Internet, it does so with a piece of software called a 'spider'. The
spider travels around the 'net through hyperlinks, just as a human
reader would do. When the spider comes to a new page, it will check
the robots.txt of the site (to be discussed in a moment), and in most
cases will record the content of the page in the Google database.
It is possible to prevent a spider from indexing a site, effectively
making it invisible to the search engine, by using a specific command
in the robots.txt. Currently the duplicate sites do not have a
robots.txt, but it is very simple to create one: just make an
ordinary text file with Notepad or similar, enter the following
commands, save it as robots.txt, and upload it to the root directory
of each duplicate site, so that it can be reached at, e.g.,
http://www.faxsupersale.com/robots.txt.
To stop any search spider from indexing a site, use the command:
User-agent: *
Disallow: /
Or to stop just the Google spider, use:
User-agent: Googlebot
Disallow: /
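Before uploading the file, you may wish to confirm that the rules behave as intended. The following is a minimal sketch using the robots.txt parser in Python's standard library; the helper name `allowed` and the two constants are my own, and the parser only approximates how a real spider reads the file.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The two rule sets quoted above, as they would appear in robots.txt.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
BLOCK_GOOGLE = "User-agent: Googlebot\nDisallow: /\n"

url = "http://www.faxsupersale.com/"
print(allowed(BLOCK_ALL, "Googlebot", url))        # False: every spider is blocked
print(allowed(BLOCK_GOOGLE, "Googlebot", url))     # False: Googlebot is blocked
print(allowed(BLOCK_GOOGLE, "SomeOtherBot", url))  # True: other spiders may still index
```

The last line shows the practical difference between the two options: the Googlebot-only rule leaves the duplicate sites visible to other search engines.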
Further information relating to the use of robots.txt is available
from:
Creating and Using a robots.txt File
http://www.outfront.net/tutorials_02/adv_tech/robots.htm
Having made these changes, it will probably take up to a month for
them to have an effect on the Google database. However, once they
have, you will no longer be infringing the Google guidelines, and an
email to help@google.com explaining the situation and specifying you
have disallowed the duplicate sites should result in www.jtfbus.com
being re-listed in the results pages.
Below are links to a few sites which I have found useful when
researching the subject of search engine optimization. They may be of
interest if you would like any further details about the function of
search engines.
Search Engine Guide
http://www.searchengineguide.com/
Search Engine Watch
http://searchenginewatch.com/
Webmaster World
http://www.webmasterworld.com/
Search Engine World
http://www.searchengineworld.com/
I hope you have found this answer both interesting and informative; if
you would like clarification of any area, please do not hesitate to
ask before rating my answer.
Kind regards,
mcfly-ga :)
**Search Strategy**
jtfbus
"jtf business systems"
"+www.jtfbus.+com"
"+www.jtfbus.+com" -site:www.jtfbus.com
google duplicate OR duplicated sites
google duplicate OR duplicated sites site:www.google.com
google robots.txt