Google Answers Logo
View Question
 
Q: Site removed from google index ( Answered 5 out of 5 stars,   2 Comments )
Question  
Subject: Site removed from google index
Category: Computers
Asked by: halloerstmal-ga
List Price: $30.00
Posted: 08 Jun 2003 05:21 PDT
Expires: 08 Jul 2003 05:21 PDT
Question ID: 214644
Hello,
in the past, my website www.glymes.com always had a top ranking at
google for keywords like "tetraglyme" or "butyl diglyme".
The ranking was first or second in a total number of hits of 270
("tetraglyme" and 420 ("butyl diglyme").
Then, at around mid to late May 2003, for 2-3 days, my site was only
found at
every second or third search (sometimes it was found and sometimes it
was not found), but the ranking did not change.
However, after 3 days it has completely vanished from the search
results.
The web statistics for my site tell me that googlebot still hits my
site.
The site was not changed.
I have sent an e-mail to help@google.com and I also asked for
reinclusion at webmaster@google.com. I only got an automatic e-mail
reply saying that my e-mail was received and that I'll hear from
someone at google soon.
Until now, nobody replied and my website still is not indexed at
google. From other answers here at google answers I understand that
such a problem may only be a temporary one and that after around one
week the site should be reincluded. I am waiting for around two weeks
now, and my site still does not appear at google.

Why has  my site been removed from the index and how can I reinclude
it again?

Thank you very much for your answer in advance.
Answer  
Subject: Re: Site removed from google index
Answered By: webadept-ga on 08 Jun 2003 11:19 PDT
Rated:5 out of 5 stars
 
Hi, and Eureka! I found it. 

Your question has be a troubling one, since it doesn't appear that you
are using any spamming type solutions for you site, it is informative
and has a good amount of text, no flashing adds and hard sells. I just
couldn't find anything to suggest why this has happened on your site
until I checked your robots.txt file

User-agent: *
Disallow: 


Take out the disallow line. You are in effect telling the robots not
to list your site.

You may have to wait for a while before this takes effect. Perhaps a
month or longer. To speed things up (not a guarantee) you might want
to send a message off to the Google folks again saying you fixed the
problem, but don't submit the site again. Too much of that at one time
is not a good thing.

Good luck. 

webadept-ga

Request for Answer Clarification by halloerstmal-ga on 08 Jun 2003 12:08 PDT
Hi,

thanks for your answer. Sorry to bother you again, but I don't think
what you recommend is right.
1. I did not change my robots.txt. The same robots.txt worked fine in
the past, why shouldn't it work now?
2. At www.robotstxt.org/wc/faq.html it clearly says that 

User-agent: *
Disallow:

means that all robots are allowed to crawl my site.
If it would read

User-agent: *
Disallow: /

none of the robots would be allowed to search my page.
Could you please comment on this?

Clarification of Answer by webadept-ga on 08 Jun 2003 16:06 PDT
Hi again, 

Yes I'm aware of what it says there, but I've had this trouble with
other sites before. One thing you learn really fast in dealing with
the Internet environment is that, just because something has worked
for a long time, doesn't mean it will work in the morning. I would
take the disallow line out completely.

Google bar here to see PageRank:
http://toolbar.google.com/

If your site was banned then you would have no page ranking and the
googlebot wouldn't show up.

Currently your PageRank is 0 of 10, which means you are in the index,
after a fashion. If you are not in the Google Index or have been
banned, the PR bar is gray, meaning, no ranking at all. So 0 of 10 is
not really that bad.

The bots are there, if they are showing up more than 3 times a week,
(in your case I would bet they are showing twice a day.. eh?) it means
the bots are trying to index your site and are being stopped by
something. Either they are not parsing your page right, or the
robots.txt is telling them not to (despite the robots website telling
you it's okay). No matter if they are right or not, it is a simple
change and just as correct. Try it out.

The other thing that could stop the robots is cloaking:
://www.google.com/webmasters/faq.html#cloaking

But I can't see you doing this, so I didn't bring it up before. 

Another thing you can do is get listed in DMOZ. http://www.dmoz.com

This will help your PR, and also help things like this from happening.

I can't find any links to your site, from other places, good, bad or
indifferent, so it isn't a link farm. You say that you have read the
other questions which pertain to this kind of thing, so I'm guessing
you are aware that using Web Position Gold and those kinds of programs
could cause this.

Change the robots.txt, write Google, watch the bots and give it a
week, see what happens. If there is no change in about 8 days, I'll
pull my answer and we'll see if someone else has a better solution.

thanks, 

webadept-ga

Clarification of Answer by webadept-ga on 08 Jun 2003 18:51 PDT
Hi, 

Robertskelton-ga, with his comment below, has a good point. Most of
your front page, and huge percentages of your other pages (in some
cases 100% of those) are all direct copies from the
http://www.clariant-surfactants.com/ website. That website itself is
not doing so hot,(A PR of 3 is the highest I was able to wrestle from
it). That perhaps 5% of your content is "original" (or not copied at
least), could have a great deal to do with you being in the index, but
not used for any search results. Extracting redundant content pages
from search results has been more prevalent on Google, since the
beginning of the year.

An interesting thing to note at this point: one of the first things I
check when going through sites with your problem is a small content
search. I grab 5 to 7 sentence parts from the front page and search
for them to see if the paragraph or sentence they are in will show up
some place else. Your's didn't at that time. They do show up using the
google.com.au engine
://www.google.com.au/search?q=%2B%22Glymes+are+end+capped+polyethylene%22&hl=en&ie=ISO-8859-1
so my fellow researcher had a small advantage of me today :-) 

Switching engines to www.google.de also showed results 
://www.google.de/search?q=%22Glymes+are+end+capped+polyethylene%22&ie=ISO-8859-1&hl=de&btnG=Google+Suche&meta=

So it looks like a Google Dance is happening around my area right now.
:-)

If this is so, then your website, in order to rank prevalently on any
search again, is going to require a great deal of new content, to
replace to copied content which currently exists.

webadept-ga

P.S. keep the change in the robots.txt though, it will help later.
halloerstmal-ga rated this answer:5 out of 5 stars

Comments  
Subject: Re: Site removed from google index
From: robertskelton-ga on 08 Jun 2003 18:11 PDT
 
My guess is too much duplicated content, which also appears at
Clariant's site:

http://www.glymes.com/description.php
http://www.clariant-surfactants.com/fun/internet.nsf/vwWebFramesets/5477347BA6BAB346C1256CD90037B200?openDocument
Subject: Re: Site removed from google index
From: halloerstmal-ga on 09 Jun 2003 03:35 PDT
 
That's it! Thanks to both of you.
In fact, the Clariant website is a copy of www.glymes.com.
I will change the content on either site.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy