Hi,
I need to build a list with sites that ends with ***.co.il (I know
that there are ~35,000 sites).
I tried to use a search engines but the results where huge (millions of results).
I even asked my local Internet organization ("ISOC") but they told me
they just can't give me the list.
How can I build an accurate list as I've described before?
Thanks,
Romko-ga. |
Clarification of Question by
romko-ga
on
22 Jun 2006 11:57 PDT
By *** , I mean not pnly 3 letters site name but any number of letters...
Best,
R.
|
Request for Question Clarification by
sublime1-ga
on
22 Jun 2006 12:31 PDT
This tailored Google search should do the trick:
inurl:.co.il
://www.google.com/search?q=inurl%3A.co.il
Let me know if this satisfies your interests...
sublime1-ga
|
Clarification of Question by
romko-ga
on
22 Jun 2006 18:19 PDT
Nope, your idea gives 32,000,000 results - a number which I just can't handle.
Maybe if there is a way to cut sites that looks like:
(1) http://www.XYZ.abc.co.il and (2) http://www.abc.co.il/XYZ/...
Thus leaving only the "pure": abc.co.il, than it was great.
I hope this clarifies my needs,
Romko-ga.
|
Request for Question Clarification by
pafalafa-ga
on
22 Jun 2006 19:11 PDT
This search pares things down considerably:
://www.google.com/search?q=-inurl:www+site:co.il&num=100&hl=en&rls=GGLD,GGLD:2003-43,GGLD:en&lr=lang_en&sa=X&oi=lrtip9
but there's still a half a million of the buggers to deal with.
What leads you to believe there are only 35,000? Where did that number come from?
pafalafa-ga
|
Clarification of Question by
romko-ga
on
22 Jun 2006 19:48 PDT
Hi,
First - Thanks for your help.
Second - The number 35K is a round figure that "ISOC" gave to a Prof. I know.
Third - Can you tell me the secret of this tailoring trick - I tried it
myself with ***.gov.il (instead of co.il) but got poorer results than yours.
summing things up:
I'm still convinced that the answer I'm looking for is a combination
of a strong search engine and some kind of software which can reduce
the unwanted results to the minimum, thus giving me my desired, perfect, list.
Thanksn a million,
Romko-ga.
|
Request for Question Clarification by
pafalafa-ga
on
22 Jun 2006 20:43 PDT
You can see the search terms in the search box at the link:
-inurl:www site:co.il
The [ inurl: ] command focuses on text that is [ duh! ] in the url,
and the minus sign says, in essence, Don't include sites with www in
the url.
A similar search, but replacing co.il with gov.il leads to these results:
://www.google.com/search?num=100&hl=en&lr=lang_en&rls=GGLD%2CGGLD%3A2003-43%2CGGLD%3Aen&q=-inurl%3Awww+site%3Agov.il&btnG=Search&lr=lang_en
What else would you need for a full answer to your question?
pafalafa-ga
|
Clarification of Question by
romko-ga
on
22 Jun 2006 21:26 PDT
Hi,
As I wrote in my previous clarification: with your suggested method I'm getting
a result which is 16 times bigger than my objective list (40K vs. 640K).
Again, I don't think that a search engine alone can do the trick.
Best,
Romko-ga.
|
Request for Question Clarification by
byrd-ga
on
25 Jun 2006 10:08 PDT
Hi romko-ga,
Let me know if this meets your needs:
http://www.iguide.co.il/sites/sites.htm
http://www.iguide.co.il/
Thanks,
Byrd-ga
|
Clarification of Question by
romko-ga
on
25 Jun 2006 13:42 PDT
Thanks but no.
I'm well familiar with this site.
It reflects the true picture 6-7 years ago.
Was there an Internet 7 years ago? ;-)
Thanks for trying,
Romko-ga.
|