Subject:
How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in them
Category: Computers > Security Asked by: witangodude-ga List Price: $30.00 |
Posted:
04 Aug 2003 21:46 PDT
Expires: 03 Sep 2003 21:46 PDT Question ID: 240135 |
I need to know how many pages have URLs, or hrefs on the page, containing any of the following patterns: .taf, .tml, .thtml (e.g. http://www.witango.com/help/help.taf). Based on the same search patterns, I would also like a list of the URLs and hrefs found, with their corresponding page titles. I am happy to pay extra for the list of unique URLs and hrefs, depending on how many unique items are found: an extra $20 for every 100,000 unique URLs and hrefs on the list, up to a maximum of $200. I believe that there should be about 600,000 URLs. |
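As a quick check of the bonus terms above, here is a minimal sketch of the arithmetic. It assumes that only complete blocks of 100,000 unique items earn the $20 (the question does not say how partial blocks are treated), and the function name `bonus_dollars` is purely illustrative.

```python
def bonus_dollars(unique_count, per_block=20, block_size=100_000, cap=200):
    """Bonus in dollars: per_block for each full block of unique items, capped.

    Assumption: partial blocks earn nothing.
    """
    return min(cap, per_block * (unique_count // block_size))


# The asker's own estimate of about 600,000 unique URLs.
print(bonus_dollars(600_000))
```

Under that assumption the cap of $200 is reached at 1,000,000 unique items.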
|
There is no answer at this time. |
|
Subject:
Re: How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in them
From: robertskelton-ga on 05 Aug 2003 01:14 PDT |
They can be found on search engines, but unfortunately the most results you will get for each query is a couple of thousand. It would take a lot of effort to scrape the results, add them to a file and remove duplicates. The only efficient way would be an automated process combined with some smart keyword selection. |
Subject:
Re: How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in them
From: witangodude-ga on 05 Aug 2003 03:24 PDT |
So how do you get a search engine like Google to return results beyond the first 1,000, so that an automated process could collect the full list and correlate the results? |
Subject:
Re: How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in them
From: robertskelton-ga on 05 Aug 2003 04:06 PDT |
It's a case of running searches on hundreds of different keywords and combining the results. There's no other way. |
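The combining step described in this thread could be sketched roughly as follows. This is only an illustration: it assumes the result URLs for each keyword query have already been collected into lists by some other means, and the names `normalize` and `merge_unique` are hypothetical helpers, not part of any search engine API.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize(url):
    """Lowercase the scheme and host and drop any #fragment."""
    parts = urlsplit(url.strip())
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path, parts.query, "")
    )


def merge_unique(result_lists):
    """Merge several result lists into one, keeping first-seen order."""
    seen = set()
    merged = []
    for results in result_lists:
        for url in results:
            key = normalize(url)
            if key not in seen:
                seen.add(key)
                merged.append(key)
    return merged


# Two hypothetical result lists from different keyword queries.
query_a = ["http://www.witango.com/help/help.taf",
           "HTTP://WWW.WITANGO.COM/help/help.taf#top"]
query_b = ["http://example.com/a.tml"]
print(merge_unique([query_a, query_b]))
```

Normalizing before comparing means the same page reached through differently cased hosts or with a fragment is counted once.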
Subject:
Re: How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in t
From: zarby-ga on 05 Aug 2003 07:24 PDT |
Use Google with:
allinurl: taf (331 K results)
allinurl: thtml (70 K results)
allinurl: tml (428 K results)
Not all of these URLs actually end with .taf/.thtml/.tml, and Google doesn't know all the pages or hrefs. Still, it gives you a fair idea of the number of such URLs, since you can take a reasonable statistical sample, for instance from page 10 of the Google results for each query. To collect URLs of this kind by the thousands, you need to process the results and remove anything that does not end with .taf/.thtml/.tml. |
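The filtering step mentioned above might look something like this. It is a sketch under the assumption that the result URLs are already available as a list of strings; `has_target_extension` and `filter_urls` are illustrative names. The check is made against the URL path only, so a query string after the extension does not hide a match.

```python
from urllib.parse import urlsplit

# The three extensions named in the question.
EXTENSIONS = (".taf", ".tml", ".thtml")


def has_target_extension(url):
    """True if the path part of the URL ends with one of the extensions."""
    path = urlsplit(url).path.lower()
    return path.endswith(EXTENSIONS)


def filter_urls(urls):
    """Keep only URLs whose path ends with .taf, .tml or .thtml."""
    return [u for u in urls if has_target_extension(u)]


candidates = [
    "http://www.witango.com/help/help.taf",
    "http://example.com/staff/index.html",   # contains "taf" but is .html
    "http://example.com/page.thtml?id=1",    # extension followed by a query
    "http://example.com/taf/",               # "taf" as a directory name
]
print(filter_urls(candidates))
```

The same predicate can be used to estimate, from a sample of results, what fraction of an allinurl: count is genuine.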
Subject:
Re: How many pages have URLs or hrefs with the pattern .taf, .tml or .thtml in them
From: cogent-ga on 19 Aug 2003 18:30 PDT |
Type the following queries into Google:
0 || 1 || a || e filetype:taf (184,000 found)
0 || 1 || a || e filetype:tml (682 found)
0 || 1 || a || e filetype:thtml (15,400 found)
But again, each is limited to 1,000 results. |