Google Answers Logo
View Question
 
Q: Best Platforms / Tools for Mass Querying, HTML parsing. ( No Answer,   0 Comments )
Question  
Subject: Best Platforms / Tools for Mass Querying, HTML parsing.
Category: Computers > Internet
Asked by: bigphil_at_writeme-ga
List Price: $5.00
Posted: 04 Aug 2005 02:06 PDT
Expires: 03 Sep 2005 02:06 PDT
Question ID: 551559
This is somewhat of an open-ended question, but I hope to get insight
from experienced web developers out there.  I'm hoping use GoogleAPI
to query 3-4 lists of query terms, each list with somewhere between
2000-5000 terms.  (I realize this might take many days given Google's
API licensing limits.)  For each query term, I want to retrieve the
top 10 results in each of 5 different (non-English) languages.  For
each resulting page, I just want to keep the sentence or table row
that has the query term.

Then, I want to keep these sentences in flat file(s), data struct, or
a database somehow, and do some pretty major string manipulation.

I'm trying to figure out what platforms and tools out there will best
handle these tasks.  I don't mind learning entirely new environments /
languages.  It's an academic project, and I prefer to stick to
freely/cheaply available tools under Windows because that's what I
have in front of me.  However, if there's a great idea under the
unix/linux umbella, I'll consider it.

An answer to this question will outline an end-to-end solution,
mentioning all languages, development tools and libraries needed to
best accomplish this project as quickly as possible.  It should
include some non-mainstream, non-obvious information (perhaps
specialized string manipulation or web retrieval tools) that will make
my job easier.
Answer  
There is no answer at this time.

Comments  
There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy