Google Answers Logo
View Question
 
Q: Dictionary/Thesaurus/Encyclopedia available as Web Service ( No Answer,   3 Comments )
Question  
Subject: Dictionary/Thesaurus/Encyclopedia available as Web Service
Category: Computers > Programming
Asked by: billisdog-ga
List Price: $100.00
Posted: 09 Sep 2005 17:26 PDT
Expires: 20 Sep 2005 12:13 PDT
Question ID: 566251
I am developing a Web 2.0 (read: ajax-heavy application) and I would
like to know if several various services are available as either:
A)- A real XML Service API, such as the way Google's Web API is
(although please not with a daily limit like Google's)
B)- HTTP pages that are easily browsable/parseable/regular/predictable
enough in their responses that I COULD make requests against them with
XMLHTTPRequest and use a simple enough set of rules to parse the HTTP
response into the bits I am interested in.

The services I am looking for are:
1- A dictionary.  A web service or web site that i could request with
a word and get a response back that somewhere contains its definition
and (this is also essential) the part of speech.
2- A thesaurus.  Similar as above but I'd like to receive back as a
response possible alternatives for the word.
3- An encyclopedia.  I would use Wikipedia, for instance, but their
database is too truly colossal to mirror (like several dozen gigs) and
they reject all automated requests, to boot.
Encarta seems to play along well enough, but it isn't very throrough-
any other suggestions?
4- Online academic journals, but free.  I doubt there are too many of
these, but if any quality legal/medical/humanities etc online academic
journals exist that don't cost anything and respond well to an HTTP
request, I'd like to know about them.

What are my options for each of these above four categories- either as a real
Web Service, or a page that with a certain querystring will return
easily parseable responses?  I'd like up to 3 or 4 possibilities for
each category, if that many exist, since I expect to encounter
numerous programming difficulties.  Downloadable libraries are also
acceptable but not preferred.

P.S. By easily parseable/predictable response, I mean an HTTP page
that I could query with a certain set of parameters and the response
it gives back would have the information I desire always formatted in
a fairly standard way and in roughly the same place, so that I could
extract the information with a high percentage of success by using an
algorithm making heavy use of string functions and regular
expressions.  As an example, using Dictionary.com as a reference this
way would score poorly under these criteria as far as I am concerned:
it gives back a predictable and regular response page with the
definition about half the time, but the other half of the time it
might take you through any number of intermediary pages asking for
clarification of the word.  (In addition, just to point out,
Dictionary.com doesn't give parts of speech along with the word, which
is essential to my project).

P.P.S. Any word on when Google might raise the API search query limit
to above a thousand a day?  Because I most definitely WILL sign up and
pay whatever it costs the minute they do.

Request for Question Clarification by nenna-ga on 20 Sep 2005 11:42 PDT
I believe that your question will require more time
and effort than the average amount of time and effort associated
with this price.  Here is a link to guidelines about pricing
your question, https://answers.google.com/answers/pricing.html

Nenna-GA
Google Answers Researcher
Answer  
There is no answer at this time.

Comments  
Subject: Re: Dictionary/Thesaurus/Encyclopedia available as Web Service
From: kkkkk1-ga on 12 Sep 2005 08:48 PDT
 
you can get it there http://www.geocities.com/onlinedictionary87/dictionary.htm
Subject: Re: Dictionary/Thesaurus/Encyclopedia available as Web Service
From: billisdog-ga on 12 Sep 2005 15:55 PDT
 
I can tell by looking at the page you referenced that this is pretty
obviously a scam.  Thanks, but I am not a chump.
Subject: Re: Dictionary/Thesaurus/Encyclopedia available as Web Service
From: kgilpin-ga on 15 Sep 2005 07:23 PDT
 
Entrez is a government sponsored web site that unifies access to
PubMed (life sciences publications), OMIM (sort of an encyclopedia of
life sciences terms), protein data bank, etc etc. Quite a few data
sources.

You can access this information programatically through web service
APIs; just URLs that you post to and receive XML in response. Each
database behind Entrez defines its own XML responses so it is a fair
amount of work to use.

Abstracts are available programatically for all PubMed articles. Full
text is typically not available for free, but some journals do have
open content.

Are you looking for just scientific terminology, or terminology from all domains?

You mention part of speech. The best resource for this I have found is
the British National Corpus (BNC) which is a POS-tagged corpus.

And although Wikitionary and Wikipedia are big I don't see a better
alternative out there than getting their database dumps and loading
them. I believe they do provide deltas so once you do the initial
import, keeping the info up to date is not too awful.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy