I need a large list of German words suitable for validating entries in
a Scrabble-like word puzzle game (i.e. words that you'd find in a
dictionary).
The word list must be large (>80,000 words), comprehensive, and
reasonably free of slang and proper names (which won't be valid in the
game). It should include plurals and the various conjugated forms of
verbs - i.e. any word you could legally play in Scrabble or a similar
style game.
Most importantly, the word list must be public domain, open source, or
available under some sort of license that makes it suitable for use in
my game (and there must be a clear license/statement to that effect,
on a reasonable, non-warez site).
The list can be in any reasonable text format - if I can cut and paste
it into Windows Notepad, that's all I need.
I do not speak German well, but I will have a German speaking
acquaintenance verify that it looks reasonable.
(Note that I've posted this question 3 times - once each for Spanish,
German, and French - feel free to answer for each language for a
bigger payment.) |
Request for Question Clarification by
leapinglizard-ga
on
28 Jun 2005 15:55 PDT
The TU Chemnitz dictionary is available for download under Version 2
of the GNU General License. Is this license acceptable to you?
http://dict.tu-chemnitz.de/help.html#download
http://www.gnu.org/copyleft/gpl.html
The dictionary file as it is requires some processing to strip out the
English translations and to extract all inflections, but I can do this
for you with my own text-processing tools. The end result would be an
alphabetic list of words, one per line, which I would post on a web
page for you. Good enough?
leapinglizard
|
Clarification of Question by
psteinx-ga
on
28 Jun 2005 17:02 PDT
The dictionary itself is ok (missing some verb conjugations, but I
think I can generate some of them automatically and/or fill in by
hand). However, it's under the general GPL rather than the library
GPL. The former requires me to release all my source code, whereas
the latter would only require me to open source the dictionary and/or
my modifications to it. I've asked the library provider if LGPL would
be acceptable - if he says yes, then the answer is complete (I don't
need text parsing- I can do that myself). If he won't consent to
LGPL, then I don't think I can use this. I'll let you know.
|
Request for Question Clarification by
leapinglizard-ga
on
30 Jun 2005 15:01 PDT
The following is a fairly comprehensive word list.
ftp://ftp.ox.ac.uk/pub/wordlists/german/words.german.Z
An accompanying README mentions that the file came from Munich.
ftp://ftp.ox.ac.uk/pub/wordlists/README
Based on the number of entries in this word list, I believe it was
copied to the Munich FTP repository from Erlangen. The Erlangen file
is cited here.
http://ftp.fi.muni.cz/pub/tex/local/spelling/README.german
Erlangen is also listed on this page devoted to "Free Electronic
Dictionaries". See under the heading "Where to find dictionaries
today".
http://runeberg.org/admin/dictionary.html
Although I have no firm proof that this word list is in the public
domain, I have detected no sign that is covered by a restrictive
license. All indications are to the contrary. All in all, I would have
no qualms about using it in a proprietary program.
What do you think?
leapinglizard
|
Clarification of Question by
psteinx-ga
on
30 Jun 2005 20:54 PDT
OK, I ended up going the route suggested by electropostie-ga of
merging multiple dictionaries (and only using words that appear on two
independent dictionaries). I didn't use all the dictionaries in E's
list - just those that appeared to be public domain, open-source
and/or academic in origin (i.e. not commercial, and not from someone
else's game).
So, electropostie-ga - If you repost your comment as an answer, I'll approve it.
LeapingLizard - the Chemnitz dictionary wasn't a final answer for me,
as it had various issues, but it was helpful and put me on the right
scent. If you send me appropriate info (physical address and/or
paypal account), I'll gladly send you half the list price, if you
think that's fair.
Phil
|
Request for Question Clarification by
leapinglizard-ga
on
30 Jun 2005 22:04 PDT
We are not permitted to exchange contact information through this
service. You can, however, change the list price of your question
before an answer is posted. See here under "Change your question
price".
http://answers.google.com/answers/help.html#followup
By the way, the commenter below is not a Researcher and cannot receive
payment for his comment.
leapinglizard
|
Clarification of Question by
psteinx-ga
on
01 Jul 2005 07:39 PDT
OK - would you be cool if I dropped the price to $30, you submitted
your response as an answer, and I accepted?
|
Request for Question Clarification by
leapinglizard-ga
on
01 Jul 2005 08:10 PDT
Sure, that would be an equitable solution.
leapinglizard
|