Google Answers Logo
View Question
 
Q: Searching text for ideas and concepts ( Answered 5 out of 5 stars,   1 Comment )
Question  
Subject: Searching text for ideas and concepts
Category: Computers > Software
Asked by: fiznicker-ga
List Price: $200.00
Posted: 30 Mar 2005 01:43 PST
Expires: 29 Apr 2005 02:43 PDT
Question ID: 502508
What software products or code modules (Win32 dlls) exist that
allow searching text for themes,
ideas, and concepts? For example, Nav4 by Think Tank 23 does this
(http://www.thinktank23.com/).  The goal is to be able to input a
large amount of text, and get the main ideas and concepts as an
output.
Answer  
Subject: Re: Searching text for ideas and concepts
Answered By: wonko-ga on 07 Apr 2005 08:25 PDT
Rated:5 out of 5 stars
 
There are a number of products addressing this problem.  One type
analyzes documents and identifies key concepts to create summaries. 
Another type is typically used for electronic discovery in legal cases
and identifies concepts within documents.

I have provided links to examples of specific firms offering such
products and a broader list of text analysis software companies.

Summarizers:

"Copernic Summarizer" Copernic http://www.copernic.com/

"TextAnalyst" Megaputer http://www.megaputer.com/

"Sinope? Carp Technologies http://www.carp-technologies.nl/en/indexIE.html


Electronic Discovery:

"DolphinSearch" DolphinSearch http://www.dolphinsearch.com/

"Inxight" Inxight http://www.inxight.com/

"Attenex" Attenex http://www.attenex.com/showpage.asp


An extensive list of text analysis and text mining software:

"Text Analysis, Text Mining, and Information Retrieval Software"
KDnuggets (2005) http://www.kdnuggets.com/software/text.html

I hope one or more of these products meets your needs.

Sincerely,

Wonko

Search Terms: summarize text software
fiznicker-ga rated this answer:5 out of 5 stars and gave an additional tip of: $10.00
Exceptional answer - just what I was looking for.

Comments  
Subject: Re: Searching text for ideas and concepts
From: craniac-ga on 07 Apr 2005 08:50 PDT
 
Hi,

http://www.clearforest.com/

Claims to do this, but many experts feel that statistical analysis is
inherently limited, and that at some point grunt librarian work has to
be done by hand.  It really depends upon the degree of precision
you're looking for. Clay Shirky discusses ways to fool your users into
doing manual classification in his essays on "Folsonomies" which
discusses the ways in which users of online photo applications such as
Flickr tag their own data.

I believe the in receiving an answer is because what you are asking
for is an artificial intelligence application that doesn't yet exist. 
The ability to extract actual concepts (as opposed to unique or
repeated combinations of words) falls into 'skynet' territory, i.e. a
machine that has self awareness.

That said, Steven Berlin Johnson has used DevonThink (for the mac)
with some success to identify related topics within his own body of
notes, and the discussions surrounding Johnson's essay on DevonThink
will link to similar windows applications.  I'm also interested in
this topic, as I would like, at some point, to parse student essays
and generate concept maps using something like Graphviz, a free
diagram mapping program.

As an aside, Amazon now generates "SIPS" or Statistically Improbably
Phrases" for the books that it sells that have been digitized. 
Looking at Malcolm Gladwell's "Blink" for example, reveals a number of
cool buzzwords such as "adaptive unconscious."  I think this problem
falls into the realm of computational linguistics, if that helps any. 
You might also try posting this query at ask.metafilter.com as they
are a bright but rowdy group.

references:

http://tokerud.typepad.com/blog/2005/02/knowledger_kill.html
http://www.stevenberlinjohnson.com/movabletype/archives/000230.html

These guys have a white paper addressing extracting concepts from
plain text.  It doesn't point to an application but might include
enough related terminology to help you narrow your search:

http://garage.cps.msu.edu/papers/GARAGe98-07-02.pdf

another good article:

http://www.intelligententerprise.com/031210/619decision1_2.jhtml

Clearforest reviewed:

http://www.vnunet.com/features/1150440

The University of West Florida claims to be doing some "automated
concept mapping" which looks kinda cool:

http://www.ihmc.us/users/acanas/Publications/AAAI99CmapsCBR/AAAI99CmapsCBR.html

My guess is that ClearForest products cost about as much as a black
market adoption.  Good luck. Oh and feel free to google and email me
if you find anyone else working on or talking about this problem.  A
search for "Markzilla" turns up my crappy blog and email address.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy