|
|
Subject:
Book digitizing tools and techniques for large-scale projects
Category: Science > Technology Asked by: ssturgis-ga List Price: $25.00 |
Posted:
14 Nov 2005 12:17 PST
Expires: 14 Dec 2005 12:17 PST Question ID: 592864 |
A recent Wall Street Journal story described the tools and techniques used by the Internet Archive to scan out-of-copyright books: http://online.wsj.com/public/article/SB113111987803688478-2GrTWaqO20C8bl6vQd0qMe6_7qk_20061110.html?mod=blogs The question is: What tools and techniques are being used by even larger scale book digitizing projects, such as those of Google, Yahoo, and Amazon? I am particularly interested in: (1) The hardware used to do the scanning. (2) The software used to perform OCR on the scanned text. (3) The resulting sccuracy of the processed, searchable text. E.g., One incorrect character for every "N" characters scanned. (4) The cost per page processed. (Note: The WSJ article did not address items #2 or #3.) | |
|
|
There is no answer at this time. |
|
There are no comments at this time. |
If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you. |
Search Google Answers for |
Google Home - Answers FAQ - Terms of Service - Privacy Policy |