Google Answers Logo
View Question
 
Q: Capture HTML to a picture (gif/jpg) ( Answered 5 out of 5 stars,   1 Comment )
Question  
Subject: Capture HTML to a picture (gif/jpg)
Category: Computers > Graphics
Asked by: imi-ga
List Price: $20.00
Posted: 03 Jun 2003 09:07 PDT
Expires: 03 Jul 2003 09:07 PDT
Question ID: 212464
I am looking for a software that can capture web pages into gif/jpg
(automatic capture is a must. we intend to capture 100,000 - 200,000
sites).

Example at: http://thumbnails.alexa.com/images/bigjpeg/y/yahoo.com_.big.jpeg

The size of the caputre does not matter, as I have technology to do a
mass resize.

Request for Question Clarification by haversian-ga on 03 Jun 2003 09:25 PDT
How technically involved are you prepared for this software to be?

I see some problems that can be solved a number of ways, depending on
your preference:
1 - What browser?  Do you want software with a built-in browser?  Of
what quality?  Something which uses IE's rendering engine would
probably be easier to find, but the Gecko rendering engine found in
Netscape, Mozilla, and others is noticeably superior in standards
compliance.
2 - What about filenames?  Do you need/want the software to
automatically name your files with the URI of the resource you are
saving?
3 - Where to go?  Should the software automatically read a text file
for URIs and browse those?
4 - What platform?  Windows?  UNIX/Linux?  MacOS?  Would you accept a
series of shell or perl scripts as "a software"?

Clarification of Question by imi-ga on 03 Jun 2003 10:12 PDT
1 - What browser?  Do you want software with a built-in browser?  Of
what quality?  Something which uses IE's rendering engine would
probably be easier to find, but the Gecko rendering engine found in
Netscape, Mozilla, and others is noticeably superior in standards
compliance.

Which ever browser, as long as we can get picture of the web site. 

2 - What about filenames?  Do you need/want the software to
automatically name your files with the URI of the resource you are
saving?
yes, filename should be name of the web site or someting similar. we
are very good at mass rename later, as long as it has some kind of
sequence.

3 - Where to go?  Should the software automatically read a text file
for URIs and browse those?
yes, and more importantly capture the picture and save to file.

4 - What platform?  Windows?  UNIX/Linux?  MacOS?  Would you accept a
series of shell or perl scripts as "a software"?
We would prefer *nix based (so we can change around if it doesnt do
100%). Perl or shell script would be excellent. A fully function
windows program would also be acceptable.  no MacOs
Answer  
Subject: Re: Capture HTML to a picture (gif/jpg)
Answered By: easterangel-ga on 04 Jun 2003 08:29 PDT
Rated:5 out of 5 stars
 
Hi! Thanks for the question.

I was able to find three fully functional Windows programs that you
can use to convert HTML to JPG/GIF.

Our first link is your best bet and provides the solution you need.

HTML2JPG Features
 
a. "HTML2JPG creates a BMP or JPG image with the whole 
'vertical' content of the webpage."

b. "HTML2JPG produces an image of the size that you specify."

c. "HTML2JPG can operate in batch-mode and generate hundreds
of images without your intervention."

HTML2JPG
http://www.html2jpg.com/

Other programs I found are the following:

Zan Virtual Image Printer 
http://www.zan1011.com/

ePrint from Leadtools
http://www.leadtools.com/Utilities/PrinterDriver/ePrint-Features.htm

Search terms used:  
convert converter html jpg           
"html to jpeg" conversion 
                 
I hope these links would help you in your research. Before rating this
answer, please ask for a clarification if you have a question or if 
you would need further information. 
                 
Thanks for visiting us.                  
                 
Regards,                  
Easterangel-ga                  
Google Answers Researcher

Request for Answer Clarification by imi-ga on 04 Jun 2003 23:13 PDT
Can you find any Linux / Unix software that does the same thing as HTML2JPG

Clarification of Answer by easterangel-ga on 04 Jun 2003 23:32 PDT
Hi imi-ga! Thanks for asking a clarification before providing a
rating.

In your question clarification you specifically mentioned that

"A fully function windows program would also be acceptable.". This is
the reason I attempted to answer the question.

However, before posting the answer I still looked for a Unix or Linux
version but could not find anything. When I received this new
clarification I still looked but couldn't find anything.

Since this is an additional requirement for the question (a Linux
version), I suggest that you post a new version of this so that
another researcher might get a crack at it.

Thanks again.

Best Regards.
Easterangel-ga

Clarification of Answer by easterangel-ga on 04 Jun 2003 23:33 PDT
You may also add that you want a Unix version as well if possible.

Thanks!
imi-ga rated this answer:5 out of 5 stars
Great research, helpful information!

Comments  
Subject: Re: Capture HTML to a picture (gif/jpg)
From: alienintelligence-ga on 03 Jun 2003 22:56 PDT
 
Hi imi,

Have you considered "printing the web pages"
to a file? You can parse the postscript files 
later into a number of other file formats.
The final conversion could even be carried out
on another computer, to free up the resources
of the spidering computer. 

-AI

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy