Google Answers Logo
View Question
 
Q: URL search from list ( Answered,   0 Comments )
Question  
Subject: URL search from list
Category: Computers > Internet
Asked by: scottybaja-ga
List Price: $50.00
Posted: 06 Dec 2004 11:30 PST
Expires: 05 Jan 2005 11:30 PST
Question ID: 438884
From a list of say 500 complete company names (ex: APNA GHAR, INC.)
how do I search for the correct (or suspected) URLs all at the same
time?

Request for Question Clarification by leapinglizard-ga on 06 Dec 2004 11:42 PST
What is a "suspected" URL? And what exactly do you mean by searching?

I can provide a script that takes a list of names, forms them into
URLs based on a template of your choosing, and attempts to download
each of the corresponding home pages. Would this meet your needs?

leapinglizard

Clarification of Question by scottybaja-ga on 08 Dec 2004 07:33 PST
Sounds good. Do you know what the rough percentage of accuracy is for
your script? If a homepage is downloaded, does it tell you if it is
accurate? When I say searching, I would like to present the list of
company names to some kind of search engine (or database) that can
ideally give me back the correct URLs for each company. (I also have
accurate fields like address, city, state, zip to help if necessary.)
Does anything like that exist? A mute issue I know if your script is
highly accurate. Can I try it or send to you a sample list of
companies? Thanks. Scott

Request for Question Clarification by leapinglizard-ga on 08 Dec 2004 15:15 PST
Sure, give me a sample list. If it's very long, put it on a web page
and post a link to it below. Otherwise, just post the whole list. I'll
give you the results of my script, and if you find them pleasing, I'll
post the script as an answer.

By the way, the script has no idea of its own accuracy. That's for a
human to judge. If it knew when it got something wrong, it would keep
trying until it got the right answer, but I'm afraid it's not that
smart. Its accuracy depends on how well the companies are documented
on the web and on how precisely you give their names.

leapinglizard

Clarification of Question by scottybaja-ga on 11 Dec 2004 09:34 PST
Thanks LL. Here is the URL for some sample data. Would using phone
number be more accurate than company name?
http://prosites-scottpierson.homestead.com/forgoogle.html

Thanks.

Clarification of Question by scottybaja-ga on 15 Dec 2004 09:46 PST
Just double checking that you received the data you had requested. Thanks. Scott

Request for Question Clarification by leapinglizard-ga on 16 Dec 2004 16:30 PST
Yes, thank you, I did get it. I'm very busy at the moment with other
projects, but I'll attend to this shortly. Do let me know if it's
urgent.

leapinglizard

Clarification of Question by scottybaja-ga on 27 Dec 2004 07:43 PST
Say, we are getting a bit eager for this since you mentioned that a
script could give us the possible URLs. Any progress? Would you give
it some urgency? Thanks. Scott

Request for Question Clarification by leapinglizard-ga on 30 Dec 2004 04:24 PST
Scott,

I apologize for the long delay. I was so busy over Christmas that this
question dropped right off my radar screen. You now have my full
attention, however, and I hope I can still be of assistance to you.

I've hacked together a little Python script, not much more than 40
lines long, that uses a popular search engine to run a query against
each company name. Experimentation showed that it does more harm than
good to use additional data such as the address or phone number. Most
of the work I've done, and frankly it's not an enormous amount, is in
parsing the resulting HTML to extract the top hit for each query. The
cleaner alternative would be to use the API provided by the number-one
search engine, but users of this API are not permitted to run more
than 1000 queries a day. In order to avoid this restriction, I chose
to interface directly with the number-two search engine through HTTP
requests, hence the parsing.

I only parse the resulting HTML in a superficial way, relying on
markup features that may well change a few months or weeks from now.
In fact, I'm quite sure that my script won't last forever. Sooner or
later, the search engine will change its markup, and my parsing
routines will break. Another reason you might not want to accept this
script is its error rate. I haven't attempted to incorporate any
feedback algorithms, so the script blurts out the number-one hit for
the company name as supplied by the search engine, without computing a
confidence level.

I'll let you judge for yourself. Below are the guesses for the first
500 company names in the file you provided. Take a look at them and
decide whether you're satisfied with the accuracy. Note that in cases
where the search engine couldn't find any hits for the company name,
the script outputs "[zero hits]".

If you're not pleased with these guesses, I'll withdraw from your
question and leave it open for other Researchers. If you do find the
results acceptable, I can post as an answer: (a) my Python script with
usage instructions; or (b) just the guesses resulting from every
company name in your file. If you do want the script, I can't
guarantee it will last beyond the near future. I am willing to follow
up with code patches if it breaks within the next month or so, but I'm
not offering to make other kinds of modifications such as computing
confidence levels or refining the query-generation strategy.

Regards,

leapinglizard


1. 10,000 FRIENDS OF PENNSYLVANIA
    http://www.10000friends.org/

2. 1000 FRIENDS OF IOWA
    http://www.kfoi.org/

3. 1000 FRIENDS OF MINNESOTA, INC.
    http://www.rachel.org/orgList/orgResults.cfm?map=World&country_ID=all

4. 1000 FRIENDS OF NEW MEXICO, INC.
    http://web.state.nm.us/LOBBY/ORG.HTM

5. 113 CALHOUN STREET FOUNDATION CLEMSON UNIVERSITY COOP EXT Service
    http://calspace.ucsd.edu/spacegrant/contacts/allcontacts/allcontacts.html

6. 2PLUS, INC.
    http://www.2plus.com/

7. 3 RIVERS WET WEATHER, INC.
    http://www.3riverswetweather.org/

8. 420032013 EDWARD G. SCHLEIDER URBAN WASTE MANAGEMENT & RESEARCH CHAIR TRUST
    [zero hits]

9. 420032021 UNO-URBAN WASTE MANAGEMENT & RESEARCH FOUNDATION PROFESSORSHIP TRUST
    [zero hits]

10. A&M SERVICES
    http://www.aetv.com/

11. ABILITY COUNTS, INC.
    http://www.abilitycounts.org/

12. ABRAHAM LINCOLN MEMORIAL GARDEN FOUNDATION
    http://www.lmgnc.com/

13. ACRES, INC.
    http://www.icehorse.com/

14. ACTIVE CITIZENS TOGETHER IMPROVING OUR NEIGHBORHOODS, INC.
    http://www.newszap.com/yellowpages2/index.inn?loc=detail&area=3&cat=7

15. ADIRONDACK 46R CONSERVATION TRUST
    http://www.grenvillecc.ca/faculty/jchilds/adkauction/auction.html

16. ADIRONDACK LAND TRUST, INC.
    http://www.prfamerica.org/Stats-AdirondackNatureConservancy.html

17. ADIRONDACK MOUNTAIN CLUB, INC.
    http://www.adk.org/

18. ADIRONDACK TRAIL IMPROVEMENT SOCIETY INC
    http://www.marketingsource.com/associations/state/NY

19. ADOPT A BEACH
    http://www.adoptabeach.org.uk/

20. ADOPT-A-STREAM FOUNDATION
    http://www.streamkeeper.org/

21. ADOPT-A-WATERSHED, INC.
    http://www.swrcb.ca.gov/agendas/1999/march/0303-08.htm

22. ADVENTURE CENTRE AT PRETTY LAKE
    http://www.adventurecentre.org/

23. ADVENTURE DISCOVERY, INC.
    http://www.discoveryplace.org/

24. AFRICAN ENVIRONMENTAL FILM FOUNDATION
    http://www.aeffonline.org/

25. THE AFRICAN VIOLET SOCIETY OF AMERICA, INC.
    http://www.avsa.org/

26. AGASSIZ AUDUBON SOCIETY INC
    http://www.galenet.com/templates/src/help/dsphoto.htm

27. AIKEN COUNTY OPEN LAND TRUST, INC.
    http://www.privatelandownernetwork.org/landtrustlist.htm

28. AIREDALE TERRIER RESCUE & ADOPTION
    http://www.aire-rescue.com/

29. THE AKRON METRO PARKS FOUNDATION
    http://www.all-creatures.org/cash/let-20040303.html

30. ALABAMA COASTAL FOUNDATION, INC.
    http://www.alcoastalfoundation.org/

31. ALABAMA CONSERVATION & NATURAL RESOURCES FOUNDATION, INC
    http://www.dcnr.state.al.us/

32. ALABAMA ENVIRONMENTAL COUNCIL
    http://www.aeconline.ws/

33. ALABAMA FOREST RESOURCES CENTER, INC.
    http://www.preceda.com/assesment.htm

34. ALABAMA FORESTS FOREVER FOUNDATION
    http://www.alaforestsforever.org/

35. ALABAMA PALS
    http://www.alpals.org/

36. ALABAMA RIVERS ALLIANCE, INC.
    http://www.alabamarivers.org/dirdefghi.htm

37. ALABAMA TREASURE FOREST ASSOCIATION
    http://www.atfa.net/

38. ALABAMA WATER WATCH ASSOCIATION
    http://www.alabamawaterwatch.org/awwa/AWWABOD082203.htm

39. ALABAMA WATERFOWL ASSOCIATION, INC.
    http://www.alabamawaterfowl.org/

40. ALACHUA CONSERVATION TRUST, INC.
    http://www.privatelandownernetwork.org/landtrustlist.htm

41. ALAMO RC&D AREA, INC.
    http://www.flyball.com/ruffnecks/HH4-Judging_Schedule.xls

42. ALASKA ASSOCIATION OF CONSERVATION DISTRICTS, INC.
    http://www.rcdnet.org/mitigation.htm

43. ALASKA BOREAL FOREST COUNCIL
    http://www.akborealforest.org/

44. ALASKA BOTANICAL GARDENS
    http://www.alaskabg.org/

45. ALASKA CONSERVATION ALLIANCE
    http://www.akvoice.org/

46. ALASKA CRAFTSMAN HOME PROGRAM, INC.
    http://www.alaska.net/~achp

47. ALASKA FORUM FOR ENVIRONMENTAL RESPONSIBILITY
    http://www.alaskaforum.org/
    
48. ALASKA NATURAL HISTORY ASSOCIATION
    http://www.alaskanha.org/
    
49. ALASKA WATER MANAGEMENT ASSOCIATION
    http://www.awwma.org/
    
50. ALASKANS FOR LITTER PREVENTION & RECYCLING
    http://www.alparalaska.com/
    
51. ALBANY COMMUNITY LAND TRUST
    http://www.albanyclt.org/ 
    
52. ALBANY COUNTY LAND CONSERVANCY, INC.
    http://www.privatelandownernetwork.org/landtrustlist.htm
    
53. ALDO LEOPOLD NATURE CENTER, INC.
    http://www.madison.com/communities/aldo-leo
    
54. ALEWIFE NEIGHBORS, INC.
    http://www.alewifeneighbors.org/
    
55. ALFRED B SWANSON FOUNDATION
    http://www.handrehabfoundation.org/board.asp
    
56. ALLEN & ALICE STOKES NATURE CENTER
    http://www.allen-heath.com/
    
57. ALLEY POND ENVIRONMENTAL CENTER, INC.
    http://www.alleypond.com/about/maps.html
    
58. ALLIANCE FOR A LIVABLE WORLD
    http://www.amazon.com/exec/obidos/tg/detail/-/0415906504?v=glance
    
59. ALLIANCE FOR A LIVING OCEAN, INC.
    http://www.livingocean.org/hars.html
    
60. ALLIANCE FOR ENVIRONMENTAL RENEWAL, INC.
    http://www.rachel.org/orgList/orgResults.cfm?map=World&country_ID=all
    
61. ALLIANCE FOR THE CHESAPEAKE BAY, INC.
    http://www.acb-online.org/
    
62. ALLIANCE FOR THE WILD ROCKIES, INC
    http://www.wildrockies.org/
    
63. ALLIANCE TO END CHILDHOOD LEAD POISONING
    http://www.aeclp.org/
    
64. THE ALLIANCE TO SAVE ENERGY
    http://www.ase.org/
    
65. ALTAMAHA RIVERKEEPER, INC.
    http://www.altamahariverkeeper.org/
    
66. ALTERNATIVE ENERGY RESOURCES ORGANI
    http://www.energyideas.org/topics/default.cfm?o=h%2Cb%2Cbs%2Ct%2Cts&c=h%2Cb%2C3%2Ct%2C24&s_qob=title&s_qmr=50

67. AMARILLO BOTANICAL GARDENS
    http://www.amarillobotanicalgardens.org/

68. THE AMAZON ALLIANCE FOR TRADITIONAL & INDIGENOUS PEOPLES OF THE AMAZON BASIN
    http://www.amazonalliance.org/

69. AMAZON CONSERVATION ASSOCIATION
    http://www.amazonconservation.org/

70. AMAZON WATCH
    http://www.amazonwatch.org/

71. AMD & ART, INC., N.P.
    http://www.amd.com/

72. AMERICA IRIS SOCIETY
    http://www.hosta.org/

73. AMERICA RECYCLES DAY, INC.
    http://www.americarecyclesday.org/

74. AMERICA THE BEAUTIFUL FUND
    http://www.america-the-beautiful.org/

75. AMERICA'S CLEAN WATER FOUNDATION
    http://www.acwf.org/

76. AMERICA'S RIVER COMMUNITIES, INC.
    http://www.rivercommunities.org/

77. AMERICA'S WATERSHED LANDKEEPER, INC.
    http://www.chattahoochee.org/

78. AMERICAN ASSOCIATION OF BOTANICAL GARDENS & ARBORETA
    http://www.aabga.org/

79. AMERICAN BOTANICAL COUNCIL
    http://www.herbalgram.org/

80. AMERICAN CAVE CONSERVATION ASSOCIATION
    http://www.cavern.org/

81. AMERICAN CHESTNUT FOUNDATION
    http://www.acf.org/

82. AMERICAN CHESTNUT LAND TRUST, INC.
    http://www.acltweb.org/Administration/join.cfm

83. AMERICAN CONIFER SOCIETY
    http://www.conifersociety.org/

84. AMERICAN COUNCIL FOR AN ENERGY EFFICIENT ECONOMY
    http://www.aceee.org/

85. AMERICAN DAFFODIL SOCIETY
    http://www.daffodilusa.org/

86. AMERICAN DISCOVERY TRAIL SOCIETY
    http://www.discoverytrail.org/

87. AMERICAN FLORAL ENDOWMENT
    http://www.endowment.org/

88. AMERICAN FOREST ALLIANCE, INC
    http://frozen.icom.com/

89. AMERICAN FOREST FOUNDATION
    http://www.americanforests.org/

90. AMERICAN FORESTS
    http://www.americanforests.org/index.php

91. AMERICAN FREE TREE PROGRAM INC
    http://enn.com/

92. AMERICAN FRIENDS OF IUED
    http://www.fluoridealert.org/news/831.html

93. AMERICAN FRIENDS OF SPNI, INC.
    http://www.frommers.com/destinations/print_narrative.cfm?destID=227&catID=0227030056

94. AMERICAN GROUND WATER TRUST
    http://www.agwt.org/

95. AMERICAN HORTICULTURAL SOCIETY
    http://www.ahs.org/

96. AMERICAN HOSTA SOCIETY
    http://www.hosta.org/

97. AMERICAN LAND CONSERVANCY
    http://www.alcnet.org/

98. American Land Institute
    http://www.landinstitute.org/

99. AMERICAN LANDS ALLIANCE
    http://www.americanlands.org/

100. AMERICAN LITTORAL SOCIETY
    http://www.alsnyc.org/

101. AMERICAN OCEANS CAMPAIGN
    http://www.oceana.org/

102. AMERICAN ORCHID SOCIETY, INC.
    http://orchidweb.org/

103. AMERICAN PUBLIC INFORMATION ON THE ENVIRONMENT
    http://www.americanpie.org/

104. AMERICAN RHODODENDRON SOCIETY
    http://www.rhododendron.org/

105. AMERICAN RIVER CONSERVANCY
    http://www.arconservancy.org/

106. AMERICAN RIVER NATURAL HISTORY ASSOCIATION
    http://www.dcn.davis.ca.us/vme/ARNHA

107. AMERICAN RIVER PARKWAY FOUNDATION, INC. 
    http://www.arpf.org/MapAnnouncement.htm

108. AMERICAN RIVERS, INC.
    http://www.amrivers.org/

109. THE AMERICAN ROSE SOCIETY
    http://ars.org/

110. AMERICAN SOCIETY FOR HORTICULTURAL SCIENCE
    http://www.ashs.org/

111. AMERICAN SOLAR ENERGY SOCIETY, INC.
    http://www.ases.org/

112. AMERICAN STORYBOARD, INC
    http://www.theronan.org/storyboard.htm

113. AMERICAN WATER RESOURCES ASSOCIATION
    http://www.awra.org/

114. AMERICAN WHITEWATER AFFILIATION, INC.
    http://www.americanwhitewater.org/

115. AMERICAN WILDERNESS FOUNDATION
    http://www.awfguides.org/

116. AMERICAN WILDLANDS 
    http://www.wildlands.org/

117. AMERICAN WOMEN'S HERITAGE SOCIETY, INC.
    http://www.awhsinc.org/

118. AMERICAN-HELLENIC EDUCATIONAL CENTER, INC.
    http://www.gaepis.org/

119. AMERICANS FOR EQUITABLE CLIMATE SOLUTIONS
    http://www.cpc-inc.org/

120. AMERICANS FOR OUR HERITAGE & RECREATION
    http://www.ahrinfo.org/

121. AMERICANS FOR THE ENVIRONMENT
    http://www.cnie.org/NAE

122. AMIGOS BRAVOS INC
    http://www.rioweb.org/Partners/AmigosBravos

123. ANACOSTA WATERSHED SOCIETY, INC.
    http://www.pacd.org/products/bmp/bmp_append_b.htm

124. ANCHORAGE WATERWAYS COUNCIL
    http://www.anchwaterwayscouncil.org/

125. ANDOVER VILLAGE IMPROVEMENT SOCIETY
    http://www.avisandover.org/

126. ANGELINA BEAUTIFUL/CLEAN
    http://go-lufkin.com/abclean

127. THE ANNAPOLIS CENTER FOR ENVIRONMENTAL QUALITY, INC.
    http://www.aerodyne.com/caec/caec.html

128. THE ANTARCTICA PROJECT
    http://www.asoc.org/

129. ANZA-BORREGO DESERT NATURAL HISTORY ASSOCIATION
    http://www.california-desert.org/

130. APALACHICOLA BAY & RIVER KEEPER, INC.
    http://www.baynavigator.com/

131. APPALACHIAN EDUCATION & RECREATION SERVICES, INC
    http://www.ael.org/

132. APPALACHIAN MOUNTAIN CLUB
    http://www.outdoors.org/

133. APPALACHIAN RESOURCE CONSERVATION & DEVELOPMENT COUNCIL, INC.
    http://pages.preferred.com/~anetrcd

134. APPALACHIAN VOICES
    http://www.appvoices.org/

135. AQUATIC ECOSYSTEM RESTORATION FOUNDATION
    http://www.aquatics.org/

136. AQUATIC FNDTN OF METRO L.A.
    http://www.metroatlantachamber.com/macoc/members/newmember_archives_2002.shtml

137. AQUATIC OUTREACH INSTITUTE
    http://www.aoinstitute.org/

138. AQUIDNECK ISLAND LAND TRUST
    http://www.ailt.org/

139. ARBOR HILL ENVIRONMENTAL JUSTICE CORPORATION
    http://www.timesunion.com/communities/ahej

140. ARBORETUM FOUNDATION
    http://www.arboretum.org/

141. ARC ECOLOGY
    http://www.arcecology.org/

142. ARCATA COMMUNITY RECYCLING CENTER, INC.
    http://www.nrc-recycle.org/councils/NPRC/acrc.htm

143. ARDMORE MAIN STREET AUTHORITY
    http://www.ardmoremainstreet.com/

144. ARID LANDS PROJECT
    http://www.gis.uiuc.edu/mojave

145. ARIZONA ASSOC OF CONSERVATION DISTRICTS
    http://www.azod.com/Conservation%20news/Archive/2003/WCC/H2371.htm

146. ARIZONA CLEAN & BEAUTIFUL, INC.
    http://www.azclean.org/

147. ARIZONA COMMUNITY TREE COUNCIL
    http://aztrees.org/

148. ARIZONA NATURAL HISTORY ASSOCIATION, INC.
    http://www.appl.org/members-by-state.html

149. ARIZONA SOLAR VILLAGE CORPORATION dba TUCSON INSTITUTE FOR
SUSTATINABLE COMMUN
    [zero hits]

150. ARIZONA STRIP INTERPRETIVE
    http://www.az.blm.gov/asfo/asia/asia.htm

151. ARKANSAS ENVIRONMENTAL FEDERATION
    http://www.environmentark.org/

152. ARKANSAS FORESTRY ASSOCIATION EDUCATION FOUNDATION, INC.
    http://www.arkforests.org/forestry_links.html

153. ARKANSAS PUBLIC POLICY PANEL, INC.
    http://www.arpanel.org/educ/motionfriendofthecourtbrief.html

154. ARMAND BAYOU NATURE CENTER, INC.
    http://www.tripadvisor.com/Attraction_Review-g56003-d105914-Reviews-Armand_Bayou_Nature_Center-Houston_Texas.html
    
155. ARMANIAN AMERICAN MEDICAL SOCIETY OF CA INC
    http://www.radiologyresearch.org/hamids.htm
    
156. THE ARTS PROJECT OF CHERRY GROVE, NY INC.
    http://www.celticgrove.com/
    
157. ASBESTOS VICTIMS OF AMERICA
    http://www.house.gov/judiciary/10125.htm
    
158. ASIAN PACIFIC ENVIRONMENTAL NETWORK
    http://www.apen4ej.org/ 
    
159. ASPEN CENTER FOR ENVIRONMENTAL STUDIES
    http://www.aspennature.org/

160. ASPETUCK LAND TRUST, INC
    http://www.aspetucklandtrust.org/html/about.html

161. ASSOCATION FOR RESOURCE CONSERVATION
    http://www.sca-inc.org/

162. ASSOCIATION FOR BIODIVERSITY INFORMATION
    http://www.natureserve.org/index.jsp

163. ASSOCIATION FOR EFFICIENT ENVIRONMENTAL ENERGY SYSTEMS
    http://www.aeees.org/

164. ASSOCIATION FOR LABORATORY AUTOMATION
    http://labautomation.org/

165. ASSOCIATION FOR THE CORAL ENVIRONMENT, INC.
    http://www.eco.org/

166. ASSOCIATION FOR THE PROTECTION OF THE ADIRONDACKS, INC
    http://www.prfamerica.org/Stats-AssociationProtectionAdir.html

167. ASSOCIATION OF NEW JERSEY ENVIRONMENTAL COMMISSIONS
    http://www.anjec.org/

168. ASSOCIATION OF STATE & INTERSTATE WATER POLLUTION CONTROL ADMINISTRATORS
    http://www.stateforesters.org/

169. ASSOCIATION OF STATE DRINKING WATER ADMINISTRATORS
    http://www.asdwa.org/

170. ASSOCIATION OF U.S. DELEGATES TO THE GULF OF MAINE COUNCIL ON MARINE ENVIRON.
    http://members.aol.com/davelinc

171. ASSOCIATION OF VERMONT RECYCLERS
    http://www.vtrecyclers.org/

172. ASSOCIATION OF WETLAND MANAGERS INC
    http://www.aswm.org/

173. THE ASSOCIATION OF ZONE A&B HOMEOWNERS INC.
    http://www.freezone.org/

174. ASTICOU TERRACES TRUST
    http://www.maineolmsted.org/journal/resources/thuyalibrary.html

175. ATHENS HOCKING COUNTY RECYCLING CENTER INC
    http://home.frognet.net/~recycle

176. ATHENS LIMESTONE CLEAN COMMUNITY
    http://www.athens.edu/news/01082001.htm

177. THE ATLANTA BOTANICAL GARDEN, INC.
    http://www.atlantabotanicalgarden.org/

178. ATLANTIC STATES LEGAL FOUNDATION INC.
    http://www.aslf.org/

179. AU SABLE INSTITUTE OF ENVIRONMENTAL STUDIES
    http://www.ausable.org/

180. AUDUBON CANYON RANCH, INC.
    http://www.egret.org/

181. AUDUBON NATURALIST SOCIETY OF THE CENTRAL ATLANTIC STATES, INC.
    http://www.si.edu/archives/archives/findingaids/FARU7294.htm

182. Audubon Society of Greater Denver
    http://www.denveraudubon.org/

183. AUDUBON SOCIETY OF NEW HAMPSHIRE
    http://www.nhaudubon.org/

184. THE AUDUBON SOCIETY OF NEW YORK STATEINC
    http://dc.preferredjobs.com/altsearch/search.asp

185. AUDUBON SOCIETY OF OHIO
    http://www.audubon.org/chapter/oh

186. AUDUBON SOCIETY OF OMAHA
    http://audubon-omaha.org/

187. THE AUDUBON SOCIETY OF RHODE ISLAND
    http://www.asri.org/

188. AUDUBON SOCIETY OF THE EVERGLADES, INC.
    http://www.auduboneverglades.org/bylaws.htm

189. AUDUBON SOCIETY OF WESTERN PA. 
    http://www.aswp.org/

190. AUNTIE LITTER & U.S. INC. 
    http://www.auntielitter.org/

191. AURORA PROJECT, INC. 
    http://www.vandalia.org/aurora.shtm

192. AUSBON SARGENT LAND PRESERVATION TRUS
    [zero hits]

193. AUSTIN COMMUNITY GARDENS
    http://www.main.org/sacgarden
    
194. AUTONOMOUS UNDERSEA SYSTEMS INSTITUTE
    http://www.ausi.org/
    
195. AVIAN RESEARCH & CONSERVATION INSTITUTE, INC
    http://www.suttoncenter.org/
    
196. AWBURY ARBORETUM ASSOCIATION
    http://www.awbury.org/
    
197. BABB CREEK WATERSHED ASS. DVC
    [zero hits]
    
198. BADLANDS RC & D
    http://www.sdrcd.org/badlands.html
    
199. BALLARD FAMILY NATURE CENTER, INC.
    http://nt2.advant.com/kuocgi2/worc/toc.pl
    
200. BARNSTABLE LAND TRUST, INC.
    http://ray.cape.com/404.html
    
201. BARRIER BEACH PRESERVATION ASSN INC
    http://www.uwm.edu/Libraries/arch/FndIndst/Ser%20357_%20Dane.html
    
202. BARRIER ISLAND PARKS SOCIETY, INC.
    http://www.barrierislandparkssociety.org/
    
203. BARRIER ISLAND TRUST INC
    http://www.privatelandownernetwork.org/landtrustlist.htm
    
204. BASE CAMPS WILDERNESS INSTITUTE RV CONSUMER GROUP
    http://www.rv.org/whoarewe.htm 
    
205. BASS LIFE ASSOCIATES, INC.
    http://www.josseybass.com/
    
206. BATON ROUGE GREEN ASSOCIATION INC.
    http://www.batonrougerealtors.com/
    
207. BAY AREA ADVERTISING RELIEF COMMITTEE
    http://www.gregory.clow.com/infopages/adbaarc.htm
    
208. BAY AREA COMMUTER SERVICES, INC.
    http://www.tampabayrideshare.org/

209. BAY AREA ENVIRONMENTAL RESEARCH INSTITUTE
    http://www.baeri.org/

210. THE BAY INSTITUTE OF SAN FRANCISCO, INC.
    http://www.baychef.com/

211. BAY RIDGE PARKS & WATERFRONT COUNCIL
    http://www.ridgetrail.org/

212. BAYFRONT ACCESS & BEAUTIFICATION ORGANIZATION
    http://www.waterfrontvirtualtours.com/442palmct

213. BAYOU BEND GARDENS ENDOWMENT
    http://www.riveroaksgardenclub.org/CivicContribution.cfm

214. BAYOU CHICO ASSOCIATION, INC.
    http://www.wfrpc.dst.fl.us/barc/barc_weblinks.htm

215. BAYOU PRESERVATION ASSOCIATION, INC.
    http://www.cechouston.org/groups/bpa.html

216. BEACON HILL GARDEN CLUB, INC.
    http://www.worldcatlibraries.org/wcpa/ow/f6d52171bd230219.html

217. BEAUCHAMP TOWER INFO. SYSTEMS INC.
    http://www.irs.gov/irb/2004-16_IRB/ar16.html

218. BEAUFORT COUNTY OPEN LAND TRUST
    http://www.openlandtrust.com/

219. BEAUMONT PRODUCTS & SERVICES, INC.
    http://www.citrusmagic.com/

220. BEAVER CREEK WETLANDS ASSOCIATION, INC.
    http://yosemite.epa.gov/water/adopt.nsf/FTsearchForm?readform&Limit=200&Query=Field+AdoptSiteHUC+Contains+05090202

221. BEAVERKILL CONSERVANCY, INC.
    http://www.dec.state.ny.us/website/enb2001/20010131/not4.html

222. BECZAK ENVIRONMENTAL EDUCATION CENTER, INC.
    http://www.beczak.org/

223. BEDFORD AUDUBON SOCIETY
    http://www.bedfordaudubon.org/

224. BELGRADE REGIONAL CONSERVATION ALLIANCE
    http://www.belgradelakes.org/

225. BELOIT CONVENTION & VISITORS BUREAU
    http://www.visitbeloit.com/

226. BENJAMIN WEGERZYN HORTICULTURAL ASSOCIATION INC
    [zero hits]

227. BERGEN SWAMP PRESERVATION SOCIETY, INC.
    http://www.bergenswamp.org/membership-printable.htm

228. BERKELEY ECO HOUSE
    http://www.ecohouse.org/

229. BERKSHIRE GARDEN CENTER, INC. dba BERKSHIRE BOTANICAL GARDEN
    http://www.olin.wustl.edu/wcrc/cf/search.cfm

230. BERKSHIRE NATURAL RESOURCES COUNCIL, INC.
  http://www2.primushost.com/~mltc/demo/TEXT/NEARTRUST/landtrusts/berkshire.html

231. BERKSHIRE-PIONEER RESOURCE CONSERVATION & DEVELOPMENT AREA, INC.
    http://members.aol.com/berkpiorcd
    
232. BERMUDA CENTER 63-20, INC.
    http://facility-services.state.nc.us/nh.txt
    
233. THE BERRY BOTANIC GARDEN
    http://www.berrybot.org/
    
234. BETHLEHEM STEELWORKERS MEMORIAL COMMITTEE
    http://www.tnonline.com/archives/news/1999/03.30/business.html
    
235. BEXAR COUNTY MASTER GARDENERS, INC.
    http://www.hal-pc.org/~trobb/mastgar.html
    
236. BEYOND PESTICIDES: NCAMP
    http://www.beyondpesticides.org/
    
237. BHAILI HEALTH & EDUCATION FOUNDAT
    http://www.mahagujarat.com/../services/pincode/bpincode.htm
    
238. BIDDEFORD POOL LAND TRUST
    http://www.privatelandownernetwork.org/resource.asp?id=1139
    
239. BIG COUNTRY RC&D AREA, INC.
    http://www.cbcrcd.org/ 
    
240. BIG ISLAND RC&D
    http://www.bigislandrcandd.org/
    
241. BIG LAKE QUALITY WATER ASSOCIATION
    http://www.bigstarlake.org/waterquality.shtml
    
242. BIG SIOUX NURSERY, INC.
    http://sbcc.northern.edu/BJF2004EmployerList.htm
    
243. THE BIG SKY COMMUNITY CORPORATION
    http://www.bigskymt.org/3-19-04.html

244. BIG SUR LAND TRUST
    http://www.bigsurlandtrust.org/

245. THE BILLFISH FOUNDATION
    http://www.billfish.org/

246. BIO-INTEGRAL RESOURCE CENTER
    http://www.birc.org/

247. BIODIVERSITY INSTITUTE
    http://www.pacificbio.org/

248. BIODIVERSITY LEGAL FOUNDATION
    http://www.kscourts.org/ca10/cases/1998/06/97-1131.htm

249. THE BIODIVERSITY PROJECT, INC.
    http://www.biodiversityproject.org/

250. BIODYNAMIC FARMLAND CONSERVATION TRUST, INC.
    http://www.brookfieldfarm.org/history.html

251. BIRMINGHAM ENVIRONMENTAL CLEARINGHOUSE
    http://www.epa.gov/oppt/ejp2/al.htm

252. BITTER ROOT RESOURCE CONSERVATION & DEVELOPMENT AGENCY, INC.
    http://bitterrootrcd.org/ceds.html

253. BLACK DIAMOND RESOURCE CON & DEV INC.
    http://www.firstdiamondgroup.com/diamond

254. BLACK HILLS RESOURCE CONSERVATION & DEVELOPMENT
    http://www.wyormef.org/blackhillsconservation.html

255. BLACK MOUNTAIN FORESTRY CENTER
    http://www.blackmountainforestry.com/

256. BLACK POINT HISTORICAL PRESERVE, INC. MICHAEL ELLSWORTH & ASSOCIATES
    http://chpc.lib.uconn.edu/Surveydetailpage2.cfm

257. BLACK ROCK FOREST CONSORTIUM, INC.
    http://nysparks.state.ny.us/grants/award_zbga.htm

258. BLACK ROCK FOREST PRESERVE, INC.
    http://www.earthinstitute.columbia.edu/library/earthmatters/fall1997/earth_curriculum.html

259. THE BLACKFOOT CHALLENGE, INC.
    http://ims.geodata-mt.com/projects.htm

260. BLACKHAWK HILLS RESOURCE CONSERVATION & DEVELOPMENT AREA
    http://www.blackhawkhills.com/

261. BLACKLICK CREEK WATERSHED ASSOCIATION
    http://www.pawatersheds.org/WatershedDirectory/index.asp

262. BLACKLOCK NATURE SANCTUARY
    http://www.blacklock.org/

263. BLOCK ISLAND CONSERVANCY, INC.
    http://biconservancy.org/bicmembership.htm

264. BLOOM
    http://www.theorlandobloomfiles.com/

265. BLOOMFIELD CONSERVANCY, INC
    http://www.conedison.com/coned_NY/partnerships/environment.html

266. BLUE HILL HERITAGE TRUST
    http://www.mltn.org/trustdetails.asp?id=1192

267. BLUE HILLS ENVIRONMENTAL ASSOCIATION NON-PROFIT
    http://www.neponset.org/BiodiversityDays.htm

268. BLUE MOUNTAINS HABITAT RESTORATION COUNC
    http://www.npwrc.usgs.gov/resource/literatr/wetresto/wetresto.txt

269. BLUE PLANET FOUNDATION
    http://www.blueplanetfoundation.com/

270. BLUE RIDGE ENVIRONMENTAL DEFENSE LEAGUE, INC.
    http://www.nirs.org/mox/80%2C000galwastemox.htm

271. BLUE RIDGE PARKWAY FOUNDATION
    http://www.brpfoundation.org/

272. BLUEGRASS CONSERVANCY INC
    http://www.bluegrassconservancy.org/devdir.html

273. BLUEGRASS REGIONAL RECYCLING CORP.
    http://www.bigsandy.org/AddInfo/ExecutiveReview/Applicant_Search.asp

274. BOAT U.S. CLEAN WATER TRUST
    http://www.boatus.com/cleanwater

275. BONNEVILLE ENVIRONMENTAL FOUNDATION
    http://www.b-e-f.org/

276. BOOJUM INSTITUTE FOR EXPERIENTIAL EDUCATION
    http://www.boojum.org/

277. BOONE & CROCKETT CLUB
    http://www.bccn.boone.in.us/

278. BOOTHBAY REGION LAND TRUST, INC.
    http://bbrlt.org/

279. BORDER ECOLOGY PROJECT INC.
    http://www.borderecoweb.sdsu.edu/bew/drct_pgs/b/bep.html

280. BOSSIER CITY CLEAN CITY COMMITTEE
    http://www.bossiercity.org/dept/CCC/memorial_page.htm

281. BOSTON GREENSPACE ALLIANCE, INC.
    http://www.greenspacealliance.org/

282. BOSTON NATURAL AREAS FUND, INC.
    http://www.ziplink.net/users/gongora
    
283. BOSTON URBAN GARDENERS AT THE COMMUNITY FARM, INC.
    http://www.umass.edu/comec/charities/multiregion/envirofedne.html
    
284. BOTANIC GARDENS CONSERVATION INTERNATIONAL (US), INC.
    http://bgci.org/botanic_gardens/building_american_bgci_network.html
    
285. BOTANICA, INC.
    http://www.bio-botanica.com/
    
286. THE BOTANICAL & NATURE INSTITUTE OF SOUTH TEXAS, INC.
    http://www.mobot.org/
    
287. BOTANICAL SOCIETY OF AMERICA INC.
    http://www.botany.org/
    
288. BOULDER COUNTY NATURE ASSOCIATION
    http://www.bcna.org/
    
289. BOUNDARY WATERS WILDERNESS FOUNDATION
    http://www.friends-bwca.org/joinus/donate.html
    
290. BOURNE CONSERVATION TRUST
    http://www.capecodcommission.org/landbank/trusts.htm
    
291. BOWMAN'S HILL WILDFLOWER ASS'N, INC.
    [zero hits]
    
292. BRANDYWINE CONSERVANCY, INC
    http://www.brandywinemuseum.org/
    
293. BRANDYWINE VALLEY ASSOCIATION, INC.
    http://www.bvbb.com/
    
294. Branford Land Trust, Inc.
    http://www.branfordlandtrust.org/
    
295. BRASS CENTER LIMITED
    http://www.mailboxes.com/product.asp?Catalog_name=Mailboxes&Category_name=Free+Standing+Roll-a-Bouts&Product_id=295&Related_Product=&sCondition=CategoryName%3D'Free+Standing+Roll-a-Bouts'+And+DefinitionName%3D'Mailbox'&Previous=yes

296. BRATTLEBORO AREA COMMUNITY LAND TRUST, INC.
    http://www.baclt.org/projects.html

297. Brazos Beautiful, Inc.
    http://www.keepbrazosbeautiful.org/

298. BRIDGEWATER LAND TRUST, INC.
    http://www.lta.org/findlandtrust/MA.htm

299. BRING RECYCLING
    http://www.bringrecycling.org/

300. BRISTOL REGIONAL ENVIRONMENTAL CENTER
    http://www.appalachianforest.org/

301. BROAD RIVER WATERSHED ASSOCIATION
    http://home.earthlink.net/~broadriverwa/brwa.html

302. THE BROADMOOR GARDEN CLUB
    http://www.nwlagardener.org/broadmoor.html

303. BRONX RIVER RESTORATION PROJECT, INC.
    http://www.bronxriver.org/theRiver.cfm

304. BROOKLYN BOTANIC GARDEN CORPORATION
    http://www.bbg.org/sup/sample_wording.html

305. BROOKLYN CENTER FOR URBAN ENVIRONMENT
    http://www.bcue.org/

306. BRUKNER NATURE CENTER
    http://www.bruknernaturecenter.com/

307. BRUNSWICK-TOPSHAM LAND TRUST, INC.
    http://www.mltn.org/trusts.asp

308. BRYCE CANYON NATURAL HISTORY ASSOC.
    http://www.nps.gov/brca/nhordrfrm.htm

309. BTA/BOLT INC
    http://www.btabolt.org/equestrian/links.htm

310. BUCK HILL CONSERVATION FOUNDATION
    http://www.conserveland.org/landtrust/one?conservancy_id=4191

311. BUCKELEW COMMUNITY HSG DEV ORG
    http://www.politicsol.com/reports/2001_consumer_action_handbook.txt

312. BUCKEYE FOREST COUNCIL
    http://www.buckeyeforestcouncil.org/

313. BUCKS COUNTY AUDUBON SOCIETY
    http://www.bcas.org/

314. BUCKS COUNTY HORSE PARK
    http://www.buckscountyhorsepark.org/

315. BUENA VISTA AUDUBON SOCIETY
    http://www.bvaudubon.org/

316. THE BUFF FOUNDATION
    http://www.drugstore.com/product.asp?pid=56037

317. BUFFALO BAYOU PARTNERSHIP, INC.
    http://www.buffalobayou.org/

318. BUSINESS FOR THE ENVIRONMENT
    http://www.sustainablebusiness.com/

319. BUTTE DES MORTS CONSERVATION CLUB INC
    http://www.bdmcc.org/

320. BUTTERFLY HOPE
    http://www.butterflyhope.org/

321. CA. CITRUS STATE HISTORIC PARK
    http://www.parks.ca.gov/?page_id=649

322. CAHABA RIVER SOCIETY, INC.
    http://www.capitalresearch.org/search/orgdisplay.asp?org=CRS200

323. CALIFORNIA & OREGON FISH ENHANCEMENT INC
    http://www.ca.gov/
    
324. CALIFORNIA ARBORETUM FOUNDATION, INC.
    http://www.treeware.com/alpha.html
    
325. CALIFORNIA ARTIFICIAL REEF ENHANCEMENT
    http://www.calreefs.org/
    
326. CALIFORNIA ASSOCIATION OF RESOURCE CONSERVATION DISTRICTS
    http://www.carcd.org/
    
327. CALIFORNIA AUDUBON SOCIETY
    http://www.audubon-ca.org/
    
328. CALIFORNIA BONSAI SOCIETY
    http://www.california-bonsai-society.org/
    
329. CALIFORNIA ENVIRONMENTAL PROJECT
    http://ceres.ca.gov/
    
330. CALIFORNIA ENVIRONMENTAL TRUST
    http://www.rangelandtrust.org/ 
    
331. CALIFORNIA EXOTIC PEST PLANT COUNCIL
    http://www.caleppc.org/
    
332. CALIFORNIA FOUNDATION ON THE ENVIRONMENT & THE ECONOMY
    http://www.surfrider.org/
    
333. CALIFORNIA NATIVE PLANT SOCIETY (CHAPTERS)
    http://www.cnps.org/
    
334. CALIFORNIA OAK FOUNDATION
    http://www.californiaoaks.org/
    
335. CALIFORNIA RANGELAND TRUST
    http://www.rangelandtrust.org/
    
336. THE CALIFORNIA RESIDENCE FOUNDATION
    http://www.johnlautner.org/biblio.html
    
337. CALIFORNIA RESOURCE RECOVERY ASSOCIATION
    http://www.crra.com/
    
338. CALIFORNIA WATER ENVIRONMENT ASSOCIATION
    http://www.cwea.org/ 
    
339. CALIFORNIA WILDERNESS COALITION
    http://www.calwild.org/
    
340. CALIFORNIANS FOR ALTERNATIVES TO TOXIC SPRAYS
    http://my.execpc.com/~mjstouff/articles/horsefly.html
    
341. CALUSA LAND TRUST & NATURE PRES OF PINE ISLAND INC
    http://www.calusalandtrust.org/
    
342. CAMDEN CITY GARDEN CLUB, INC.
    http://www.camdenchildrensgarden.org/mission.html
    
343. CAMP CHAUTAQUA FOUNDATION, INC.
    http://www.newpaltz.edu/careers/search.cfm
    
344. CAMPAIGN FOR A PROSPEROUS GEORGIA, INC.
    http://www.law.emory.edu/11circuit/aug98/96-8655.man.html
    
345. CAMPAIGN RECYCLE MAUI
    http://www.co.maui.hi.us/departments/Public/recycle.htm
    
346. CANAAN VALLEY INSTITUTE INC
    http://www.canaanvi.org/

347. CANAL CORRIDOR ASSOCIATION
    http://www.canalcor.org/

348. CANNON RIVER WATERSHED PARTNERSHIP
    http://www.crwp.net/

349. CANYON AREA RESIDENTS FOR THE ENVIRONMENT
    http://www.c-a-r-e.org/

350. CANYONLANDS NATURAL HISTORY ASSOCIATION
    http://www.cnha.org/

351. CAPE & ISLANDS SELF RELIANCE, CORP
    http://www.capecod.com/

352. CAPE FEAR BOTANICAL GARDEN
    http://fayettevilleonline.com/botanicalgarden

353. CAPITOL CLEAN CITIES OF CONNECTICUT
    http://www.nationalcleancities.org/ChapterProgs/chapters.asp

354. CAPITOL REEF NATURAL HISTORY ASSOCIATION
    http://www.nps.gov/care/nha.htm

355. CAREGIVERS OF CENTRAL OCEAN COUNTY, INC.
    http://www.unitedwayofocean.com/certifiedagencylisting.htm

356. CARLSBAD HORTICULTURE SOCIETY
    http://www.emnrd.state.nm.us/nmparks/PAGES/CONCESS/ldzgshop.htm

357. CARMEL RIVER STEELHEAD ASSOCIATION
    http://www.carmelriverwatershed.org/crsa.html

358. CARNIVORE PRESERVATION, INC.
    http://www.cptigers.org/

359. CAROLINA RECYCLING ASSOCIATION
    http://www.cra-recycle.org/

360. CAROLINAS AIR POLLUTION CONTROL ASSOCIATION
    http://www.capca-carolinas.org/

361. CAROLINE DORMAN NATURE PRESERVE TRUST
    http://www.explorenatchitoches.com/outdoors.php?task=view&articleID=87

362. CARTH
    http://www.starwars.com/databank/character/carthonasi

363. CASCADE LAND CONSERVANCY
    http://www.cascadeland.org/

364. CASE-MIDDLE ATLANTIC DISTRICT II
    http://www.martsandlundy.com/site/pp.asp?c=9qLMKYNKE&b=6616

365. CATALINA SEA BASS FUND, INC.
    http://www.erieri.com/erimodule/waw_1.cfm?firstletter=C

366. CATAWBA LANDS CONSERVANCY
    http://www.catawbalands.org/

367. CATAWBA SCIENCE CENTER, INC.
    http://www.tryscience.net/TrySciRegA.nsf/4df89828cdcdebe58525687a006f415e

368. CATSKILL CTR. FOR CONSERVATION & DEVELOP
    http://www.norcrossws.org/Grants/grant20.htm

369. THE CAUMSETT FOUNDATION, INC.
    http://www.farrellfritz.com/community/community_set.html
    
370. CAVE CONSERVANCY FOUNDATION
    http://members.aol.com/cavecfinc
    
371. CAVE CONSERVANCY OF THE VIRGINIAS
    http://members.aol.com/caveconser
    
372. CAYUGA NATURE CENTER, INC.
    http://www.cayuganaturecenter.org/
    
373. CAZENOVIA PRESERVATION FOUNDATION
    http://www.privatelandownernetwork.org/landtrustlist.htm
    
374. CECIL LAND TRUST, INC.
    http://www.lta.org/findlandtrust/MD.htm
    
375. CEDAR LAKE PARK ASSOCIATION
    http://www.cedarlakepark.org/
    
376. CEDAR LAKES CONSERVATION FOUNDATION
    http://www.hnet.net/good/clcf
    
377. CEDAR RUN CONSERVANCY INC.
    http://www.heartlandcrossing.com/home.html
    
378. CENTENNIAL LAND TRUST
    http://coloradopartners.fws.gov/co5.htm
    
379. CENTER FOR A SUSTAINALBE COAST
    http://www.marlinnut.com/dcforum/DCForumID2/64.html
    
380. CENTER FOR AGRICULTURAL PARTNERSHIPS, INC.
    http://www.agcenter.org/
    
381. CENTER FOR ALASKAN COASTAL STUDIES
    http://www.akcoastalstudies.org/
    
382. CENTER FOR BIOLOGICAL DIVERSITY, INC.
    http://www.undueinfluence.com/southwes1.htm
    
383. CENTER FOR CHESAPEAKE COMMUNITIES
    http://www.chesapeakecommunities.org/
    
384. CENTER FOR CLEAN AIR POLICY
    http://www.ccap.org/
    
385. CENTER FOR ECOLOGICAL TECHNOLOGY, INC.
    http://www.cetonline.org/
    
386. CENTER FOR ECOSYSTEM SURVIVAL
    http://www.savenature.org/
    
387. CENTER FOR ENERGY & ENVIRONMENT
    http://www.crest.org/ 
    
388. CENTER FOR ENVIRONMENTAL EDUCATION
    http://www.epa.gov/teachers
    
389. CENTER FOR ENVIRONMENTAL HEALTH
    http://www.cdc.gov/nceh
    
390. CENTER FOR ENVIRONMENTAL INFORMATION, INC
    http://www.rochesterenvironment.org/
    
391. CENTER FOR ENVIRONMENTAL LAW & POLICY
    http://www.ciel.org/
    
392. CENTER FOR ENVIRONMENTAL POLICY, ECONOMICS & SCIENCE
    http://ceep.udel.edu/ceep.html
    
393. CENTER FOR FIELD RESEARCH, INC
    http://www.earthwatch.org/

394. CENTER FOR NEIGHBORHOOD TECHNOLOGY
    http://www.cnt.org/

395. CENTER FOR PROFESSIONAL STUDIES
    http://www.drake.edu/cps

396. CENTER FOR RENEWABLE ENERGY & SUSTAINABLE TECHNOLOGY
    http://www.crest.org/

397. CENTER FOR RESOURCE SOLUTIONS
    http://www.resource-solutions.org/

398. CENTER FOR SCIENCE IN PUBLIC PARTICIPATION
    http://www.csp2.org/

399. THE CENTER FOR THE RESPECT OF LIFE & ENVIRONMENT
    http://www.crle.org/

400. CENTER FOR THE SUPPORT OF NATIVE LANDS
    http://www.nativelands.org/

401. CENTER FOR UNDERSTANDING BUILT ENVIRONMENT
    http://www.cubekc.org/

402. THE CENTER FOR WATERSHED & COMMUNITY HEALTH
    http://www.cwp.org/

403. CENTERS FOR NATURE EDUCATION, INC.
    http://www.takeahike.org/

404. CENTRAL ARK RES, CONV & DEV DIST
    http://www.usgennet.org/usa/ne/county/gage/books/whoswho/whowhog2.htm

405. CENTRAL FLORIDA RESOURCE & DEVELOPMENT COUNCIL, INC.
    http://wwcol.com/con/cfrp.html

406. CENTRAL MD HERITAGE LEAGUE, INC.
    http://www.cmhl.org/gifts.html

407. CENTRAL OREGON ENVIRONMENTAL CENTER, INC
    http://www.idealist.org/

408. CENTRAL PENNSYLVANIA CONSERVANCY, INC.
    http://www.clearwaterconservancy.org/

409. CENTRAL ROCKY MOUNTAIN PERMACULTURE
    http://www.crmpi.org/

410. CENTRAL SAVANNAH RIVER RESEARCH, CONSERVATION, & DEVELOPMENT AREA INC.
    http://www.uga.edu/srel/outreach.htm

411. CENTRAL STATES AIR RESOURCE AGENCIES ASSOCIATION
    http://www.censara.org/

412. CENTRAL STATES WATER ENVIRONMENT ASSOCIATION, INCORPORATED
    http://www.cswea.org/awards

413. CENTRE FOR SCIENCE & ENVIRONMENT
    http://www.atm.ch.cam.ac.uk/

414. CHAGRIN RIVER LAND CONSERVANCY
    http://www.crlc.cc/

415. CHAGRIN RIVER WATERSHED PTRS, INC.
    http://www.ma.utexas.edu/~katerman/c/perl/practice/exercises/final/output

416. CHANNEL ISLANDS MARINE RESOURCE INSTITUTE
    http://www.graysreef.nos.noaa.gov/
    
417. CHANNEL ISLANDS MARINE SANCTUARY FOUNDATION
    http://www.cinms.nos.noaa.gov/
    
418. CHARITON RIVER GREENBELT FOUNDATION
    http://www.inhf.org/traillinks.htm
    
419. CHARLES DARWIN FOUNDATION, INC.
    http://www.galapagos.org/
    
420. CHARLES RIVER CONSERVANCY, INC.
    http://www.charlesriverconservancy.org/projects/Skatepark
    
421. CHARLES RIVER MUSEUM OF INDUSTRY, INC.
    http://www.artisanind.com/hotlinks.htm
    
422. CHARLES RIVER WATERSHED ASSOCIATION, INC
    http://www.merrimack.org/vemn/links.htm
    
423. CHARLIE RUSSELL RIDERS FOUNDATION INC.
    http://www.sdfoundation.org/news/grantssept2001.shtml
    
424. CHARLOTTE HARBOR ENVIRONMENTAL CENTER, INC.
    http://www.checflorida.org/
    
425. CHARLOTTE LAND TRUST INC
    http://www.privatelandownernetwork.org/landtrustlist.htm
    
426. CHARLTON HERITAGE PRESERVATION TRUST
    http://www.charltontrust.org/
    
427. CHARTIERS NATURE CONSERVANCY, INC.
    http://www.privatelandownernetwork.org/landtrustlist.htm
    
428. CHATHAM CONSERVATION FOUNDATION, INC.
    http://www.socialaw.com/appslip/appNov02ff.html
    
429. CHATTAHOOCHEE NATURE CENTER INC
    http://www.digitalcity.com/atlanta/entertainment/venue.adp?sbid=103498719
    
430. CHATTAHOOCHEE OCONEE HERITAGE ASS., INC.
    http://home.earthlink.net/~mjohnsen/Environment/Enviro_Law/Sierra_martin.html

431. CHATTANOOGA AUDUBON SOCIETY
    http://www.audubonchattanooga.org/

432. CHATTOWAH OPEN LAND TRUST, INC.
    http://www.privatelandownernetwork.org/landtrustlist.htm

433. CHAUT. WATERSHED CONSERVANCY, INC.
    http://www.co.chautauqua.ny.us/legis/AGENDA/LEG-03/MONTHLY%20MEETING%20AGENDA%20AND%20MINUTES%202003/MAY%2003/ALEG052803.htm

434. CHAUTAUQUA LAKE ASSOCIATION, INC.
    http://www.lakesideohio.com/

435. CHEEKWOOD-TENNESSEE BOTANICAL GARDENS & MUSEUM OF ART
    http://www.cheekwood.org/

436. CHELSEA GARDENS FOUNDATION, INC
    http://www.history.org/Almanack/life/garden/gardnbib.cfm

437. CHENEQUA LAND CONSERVANCY, INC.
    http://www.wisbar.org/res/capp/2004/03-2486.htm

438. CHEROKEE HILLS RC&D PROJECT, INC.
    http://www.okcc.state.ok.us/Calendar/calendar.htm

439. CHESAPEAKE APPRECIATION, INC.
    http://www.chesapeake-energy.com/

440. CHESAPEAKE AUDUBON SOCIETY, INC.
    http://www.oceanlegacy.org/campaign_map/maryland.html

441. CHESAPEAKE BAY FOUNDATION
    http://www.cbf.org/

442. CHESAPEAKE BAY TRUST
    http://www.chesapeakebaytrust.org/

443. CHESAPEAKE HERITAGE CONSERVANCY, INC.
    http://www.skipjackmarthalewis.org/

444. CHESAPEAKE RESEARCH CONSORTIUM, INC.
    http://www.chesapeake.org/crc/crc.html

445. CHEYENNE COMMONS ALLIANCE
    http://www.bravodesignalliance.com/aboutusa3.htm

446. CHICAGO HORTICULTURAL SOCIETY
    http://www.chicago-botanic.org/

447. CHICAGO INSTITUTE FOR THE STUDY OF ARCHITECTURE & CONST TECH INC
    http://www.artic.edu/aic/collections/arch/index_pc.html

448. CHICAGO'S ENVIRONMENTAL FUND
    http://www.ert.net/release_6_6_2001.html

449. CHICAGOLAND REDEVELOPMENT INSTITUTE
    http://chicagoredi.org/

450. CHILDHOOD LEAD ACTION PROJECT
    http://www.shac-ri.org/clap.htm

451. CHILDREN AT RISK TODAY
    http://www.worldcatlibraries.org/wcpa/ow/6825a834ee183533.html

452. CHILDREN'S CASE MANAGEMENT ORG., INC.
    http://www.adoptionagencies.org/

453. CHILDREN'S ENVIRONMENTAL TRUST FOUNDATION, INTERNATIONAL
    http://www.cetfoundation.org/

454. CHIPPEWA NATURE CENTER INC
    http://www.naturesedgetherapycenter.org/sponsors.htm

455. CHOOSE ENVIRONMENTAL EXCELLENCE-GATEWAY REGION
    http://www.ceegr.org/

456. CHRISTIAN HEALTH MINISTRIES FOUNDATION
    http://www.chmf.org/

457. CHULA VISTA BAYFRONT CONSERVANCY TRUST
    http://www.signonsandiego.com/news/metro/20020210-9999_1m10cvnature.html

458. CHURCHVILLE NATURE CENTER ADVISORY COMMITTEE INC
    http://www.spiritway.com/SpiritDirectory/sdstate/Penn/pa.html

459. THE CINCINNATI HORTICULTURAL SOCIETY
    http://www.cincyflowershow.com/

460. CINCINNATI NATURE CENTER
    http://www.cincynature.org/

461. CITIZENS AGAINST OPEN BAY DUMPING, INC.
    http://www.marylandconservationcouncil.net/v26n09.htm

462. CITIZENS CAMPAIGN FUND FOR THE ENVIRONMENT, INC
    http://www.prfamerica.org/Stats-CitizensCampaign4Environment.html
    
463. CITIZENS COMM-PRESBY MEMORIAL
    http://www.medicaljobstoday.com/directory/display.html?state=mi
    
464. CITIZENS ENVIRONMENT COALITION
    http://www.cechouston.org/
    
465. CITIZENS FOR A BETTER ENVIRONMENT
    http://www.cbemw.org/
    
466. CITIZENS FOR A BETTER SOUTH FLORIDA, INC.
    http://www.scienceworkshops.org/site/miami
    
467. CITIZENS FOR A HEALTHY BAY
    http://www.healthybay.org/
    
468. CITIZENS FOR A QUIET ENVIRONMENT
    http://www.cqe.homestead.com/
    
469. CITIZENS FOR A RATIONAL ENERGY POLICY
    http://www.americanpolicy.org/petition-energy-1.htm
    
470. CITIZENS FOR CONSERVATION INC
    http://www.clevelandclinic.org/
    
471. CITIZENS FOR GORGE DISCOVERY CENTER
    http://www.columbiarivercruise.com/html/Columbia_River/rivers/Attractions_Dalles.html

472. CITIZENS FOR PENNSYLVANIA'S FUTURE
    http://www.pennfuture.org/

473. CITIZENS TO PRESERVE BLACK HAWK PARK
    http://www.risd41.org/bhshs/events.htm

474. CITIZENS' ENVIRONMENTAL COALITION, INC.
    http://www.citact.org/

475. CITIZENS' LAND CONSERVANCY OF HAMILTON COUNTY, INC.
    http://www.landconservancyhc.org/

476. CITY OF SAN FRANCISCO JAPAN CENTER GARAGE CORPORATION
    http://www.basoc2012.org/board_mori_j.html

477. CIVIC GARDEN CENTER OF GREATER CINCINNATI
    http://www.civicgardencenter.org/

478. CLARK FORK COALITION 
    http://www.clarkfork.org/

479. CLASSICAL CHINESE GARDEN TRUST
    http://www.schommer-sons.com/projects/garden.htm

480. CLEAN AIR CAMPAIGN OF THE PIKES PEAK REGION
    http://www.clnair.org/

481. THE CLEAN AIR CONSERVANCY 
    http://www.cleanairconservancy.com/links.html

482. CLEAN AIR COUNCIL
    http://www.cleanair.org/

483. CLEAN AIR FORCE OF CENTRAL TEXAS
    http://www.cleanairforce.org/

484. CLEAN AIR NOW
    http://www.epa.gov/airnow

485. CLEAN AIR TRUST EDUCATION FUND 
    http://www.cleanairtrust.org/

486. CLEAN AIR, COOL PLANET, INC.
    http://www.rainbowsystem.com/

487. CLEAN AIRPORT PARTNERSHIP, INC.
    http://www.cleanairports.com/

488. CLEAN ISLANDS INTERNATIONAL INC
    http://www.islands.org/cii/ciipage.htm

489. CLEAN OCEAN ACTION INC.
    http://www.okinawaocean.org/

490. CLEAN VALLEY COUNCIL, INC.
    http://www.cleanvalley.org/

491. CLEAN WATER FUND
    http://www.cleanwaterfund.org/

492. CLEAN WATER FUND OF NORTH CAROLINA, INC.
    http://www.cwfnc.org/

493. CLEARFIELD COUNTY RAILS TO TRAILS
    http://www.clearfieldco.org/Information/Recreation/recreation.html

494. THE CLEARWATER CONSERVANCY OF CENTRAL PENNSYLVANIA, INC.
    http://www.clearwaterconservancy.org/

495. CLEVELAND BOTANICAL GARDEN
    http://www.cbgarden.org/

496. CLEVELAND NATIONAL FOREST FOUNDATION
    http://www.nonprofits2000.net/cnff

497. CLEVELAND RECYCLING CENTER IN ST. CLAIR-SUPERIOR, INC
    http://www.nhlink.net/neighborhooddirectory/a.htm

498. CLF SERVICES, INC.
    http://www.clfventures.org/

499. CLIMATE NEUTRAL NETWORK
    http://www.climateneutral.com/

500. THE CLIMATE TRUST (FORMERLY OREGON CLIMATE TRUST)
    http://oregonfuture.oregonstate.edu/part3/pf3_02.html

Clarification of Question by scottybaja-ga on 30 Dec 2004 06:59 PST
This looks pretty darn good for our purposes. Thank you! I think we
would like to give it a go with your Python script with usage
instructions. Do we point the script to the database? or paste the
data somewhere? For your instructions, please remember that we are
novices technically. Thanks! If allowed, feel free to send directly to
scott@emagazine.com.

Request for Question Clarification by leapinglizard-ga on 30 Dec 2004 13:06 PST
I'm afraid Researchers are not permitted to correspond with Askers
beyond the confines of this website. I will only deliver the script by
posting it as an Answer to your Question.

As it stands, the script takes a single argument, to wit, the name of
an Excel file in the same six-column format as the one you posted. If
you like, I can change the expected format of the input file to a
single-column Excel file instead, or to a plaintext file containing
one company name per line.

One more thing: in order to run the script, you will need a recent
version of the Python interpreter. This is a free download. If you are
running Windows or Mac OS X 10.2, an installation wizard is available.

python.org: Download Standard Python Software
http://www.python.org/download/

python.org: Download Standard Python Software: Python 2.4 Windows installer
http://www.python.org/ftp/python/2.4/python-2.4.msi

python.org: Download Standard Python Software: Python 2.3 OS X 10.2 installer
http://ftp.cwi.nl/jack/python/mac/MacPython-OSX-2.3-1.dmg

Once you have installed Python, test it by: making sure you can access
python.exe from the command line (also known as the DOS shell);
copying the following text verbatim --

import sys, string
if len(sys.argv) > 1:
    print 'you said, "%s"' % string.join(sys.argv[1:], ' ')

-- into a file named test.py; and executing the instruction

  python.exe test.py hello there

on the command line. If it works, you'll be able to run my script.

Let me know if all this is agreeable to you.

leapinglizard

Request for Question Clarification by leapinglizard-ga on 03 Jan 2005 08:30 PST
Have you managed to install the Python interpreter?

leapinglizard

Clarification of Question by scottybaja-ga on 03 Jan 2005 10:48 PST
I have installed Python and have both the command line (black box) and
the Python Shell open. I saved the text verbatum in a note pad file
giving it the name test.py in the Python folder. I typed python.py
hello there in to the command line and received a sytax error. I'm
sure I am doing something wrong. Thanks for any clarification.

Request for Question Clarification by leapinglizard-ga on 03 Jan 2005 11:04 PST
I typed python.py hello there in to the command line and
    received a sytax error.

Well, that's not the right command. Try the following.

1. Navigate to the directory containing the Python executable -- I
believe it's called python.exe under Windows.

2. Execute the following command, typing only what lies between the
caret (">") and the end of the line. Output should be similar to what
follows.

> python.exe
Python 2.4 (#1, Jan  1 2005, 03:38:26) 
[GCC 3.2.2 20030222 (Red Hat Linux 3.2.2-5)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>>

3. Type Ctrl-D to leave the Python interpreter.

4. Copy the text I gave above into a file called test.py, using a
simple text editor such as Notepad. Do not use a full word processor
such as Word.

5. Execute the following command.

> python test.py hello there
you said, "hello there"

If all that worked, you're ready to run my script. I'll post it only
once we've made sure that everything's fine at your end.

leapinglizard

Clarification of Question by scottybaja-ga on 03 Jan 2005 12:35 PST
Sorry LL. I can open the command line double-clicking on python.exe
(from Windows Explorer) which displays text similar to:Python 2.4 (#1,
Jan  1 2005, 03:38:26)
[GCC 3.2.2 20030222 (Red Hat Linux 3.2.2-5)] on linux2
Type "help", "copyright", "credits" or "license" for more information.

but when I type python.exe I still get an error message that says:
traceback <most recent call last>:file "<stdin>", line1, in ?
Nameerror: name 'python' is not defined

Request for Question Clarification by leapinglizard-ga on 03 Jan 2005 12:58 PST
Okay, so the Python interpreter works when you launch it by
double-clicking on the desktop icon. But that's not what I'm asking
you to do.

You should carry out all of the above instructions from the DOS shell,
also known as the command prompt. It's somewhere in the Start menu,
usually at Start->Programs->Accessories->Command Prompt. When you
launch it, it's just a black box with a command line. Try running
python.exe from the command line as soon as it starts up-- if this
doesn't work, then you'll have to navigate to the directory containing
the python.exe file using the "cd" command.

Once you can run python.exe from the command line, you'll be able to
follow the rest of my instructions.

If that still doesn't work for you, let me suggest an alternate method
of running a script.

1. Make a new directory. Figure out what its full pathname is. I don't
know what it's going to be on your machine, but let's call it ABC for
the time being.

2. In this directory, use a simple text editor to make a file called
test2.py that contains exactly the following.

for i in range(20):
    print i,
print

3. Launch the Python interpreter by double-clicking on its icon, then
execute the following inside the interpreter.

> import os; os.chdir('ABC')
> import test2

Of course, you'll have to replace ABC with the full pathname of the
directory containing the test2.py file.

The result should be a sequence of numbers. Once you've accomplished
that, you'll be able to run my script.

leapinglizard

Clarification of Question by scottybaja-ga on 04 Jan 2005 04:57 PST
Jumpin Leaping Lizards! It Worked! I got the response -- you said, "hello there"

I was thinking about something you asked in an earlier response: I
would like to keep the data together with numerous fields (similar to
what I sent you). Is it possible to have the new found URLs listed on
the same line to be able to add into the Excel database? (Not even
sure this is necessary. Maybe I should see how it works first.)

Thanks so much!

Scott
Answer  
Subject: Re: URL search from list
Answered By: leapinglizard-ga on 05 Jan 2005 02:47 PST
 
Dear scottybaja,

You'll find the goods below. Copy all of the code in one shot and
paste it into a text file named "scraper.py". Make sure you don't
perturb the indentation of the code, and remember to use a simple text
editor such as Notepad rather than a full word processor such as Word.

Once you've done that, go to a command prompt and navigate to a
directory where you can access all of the following: the Python
interpreter (python.exe), the scraper script (scraper.py), and the
Excel file containing your data in the same format as the one you
posted above.

Now you can execute the command "python.exe scraper.py" followed by
one or two arguments. If you pass a single argument, as in

  python.exe scraper.py envirounique2.csv

then the script will only output its guesses into the window
containing the command prompt. But if you pass two arguments, as in

  python.exe scraper.py envirounique2.csv enviro_results.csv

then your final argument is used as the name of an Excel file into
which the script outputs the original file plus an extra column
containing its guesses.

If you have any trouble with this script, please advise me through a
Clarification Request so that I have the opportunity to fully meet
your needs before you rate my answer.

Furthermore, as I stated above, I am willing to provide code
maintenance -- but not significant new features -- for the next month
or so. In particular, if the search engine changes its markup so that
my script crashes or starts emitting gibberish, I will update it as a
Clarification to this question.

Regards,

leapinglizard



#=======begin scraper.py

import sys, os, string, re
import csv
import urllib

if len(sys.argv) not in [2, 3]:
    sys.exit('usage: scraper.py <in_file> [<out_file>]')
in_fname = sys.argv[1]
out_fname = None
if len(sys.argv) == 3:
    out_fname = sys.argv[2]
if not os.path.isfile(in_fname):
    sys.exit('"%s" is not a file' % in_fname)

try:
    reader = csv.reader(open(in_fname, 'rb'), dialect='excel')
    header = reader.next() + ['Suspected URL']
    if out_fname:
        outf = open(out_fname, 'w')
        writer = csv.writer(outf, dialect='excel')
        writer.writerow(header)

    line_num = 0
    for line_list in reader:
        line_num += 1
        name = line_list[0]
        print '%d. %s' % (line_num, name)

        query = string.join(name.split(), '+').upper()
        postfix = '&ei=UTF-8&fr=sfp&fl=0&x=wrt'
        url = 'http://search.yahoo.com/search?p=%s%s' % (query, postfix)
        text = urllib.urlopen(url).read()
        groups = re.findall('3A\/\/(.*)"\s*target=_blank', text)
        guess = '[zero hits]'
        if len(groups) > 0:
            guess = groups[0]
            if re.match('.*yahoo', guess):
                groups = re.findall('id="http://([^"]+)"', text)
                if len(groups) > 0:
                    guess = groups[0]
            guess = 'http://%s' % guess
        print '    %s\n' % guess
        if out_fname:
            writer.writerow(line_list + [guess])

    if out_fname:
        outf.close()

except csv.Error:
    print 'error reading from csv file "%s"' % in_fname

#=======end scraper.py

Request for Answer Clarification by scottybaja-ga on 06 Jan 2005 14:34 PST
LL:

I have run two tests. The first stopped at record 499 with a socket
error on the command line (using two arguments). I stopped the second
after "no hits" was the constant message being recorded for every
record regardless of the fact that the record actually had a URL.
(This one did record over 500 URLs) A third attempt to run a single
argument to the same file resulted in all "no hits."

Do you think we are doing something wrong?

Thanks.

Scott

Clarification of Answer by leapinglizard-ga on 06 Jan 2005 17:13 PST
Oops, sorry! There's a step I took very early in the process that I
forgot to mention. My script can't read the .xls format directly, so I
first had to convert your file to .csv format.

You should be able to do this very easily in Excel. Open the .xls
file, then select "Save As" from the File menu. In the "File name"
box, change the extension of the file name from ".xls" to ".csv", and
Excel should automatically change the file format once you click Save.
To check whether the file is indeed in .csv format, open the file in
Notepad. The first few lines should look like this.

"Company Name","Mailing Address 1","Mailing City","Mailing
State","Mailing Zip","Phone Number"
"10,000 FRIENDS OF PENNSYLVANIA","117 South 17th Street
2300","Philadelphia","PA","19103","(215) 568-2225"
"1000 FRIENDS OF IOWA","104 SW Fourth Street","Des
Moines","IA","50309","(515) 288-5364"
"1000 FRIENDS OF MINNESOTA, INC.","370 Selby Avenue 300","St.
Paul","MN","55102","(651) 312-1000"
"1000 FRIENDS OF NEW MEXICO, INC.","1001 Marquette NW","Albuquerque","NM","87102",

If you see a bunch of odd characters instead, it's still in .xls
format because Excel wasn't configured to automatically change the
file format. In this case, repeat the "Save As" procedure in Excel,
except now when you're in the dialog box, manually select as "File
type" the "Text CSV" format. This really should force the conversion.

I apologize for the omission. Let me know if you can run the script
now. It still works on my machine.

leapinglizard

Request for Answer Clarification by scottybaja-ga on 07 Jan 2005 12:37 PST
LL,

It's working but only through about 600 when it starts to record "no
hits" on good names.

I cannot get the "xxx", "xxx", format to appear in Notepad even after
saving as a CSV or txt file. What appears in Notepad looks like tab
delimited with no quotes.

It's perfect except it needs to last longer.

Thanks.

Scott

Thanks.

Request for Answer Clarification by scottybaja-ga on 07 Jan 2005 12:46 PST
Hold the horses. I just finished writing you and went back to the DOS
box and it was running correctly again but the numbers started at
about 2400. Then after a few minutes it was back to the "no hits"
message and it seems to be searching much too quickly during the no
hits. Thanks.

Clarification of Answer by leapinglizard-ga on 07 Jan 2005 13:09 PST
Could you do me a favor and post the .csv file you're using when you
get these unwelcome results? This will help me diagnose the problem.

leapinglizard

Clarification of Answer by leapinglizard-ga on 07 Jan 2005 13:21 PST
Actually, I see what the problem is. The search engine cuts you off
temporarily after about 500 consecutive requests. I hadn't seen this
behavior during my earlier testing, so they must have implemented it
recently.

I'm going to do two things. First, I'll slow down the script so that
it pauses for one second between requests. This will make it slower by
an order of magnitude, but maybe the search engine will like it
better. Second, I'll rewrite the script to make it work with one or
two other search engines to see whether those are more accommodating.

leapinglizard

Clarification of Answer by leapinglizard-ga on 07 Jan 2005 15:39 PST
Below is a new version of the script. You can call it whatever you
like on your machine, although I've named it scraperA.py to
distinguish it from the original. This one uses a different web server
to get the same results. It also pauses for about a tenth of a second
between consecutive queries, which makes it considerably slower but
also a less voracious consumer of the search engine's resources. On my
machine, this script took 34 minutes to guess URLs for all 3129
company names in the file you posted above, which comes out to about
90 URLs per minute. The results of this run are in the following .csv
file.

http://plg.uwaterloo.ca/~mlaszlo/answers/envirounique2.results.csv

leapinglizard



#=======begin scraperA.py

import sys, os, string, re
import csv
import urllib
import time
import random

if len(sys.argv) not in [2, 3]:
    sys.exit('usage: scraperA.py <in_file> [<out_file>]')
in_fname = sys.argv[1]
out_fname = None
if len(sys.argv) == 3:
    out_fname = sys.argv[2]
if not os.path.isfile(in_fname):
    sys.exit('"%s" is not a file' % in_fname)

try:
    reader = csv.reader(open(in_fname, 'rb'), dialect='excel')
    header = reader.next() + ['Suspected URL']
    if out_fname:
        outf = open(out_fname, 'w')
        writer = csv.writer(outf, dialect='excel')
        writer.writerow(header)

    line_num = 0
    for line_list in reader:
        line_num += 1
        name = line_list[0]
        print '%d. %s' % (line_num, name)

        query = string.join(name.split(), '+').upper()
        prefix = 'http://www.altavista.com/web/results?itag=wrx&q='
        url = '%s%s&kgs=0&kls=0' % (prefix, query)
        while 1:
            try:
                time.sleep(random.randrange(1,21)*0.01)
                text = urllib.urlopen(url).read()
            except IOError:
                continue
            break
        groups = re.findall("<a class='res' href='([^']+)'>", text)
        guess = '[zero hits]'
        if len(groups) > 0:
            guess = groups[0]
        print '    %s\n' % guess
        if out_fname:
            writer.writerow(line_list + [guess])

    if out_fname:
        outf.close()

except csv.Error:
    print 'error reading from csv file "%s"' % in_fname

#=======end scraperA.py
Comments  
There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy