Google Answers Logo
View Question
 
Q: Data mining needed ( No Answer,   0 Comments )
Question  
Subject: Data mining needed
Category: Reference, Education and News
Asked by: magi101-ga
List Price: $26.00
Posted: 19 Jul 2003 11:08 PDT
Expires: 23 Jul 2003 01:43 PDT
Question ID: 232809
Hello, 
 
I need some data mining done for about 700-800 domains in the .EDU
domain.  For each domain, I need all Contact Info extracted to a
master Excel file.  Educause
(the Registrar for .EDU) provides this info in a simple, text format,
but I don't have the time to compile it or the knowledge to write a
simple program that does it for me.  Because of the number of records,
I suspect a program may do this much more efficiently than a human,
but you are welcome to do it any way you prefer -- write a program,
use an existing program, or do it by hand.
 
This will not be used for marketing purposes, but rather for an
internal study we are doing.  I cannot post the domain list here, but
you can test out
the Whois with domains like "baptist.edu" and "berkeley.edu". 
(http://whois.educause.net/edudomain/whois.asp)
 
If you can do this within 48 hours, simply post your intention, along
with an email address I can send the list of domains to.
 
Output format will be XLS or comma delimited.   
 
 
Thanks!

Request for Question Clarification by answerguru-ga on 21 Jul 2003 08:53 PDT
Hi magi101-ga,

Google Answers is a public forum, so researchers cannot communicate
privately with any of the forum's users. Furthermore, no personal
information posting is permissible. Regardless of these restrictions,
I would like to try and help you with this - I am interested in
answering your question, however I have a few questions first:

1. Does Educause provide information about the domains or does it
provide the actual information you're interested in?

2. Can you provide a sample of the format that the information is
provided? If it contains personal information please try to generalize
it - maybe two or three records will do for this.

3. Which fields are you interested in maintaining? (ie. Name, email
address, etc.)

4. Is there any specific type of contact information that you are
specifically interested in (ie. only administrators, faculty, etc.)?

What I'm thinking is that if I can provide you with the tool(s) to do
this, you can pull the information you need and nothing will need to
be posted here. Let me know :)

answerguru-ga

Clarification of Question by magi101-ga on 21 Jul 2003 09:47 PDT
1. Educause provides the exact info I am looking for, publicly, in
text format, at their Whois. 
(http://whois.educause.net/edudomain/whois.asp)


2. Name address, phone, and email of 1-3 of the following: registrant,
administrator, technical contact.  Query Whois for more details.

3. Need all of the above fields.

4. No - no faculty are listed in Whois.  Just the parties mentioned in
part 2.

Thanks!

Request for Question Clarification by answerguru-ga on 21 Jul 2003 13:54 PDT
Hi again,

I see what you are after now....I actually called Educause (spoke with
Linda) and asked if there was another way of accessing the information
other than the search page they provide. The answer was a clear and
definitive *NO*. So we are looking at developing a tool to iterate
through your list, querying each domain one at a time, and pulling out
the information you're interested in.

Although I can help you with this type of work, it will take several
hours to complete. Is this something you would be interested in
engaging in?

answerguru-ga

Clarification of Question by magi101-ga on 21 Jul 2003 14:00 PDT
Yes.  It's the exact same question that we started with.  

My guess is that a program-based or script-based solution is the best
way, versus copying and pasting 700 domains manually.  At the end of
the day, though, results -- not process -- is what I care about. 
Thanks.

Request for Question Clarification by answerguru-ga on 21 Jul 2003 14:44 PDT
Alright, so now that we are both on the same page, I feel that your
list price is inappropriate consider the complexity and projected
duration of the work involved. Granted, it was a fair price if the
information could have been obtained without any program/script
development> However, now that we are both certain that this cannot be
avoided, it is time to revisit this issue.

While I know you are interested in the data rather than the code that
gets it, I estimate that development time will be between 10-12 hours
if there aren't any snags along the way. Rather than quote a price, I
will give you the chance to re-price the question based on this fact.

I will also like you to post the list of domain names in plain text,
one per line as I will need this to generate the output data.

Looking forward to hearing from you :)

answerguru-ga

Clarification of Question by magi101-ga on 21 Jul 2003 15:16 PDT
Hi,

Sounds like it would take you a lot of work.  I'll cross my fingers
and hope someone else can do it in less time.  I've had similar
requests processed in about 1.5-2 hours, and my price reflects that.

Thanks.. appreciate your input!

Request for Question Clarification by answerguru-ga on 22 Jul 2003 08:17 PDT
Sure, that's no problem at all....you can just post another message
when and if you are interested in continuing along this path.

answerguru-ga
Answer  
There is no answer at this time.

Comments  
There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy