Google Answers: for joseleon-ga only please

View Question

Q: for joseleon-ga only please ( Answered 5 out of 5 stars

, 0 Comments )

Question

Subject: for joseleon-ga only please
Category: Computers > Programming
Asked by: bmcompany-ga
List Price: $150.00

Posted: 09 Dec 2003 07:45 PST
Expires: 08 Jan 2004 07:45 PST
Question ID: 285261

Hi there, Last week you wrote a link parser to generate a list of URLs to submit. The program is working fine but we would like a couple of new features. Are you ok for this?
Request for Question Clarification by joseleon-ga on 09 Dec 2003 08:24 PST Hello, bmcompany: Sure, just tell me what do you need and you have it. Regards.
Clarification of Question by bmcompany-ga on 09 Dec 2003 09:28 PST Hello again! at the moment, using the link parser, you choose a directory and it reads the url from the top links and then copies that into a list, appending all of the .htm file names to the URLS, until you have a complete url (inc page name) ready for submission. We now need to do the same but with several sites in one directory (each site with around 50 pages. IE, one dir will have 400 pages, 8 clients (50 pages per client) So, the program needs to look into each html file and extract the url on an individual basis. the list will look like this http://www.domain1.com/page1.htm http://www.domain1.com/page2.htm http://www.domain1.com/page3.htm http://www.domain1.com/page4.htm http://www.domain2.com/page1.htm http://www.domain2.com/page2.htm http://www.domain2.com/page3.htm http://www.domain2.com/page4.htm http://www.domain2.com/page5.htm http://www.domain3.com/page1.htm http://www.domain3.com/page2.htm http://www.domain3.com/page3.htm Thats the first of 2 changes. Secondly, for each client, one of the pages contains a list of links to other pages (in a hidden layer). We need another field in the DB which notes whether the page contains this hidden layer or not. If you have a look at the files i sent you last time, you should be able to find the hidden layer (as its the html file thats 1 or 2K larger than the others). If you could clarify the question to make sure i have made sense. Thanks Again bmcompany
Request for Question Clarification by joseleon-ga on 09 Dec 2003 09:50 PST Hello, bmcompany: First of all, thanks for your confidence with me, I will try to make it as fast and as better as possible, so here is what I understand do you want me to do: 1. You want me to alter the previous procedure and use the domain name stored "independently" on each file, that way, you can have in the same directory many customer websites and generate reports for a single customer or for several without need to change anything, right? 2. For the second change, I deleted your data files, so I would need you to post them again, and you just want me to scan for that hidden layer and store it's existance (or absence) in a field in the db. Do you want this information on the text result also? Regards.
Request for Question Clarification by joseleon-ga on 10 Dec 2003 00:48 PST Hello, bmcompany: Should I start the modifications? Everything's ok? Regards.
Clarification of Question by bmcompany-ga on 10 Dec 2003 01:01 PST http://www.sesuk.net/ga/ga.zip sorry for the delay, we're in the uk - so you're 5 hours or so behind. The text output doesnt need the hidden layer info, just the db. Perfect, please go ahead with the modifications. Please let me know once you've downloaded the file and I'll remove them. Thanks again bmcompany
Request for Question Clarification by joseleon-ga on 10 Dec 2003 01:38 PST Hello, bmcompany: I have downloaded the file, you can remove it safely, by the way, I'm in Spain, so here is GMT+1 ;-) I start the modifications right now. Regards.
Clarification of Question by bmcompany-ga on 10 Dec 2003 02:22 PST good stuff!

Answer

Subject: Re: for joseleon-ga only please
Answered By: joseleon-ga on 10 Dec 2003 02:57 PST
Rated: 5 out of 5 stars

Hello, bmcompany:

You can download the improved version from this location:

http://www.qadram.com/Link_parser_improved.zip
  
Now you can mix all the websites you want on a single dir and results
will be extracted sorted and grouped by domain, also, in the database,
a new field has been created, called "layer", is boolean and stores
whether the file contains the hidden layer or not.

I hope this is what you need, but in any case, don't hesitate to
request for a clarification, I'm here to help you!

Regards.

Clarification of Answer by joseleon-ga on 10 Dec 2003 06:08 PST
Hello, bmcompany:
  Thanks for the tip!, I'm online most of the time, so you can post a
new question when you want.

Regards.

bmcompany-ga rated this answer: 5 out of 5 stars

and gave an additional tip of: $20.00

Perfect (worked for 30,000 pages no problem :)

next thing we want it to do is to have the URL and page name as 2
seperate fields. At that point the database needs to be split to 2
tables and normalised. Ill post that for you later on today if that
falls within your remit?

Comments

There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.

Search Google Answers for

Google Home - Answers FAQ - Terms of Service - Privacy Policy