
contact  |  about  |  sitemap

Spider script to get all URLs in a web site
Last Post 16 Nov 2009 11:12 AM by Joey. 3 Replies.
Tijn

--
02 Feb 2009 07:24 PM
This script spiders a web site the way search engine bots do. It collects all URLs on a page, then visits each URL it found to look for new ones. The script stops once it has found and visited every URL on your site. It does not follow URLs outside your web site.
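
For readers without Djuggler at hand, the crawl described above can be sketched in Python. This is not the attached Djuggler script, only a minimal illustration of the same idea using the standard library. The names `LinkParser`, `fetch`, and `crawl` are made up for this sketch, and it assumes "same web site" means "same host name as the start URL".

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch(url):
    """Download a page and return its HTML as text (hypothetical helper)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(base_url, fetch_page=fetch):
    """Return the set of same-site URLs reachable from base_url."""
    site = urlparse(base_url).netloc  # assumption: same host == same site
    seen = {base_url}
    queue = deque([base_url])
    while queue:
        url = queue.popleft()
        try:
            html = fetch_page(url)
        except OSError:
            continue  # page could not be fetched: skip it, keep crawling
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any "#fragment" part.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if urlparse(absolute).netloc != site:
                continue  # never follow links that leave the site
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return seen
```

The `seen` set is what makes the crawl stop: a URL is queued only the first time it is found, so the loop ends once every reachable page has been visited. Calling `crawl("http://www.example.com/")` with your own start URL would return the full set of URLs found.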

Use this script to see if a search engine robot can find all your pages.

The first lines of the script define its settings, such as the base URL and which page extensions to include. A debug switch also produces a text file with an overview of all links on each page of your web site.
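
As a rough illustration of what such settings might look like, here is a Python sketch. It does not reproduce the Djuggler script's actual settings: every name and default below (`Settings`, `should_include`, `format_overview`, `write_overview`, the example base URL, the extension list, the file name) is an assumption made for this example.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath
from urllib.parse import urlparse


@dataclass
class Settings:
    """Hypothetical crawler settings; all defaults are placeholders."""
    base_url: str = "http://www.example.com/"
    # "" matches extensionless URLs such as "/" or "/about".
    extensions: tuple = ("", ".html", ".htm", ".php")
    debug: bool = False
    debug_file: str = "links_overview.txt"


def should_include(url, settings):
    """True if url is on the base URL's host and has an allowed extension."""
    parts = urlparse(url)
    if parts.netloc != urlparse(settings.base_url).netloc:
        return False
    suffix = PurePosixPath(parts.path).suffix.lower()
    return suffix in settings.extensions


def format_overview(links_by_page):
    """Build the debug text: each page followed by its indented links."""
    lines = []
    for page in sorted(links_by_page):
        lines.append(page)
        for link in sorted(links_by_page[page]):
            lines.append("    " + link)
    return "\n".join(lines) + "\n"


def write_overview(links_by_page, settings):
    """Write the overview file, but only when the debug switch is on."""
    if settings.debug:
        with open(settings.debug_file, "w", encoding="utf-8") as handle:
            handle.write(format_overview(links_by_page))
```

Keeping the extension check in one small function makes it easy to decide, for example, that images and PDFs should be listed but never visited.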

Note: This script uses the Djuggler List variable, which is not available in the Djuggler Personal version. Use Djuggler Lite or a higher version.

Get_all_urls_in_a_site.djs

Amanda

--
11 Feb 2009 10:59 PM
Thanks! Great for finding 404 errors.


Barry

--
11 Mar 2009 09:49 AM
Creating your own Internet bots with Djuggler, cool!
Thanks for the script.


Joey

--
16 Nov 2009 11:12 AM
Nice web scraper example script.
Thanks.
J.





Forum participation and optional registration

You don't need to be registered to participate in the Djuggler forums. However, if you want to subscribe to email notifications, you need to register. You can also subscribe to the forum RSS feed.
