contact  |  about  |  sitemap

anonymous proxy
Last Post 15 Jun 2009 11:47 AM by usadhikari. 7 Replies.
Sort:
PrevPrev NextNext
Author Messages
ddr

--
02 Jun 2009 10:22 AM
I'm testing Djuggler. I need it to collect data from some web pages.


My target website has some kind of "anti spider" protection, and after browsing around 50 pages, it blocks the requesting ip,

so data collection can't proceed further.

What can I do to solve this ?

The program has some built in functions to, for example, use proxies, or reset the modem to renew his public ip address ?


Tijn

--
02 Jun 2009 10:22 AM
You could use a public anonymous proxy.
Here is a list:

http://www.publicproxyservers.com/page1.html


In the Open web page action, you will find the proxy settings.
You can typically switch them every 49 request in the script logic.


expedable

--
08 Jun 2009 09:23 AM
Would be great if you can upload copy of this script in action when you get it working


Umesh

--
08 Jun 2009 12:44 PM
Hi,

I am also facing similar kind of problem, I used my script to extract data from a secure site i.e. Https sites, So how can I extract data from a secure website, may be a secure site generate a session or may be something else ( since I am new in this field, the question may be silly.)..can any one suggest me, is there any logic/way to extract data from a secure site.

Thanks
Umesh.


Tijn

--
10 Jun 2009 12:02 PM
Djuggler can extract data from a secure web by simply using the Open Web Action with a https address.

Doing that via a proxy is also no problem.
When you want to use a secure proxy that will depend on the proxy.

Tijn


Umesh

--
13 Jun 2009 08:59 AM
Hi Tijn,

Still facing same problem while retrieving data from a secure site. Here I do attach my script.

In this website when U go through a proper channel you will get some data i.e follow this path.

open web page: http://www.grad.cmich.edu/programs_alpha.htm

then go record having Title : "Hospitality and Tourism Administration **"
it has secure link i.e. https://bulletins.cmich.edu/display-degree.asp? yr=2006&level=gr&loc=onc&pid=21001040&pname=Certificate+in+Hospitality+%26+Tourism+Administration

but if I open direct this page (https://bulletins.cmich.edu/display-degree.asp? yr=2006&level=gr&loc=onc&pid=21001040&pname=Certificate+in+Hospitality+%26+Tourism+Administration) it shows error page due to security reason.

So still my question is same, Is it possible to retrieve data from a secure site, generally I used a loop which read a no of secure links, so that my script should retrieve data from all those links.

Thanks,
Umesh


Tijn

--
15 Jun 2009 10:40 AM
Hi,

The problem was a space in the url. I used a replace text action to remove the space and now it seems to work.

See the attached script.

Regards,
Tijn

secureSiteDemo_problem_updated.djs

usadhikari

--
15 Jun 2009 11:47 AM
Thanks Tijn,

Sorry for disagree with your previous post. It was my mistake. :(

Regards,
Umesh




Quick Reply
toggle
  Username:
Subject:
Body:
Security Code:
Enter the code shown above:

Submit

Powered by Active Forums

Forum participation and optional registration

You don't need to be registered to partcipate in the Djuggler forums, however if you want to subscribe to email notifications you need to register. You can also subscribe to the forum RSS feed.