Hi there,
I'm a new user and would like to ask how I can download skipped files?
I wanted to copy a site and I was able to log in and get some of the stuff but seems like majority of the sound files / images were not downloaded like below.
WOuld you please let me know what I can do to get them downloaded please?
THanks in advance for any suggestions.
Hello,
Welcome to the forums and thanks for the question.
That "skipped" status is a bit confusing, I should probably try to document it better.
You can see from your screen shot that the skipped URI's all return a response code of 303, which is a redirect. What WebCopy does is skip saving the URI that has the redirect, but it follows the redirect. For example, if redirect.php returns a redirect to content.php, WebCopy will skip saving a physical file for redirect.php but will scan and download content.php. It will then remap any source links which pointed to redirect.php to point to content.php instead.
Hope this helps.
Regards;
Richard Moss
Thanks for the reply.
I think what happened was that, I used browser to authenticate, but how can I stay authenticated? I think the redirect was redirecting to the login page again, and that was the reason why it did not download any of those files. Is it possible to keep the login session opened so that files are going to be downloaded?
Any advice greatly appreciated.
Hello,
Regardless of if you create a form to post login credentials, or use the new "use external browser" added to 1.7, there's still one fairly important detail - you need to create a rule to exclude any logout pages. WebCopy will hit every URL it can find that unless expressly instructed otherwise, so if it hits a sign out / log out page then the action will occur and session cookies etc will be deleted. Without more details I can't say if that it was is happening here but it something that has confused users in the past.
Regard;
Richard Moss
Hi,
I tried to put in the rules to exclude the login pages that I know (I have never come across log out pages). But the files are still skipped.
When I clicked on one of the links which were skipped, it brought me to /login/index.php which I tried to exclude already.
Is there an example I can see how this is done at all? I cannot get it to include those 'redirect' links at all but I feel that it's because the session id expired...there must be a way to keep it around?
Many thanks.
I went into one of the links' headers and I wonder if this provides any insight on why it is skipped?