Last post by mikesmurph - September 01, 2022, 06:43:12 PM
Since version 22.214.171.1242 it is no longer possible to call "C:\ProgramFiles\Cyotek\WebCopy\wcopy.exe c:\cwpproject\Project.cwp" for automated web crawling. WCopy.exe always returns the message "Only http and https URI schemes are supported".
Is this a problem with the new version, or what does the URI have to look like so that the call works again?
After a successful download, I tried opening the site in Windows Edge, Chrome, and Firefox.
Windows Edge renders a lot of the site and then blanks out to its grey background.
Chrome and Firefox render a couple of lines of text, then blank out to a grey background much faster.
I am wondering whether the site's original creator put some protection in place to keep it from being copied?
Originally, I thought the JS or PHP was not being referenced correctly. However, I do not see any PHP references on the site (from a cursory search), and I also believe the partial success in Windows Edge would not be possible if the JS/PHP were the issue.
Last post by The_yoyo - July 10, 2022, 05:44:57 PM
Bumping this. I have a very similar issue: the website has a fairly basic structure but slightly complex URIs, like: www.example.com/m/?Cat=Selection&V_Sub=X&Page=1&SortBy=Z and www.example.com/m/?Cat=Selection&V_Sub=X&Page=2&SortBy=Z
I limited the crawl length from root so as to limit the page range (not sure if it worked). Anyway, the files I get are named: Index.htm Index-1.htm Index-2.htm ... Index-45678.htm
Meanwhile, the HREFs inside the .htm files still refer to the URIs with the query string rather than to the Index-N.htm file names, like: C/DownloadedWebsites/example.com/m/?Cat=Selection&V_Sub=X&Page=2&SortBy=Z This means the internal links don't work (404s).
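To illustrate the mismatch described above, here is a minimal Python sketch of a post-processing step that rewrites query-string links to saved file names. It is not WebCopy's own remapping logic. The `url_to_file` mapping, the helper names `link_key` and `rewrite_links`, and the pairing of each page with a particular Index-N.htm file are all assumptions made for illustration; the real pairing would have to come from whatever record of the crawl is available.

```python
import html
import re
from urllib.parse import urlsplit, parse_qsl


def link_key(url):
    """Reduce a URL to (path, sorted query pairs) so equivalent links compare equal."""
    parts = urlsplit(url)
    return (parts.path, tuple(sorted(parse_qsl(parts.query))))


def rewrite_links(page_html, url_to_file):
    """Replace href values that match a known URL with the saved local file name."""
    lookup = {link_key(url): name for url, name in url_to_file.items()}

    def swap(match):
        # href values in HTML may encode "&" as "&amp;", so decode before comparing.
        target = html.unescape(match.group(2))
        local = lookup.get(link_key(target))
        if local is None:
            return match.group(0)  # leave unknown links untouched
        return f"{match.group(1)}{local}{match.group(3)}"

    return re.sub(r'(href=")([^"]*)(")', swap, page_html)


# Hypothetical mapping: which Index-N.htm belongs to which page is assumed here.
url_to_file = {
    "/m/?Cat=Selection&V_Sub=X&Page=1&SortBy=Z": "Index-1.htm",
    "/m/?Cat=Selection&V_Sub=X&Page=2&SortBy=Z": "Index-2.htm",
}

page = '<a href="/m/?Cat=Selection&amp;V_Sub=X&amp;Page=2&amp;SortBy=Z">Next</a>'
print(rewrite_links(page, url_to_file))
```

Comparing the parsed query parameters rather than raw strings means a link still matches if the parameters appear in a different order or are HTML-escaped in the page source.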