Cyotek Forums

Products => WebCopy => Topic started by: michaelnorth on February 04, 2015, 11:57:27 PM

Title: Downloading slider images, Creating a self-referential copy
Post by: michaelnorth on February 04, 2015, 11:57:27 PM
Hi, thanks for providing WebCopy. It seems to work pretty well, but I'm running into a couple of problems and wonder if I have the right settings.

Some big slider images are missing. Any way to retrieve those?

Also, even though the site is on my local machine, it's extremely slow. I experimented by unplugging my network connection, and now the site is very fast. So it seems the local copy is trying to reach out over the Internet to resolve some of its content. Are there any settings that eliminate that problem, so the local copy refers to itself instead of live content on the real domain?
Title: Re: Downloading slider images, Creating a self-referential copy
Post by: Richard Moss on February 05, 2015, 09:16:54 PM
Hello,

Thanks for the message!

Firstly, in regards to the slider images: WebCopy does not include a JavaScript parser or a virtual DOM. So if the site relies on JavaScript to set up images or links, WebCopy will be unable to find them, as it can only parse the content it directly downloads. However, if the images are linked directly in the HTML, could you paste an example so I can check it against WebCopy's parser?
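
To illustrate the limitation, here is a rough sketch in Python. It is not WebCopy's actual parser; it uses Python's standard html.parser module, and the sample page, file names, and the find_static_images helper are all made up for the example. A parser that only reads the downloaded markup sees the logo image, but never sees the slider image that the script would create in a browser.

```python
from html.parser import HTMLParser


class ImageCollector(HTMLParser):
    """Collects the src of every <img> tag present in the static markup."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def find_static_images(html):
    """Hypothetical helper: return image URLs visible without running scripts."""
    collector = ImageCollector()
    collector.feed(html)
    collector.close()
    return collector.sources


# Made-up page: one image in the markup, one created by JavaScript at runtime.
SAMPLE_PAGE = """
<html><body>
<img src="images/logo.png">
<div id="slider"></div>
<script>
  var slide = document.createElement("img");
  slide.src = "images/slide1.jpg";
  document.getElementById("slider").appendChild(slide);
</script>
</body></html>
"""

print(find_static_images(SAMPLE_PAGE))  # ['images/logo.png']
```

The slide1.jpg image only exists once a browser runs the script, so a crawler that does not execute JavaScript has nothing to follow.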

Secondly, in regards to remote access: where possible, the local copy will only reference resources within the local copy. However, if the remote site pulls resources from other domains (CDNs, for example), then by default WebCopy will leave these links as-is. One of the latest updates to WebCopy added support for directly downloading such linked content without crawling those domains fully, although this feature is currently experimental and may not work properly yet.
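
If you want to see which hosts a copied page still reaches out to, something like the following Python sketch could help. It is not part of WebCopy; the find_remote_hosts helper and the sample markup are assumptions for illustration, and it only looks at src attributes and at href on link tags, so references inside CSS or scripts would be missed.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class RemoteReferenceFinder(HTMLParser):
    """Records the host of every resource reference that names a host."""

    def __init__(self):
        super().__init__()
        self.hosts = set()

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if not value:
                continue
            # src on any tag, href only on <link> (stylesheets, icons);
            # ordinary <a href> links are not loaded with the page.
            if name == "src" or (name == "href" and tag == "link"):
                host = urlparse(value).netloc.lower()
                if host:
                    self.hosts.add(host)


def find_remote_hosts(html):
    """Hypothetical helper: list hosts a saved page would still contact."""
    finder = RemoteReferenceFinder()
    finder.feed(html)
    finder.close()
    return sorted(finder.hosts)


# Made-up markup standing in for a page from the local copy.
SAMPLE_PAGE = """
<link rel="stylesheet" href="css/site.css">
<script src="https://cdn.example.net/lib/slider.js"></script>
<img src="//fonts.example.org/icon.png">
<a href="http://www.example.com/about.html">About</a>
"""

print(find_remote_hosts(SAMPLE_PAGE))  # ['cdn.example.net', 'fonts.example.org']
```

Relative references such as css/site.css have no host and stay local; anything the sketch reports is a candidate for the slowdown seen while the network is connected.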

I haven't really had much time to update WebCopy recently; it's overdue for some bug fixes!

Regards,
Richard Moss