Cyotek Forums

Products => WebCopy => Topic started by: ignat miller on May 02, 2016, 01:27:19 PM

Title: Problems with embedded pages
Post by: ignat miller on May 02, 2016, 01:27:19 PM
Hello.

I believe there's a problem for WebCopy to spot pages embedded into downloading page. I'm experiencing problems trying to download the following site: https://filtron.co (https://filtron.co)

Here's the screen of that embedded page:

(https://forums.cyotek.com/proxy.php?request=http%3A%2F%2Fimg4.imagetitan.com%2Fimg4%2Fsmall%2F13%2F13_screeny425.png&hash=11f151d8b0bfe7acd615c869efe9097ba5779b07) (http://img4.imagetitan.com/img.php?image=13_screeny425.png)

Here's the project file:
https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458843&authkey=!AC-0-Jkyxb91niY&ithint=file%2ccwp (https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458843&authkey=!AC-0-Jkyxb91niY&ithint=file%2ccwp)

And here's an expected copy of filtron.co site (fixed manually):
https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458494&authkey=!ANnK_2_DKbAu4yM&ithint=folder%2chtm (https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458494&authkey=!ANnK_2_DKbAu4yM&ithint=folder%2chtm)


Please make WebCopy crawl and copy those embedded pages as manual fixing is annoying, time consuming and requires several web copiers to be used.

Thank you in advance.
Title: Re: Problems with embedded pages
Post by: Richard Moss on May 03, 2016, 04:41:55 PM
Hello,

Thank you for the report, I'm sorry you're having troubles with WebCopy. WebCopy does fully support iframe elements, it's one of the things that is tested (http://demo.cyotek.com/html/elements/iframe.php) anytime changes are made to WebCopy.

The problem is because that iframe is being generated via JavaScript. When viewing the source you can see that the tag originally was <script src="https://filtron.co/embed/v1/7" id="filtron_script_7" async></script> (which is present in the project file you helpfully supplied), but WebCopy does not currently have the ability to execute JavaScript, so the iframe is never created.

Unfortunately this the current design of WebCopy, in its current form I have neither the time nor resource to build a browser-like environment for executing scripts and then working out new URL's from that - one day hopefully, but not any time soon.

Regards;
Richard Moss
Title: Re: Problems with embedded pages
Post by: ignat miller on May 04, 2016, 08:45:10 AM
Got it. Thank you for explanations.