Problems with embedded pages

Started by ignat miller, May 02, 2016, 01:27:19 PM


ignat miller

Hello.

I believe WebCopy has a problem detecting pages that are embedded in the page being downloaded. I ran into this while trying to download the following site: https://filtron.co

Here's a screenshot of that embedded page:



Here's the project file:
https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458843&authkey=!AC-0-Jkyxb91niY&ithint=file%2ccwp

And here's an expected copy of filtron.co site (fixed manually):
https://onedrive.live.com/redir?resid=501FA25B75FA2E06!458494&authkey=!ANnK_2_DKbAu4yM&ithint=folder%2chtm


Please make WebCopy crawl and copy those embedded pages, as fixing them manually is annoying, time-consuming and requires several web copiers to be used.

Thank you in advance.

Richard Moss

Hello,

Thank you for the report, I'm sorry you're having trouble with WebCopy. WebCopy does fully support iframe elements; it's one of the things that is tested any time changes are made to WebCopy.

The problem is that the iframe is being generated via JavaScript. When viewing the source you can see that the tag was originally <script src="https://filtron.co/embed/v1/7" id="filtron_script_7" async></script> (which is present in the project file you helpfully supplied), but WebCopy does not currently have the ability to execute JavaScript, so the iframe is never created.
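
To illustrate the point, here is a rough Python sketch of what any crawler that only reads the downloaded markup would see. It is not how WebCopy is implemented; the TagCollector class, the scan helper and the example.invalid iframe URL are all hypothetical and made up for this example. Only the script tag is taken from the page source quoted above.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collects the src of every iframe and script tag present in the markup."""

    def __init__(self):
        super().__init__()
        self.iframes = []
        self.scripts = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        src = attributes.get("src")
        if not src:
            return
        if tag == "iframe":
            self.iframes.append(src)
        elif tag == "script":
            self.scripts.append(src)


def scan(html):
    """Return (iframe_urls, script_urls) found by static parsing only."""
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return collector.iframes, collector.scripts


# Markup as downloaded: only the script tag exists, no iframe yet.
STATIC_SOURCE = (
    '<div><script src="https://filtron.co/embed/v1/7" '
    'id="filtron_script_7" async></script></div>'
)

# What a browser's DOM might look like after the script has run.
# The iframe URL is a placeholder, not the real one.
AFTER_SCRIPT_RUNS = (
    '<div><iframe src="https://example.invalid/embedded-page"></iframe></div>'
)

if __name__ == "__main__":
    print(scan(STATIC_SOURCE))
    print(scan(AFTER_SCRIPT_RUNS))
```

Scanning the downloaded source finds the script URL but no iframe, so there is no embedded page URL to follow. The iframe only shows up in the second case, which needs something that actually runs the script.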

Unfortunately this is the current design of WebCopy. In its current form I have neither the time nor the resources to build a browser-like environment for executing scripts and then working out new URLs from the result - one day hopefully, but not any time soon.

Regards,
Richard Moss
Read "Before You Post" before posting (https://forums.cyotek.com/cyotek-webcopy/before-you-post/). Do not send me private messages. Do not expect instant replies.

All responses are hand crafted. No AI involved. Possibly no I either.

ignat miller

Got it. Thank you for the explanation.