Site is downloaded, but external images and CSS linked from its pages are not downloaded

Started by coderfx, June 30, 2020, 11:13:58 PM


coderfx

Hi there,

When I copy a website, it downloads and I can browse it on my local machine. But I noticed that all the images link to another site.
For example:

Downloading: site1.com

but site1.com has an image src that links to site2.com/image1.jpg

So when I inspect the downloaded HTML file, it still links to site2.com instead of downloading image1.jpg and using a relative path to the locally downloaded image.

Does anyone know how I can download the images from site2.com when I am downloading site1.com (which links to the site2.com images)?
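
To make the desired result concrete, here is a minimal Python sketch of the rewrite described above: map a remote image URL to a local relative path, then point the HTML at that path. This is only an illustration of the idea, not how WebCopy works internally; the function names and the "external" folder name are hypothetical, and the actual download step is left out.

```python
from urllib.parse import urlsplit
import posixpath


def local_path_for(url, root="external"):
    """Map a remote URL to a relative local path, grouped by host name.

    The "external" root folder is an arbitrary choice for this sketch.
    """
    parts = urlsplit(url)
    # Drop the leading slash so the path can be joined as a relative path.
    path = parts.path.lstrip("/") or "index.html"
    return posixpath.join(root, parts.netloc, path)


def rewrite_reference(html, url):
    """Replace every occurrence of a remote URL with its local path."""
    return html.replace(url, local_path_for(url))


page = '<img src="https://site2.com/image1.jpg">'
print(rewrite_reference(page, "https://site2.com/image1.jpg"))
```

With the site1.com / site2.com example above, the image would be saved under external/site2.com/image1.jpg and the src attribute rewritten to match.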

coderfx

<figure class="post-gallery">
<a href="https://*****.com/tips-from-a-botched-survivor/" title="Tips From A Botched Survivor"><img width="740" height="560" src="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg" class="attachment-thevoux-style1-2x size-thevoux-style1-2x thb-lazyload wp-post-image lazyautosizes lazyloaded" alt="" sizes="370px" data-src="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg" data-sizes="auto" data-srcset="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg 740w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-370x280.jpg 370w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-20x15.jpg 20w" srcset="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg 740w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-370x280.jpg 370w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-20x15.jpg 20w"></a>
</figure>


So I want to download all images of this type, i.e. those referenced by the data-srcset and srcset attributes.
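
For reference, a srcset (or data-srcset) value like the one above is a comma-separated list of image candidates, each a URL followed by an optional descriptor such as 740w. A minimal Python sketch of pulling the individual URLs out of such a value follows. The function name is hypothetical, and the naive comma split is an assumption that holds for the markup above but would break on URLs that themselves contain commas.

```python
from typing import List


def srcset_urls(value: str) -> List[str]:
    """Return the URLs listed in a srcset-style attribute value.

    Simplified: splits on commas, then takes the first whitespace-separated
    field of each candidate and ignores the width/density descriptor.
    """
    urls = []
    for candidate in value.split(","):
        fields = candidate.split()
        if fields:  # skip empty candidates, e.g. after a trailing comma
            urls.append(fields[0])
    return urls


example = (
    "https://cdn.example.com/blog16-740x560.jpg 740w, "
    "https://cdn.example.com/blog16-370x280.jpg 370w, "
    "https://cdn.example.com/blog16-20x15.jpg 20w"
)
for url in srcset_urls(example):
    print(url)
```

Each URL returned this way would then have to be downloaded separately, which is why an attribute holding several URLs is harder to handle than a plain src.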

coderfx

I added the following custom attributes:

data-original
data-srcset
srcset
//img/@data-src
//img/@data-dm-image-path
//img/@data-srcset


But this does not help me download the images.

Richard Moss

Hello,

Welcome to the forums and thanks for the post. While WebCopy supports custom attributes (as you have discovered) in addition to normal HTML, it currently doesn't support parsing multiple URLs from them and, except for specific cases (e.g. the real srcset attribute), assumes one value per attribute.

This is something I plan to address in version 1.9.

Regards;
Richard Moss

coderfx

Quote from: Richard Moss on July 03, 2020, 07:02:10 AM
Hello,

Welcome to the forums and thanks for the post. While WebCopy supports custom attributes (as you have discovered) in addition to normal HTML, it currently doesn't support parsing multiple URLs from them and, except for specific cases (e.g. the real srcset attribute), assumes one value per attribute.

This is something I plan to address in version 1.9.

Regards;
Richard Moss

Thank you for your response. Do you know how I can track version 1.9 and when it will be available?