Cyotek Forums

Products => WebCopy => Topic started by: coderfx on June 30, 2020, 11:13:58 PM

Title: Site is downloaded But external Image link, CSS linked on pages are not download
Post by: coderfx on June 30, 2020, 11:13:58 PM
Hi there,

When I started copying the website, it downloaded and I can browse it on my local machine. But I noticed that all the images still link to another site.
For example:

Downloading: site1.com

but site1.com has an image src that links to site2.com/image1.jpg

So when I inspect the downloaded HTML file, it still links to site2.com instead of downloading image1.jpg and using a relative path to the locally downloaded image.

Does anyone know how I can download the images from site2.com when I am downloading site1.com (which links to the site2.com images)?
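
To make the scenario concrete, here is a minimal Python sketch of how a link can be classified as pointing to a different host than the page being copied. This is not WebCopy code; the helper name `is_external` and the URLs are assumptions for illustration only.

```python
from urllib.parse import urljoin, urlsplit


def is_external(page_url, resource_url):
    """Return True when resource_url resolves to a different host than page_url."""
    # Resolve relative and protocol-relative references against the page URL.
    absolute = urljoin(page_url, resource_url)
    return urlsplit(absolute).hostname != urlsplit(page_url).hostname


page = "https://site1.com/index.html"  # hypothetical page being copied
print(is_external(page, "https://site2.com/image1.jpg"))  # image hosted on another site
print(is_external(page, "/images/logo.png"))              # image hosted on the same site
```

A crawler that only follows same-host links would skip the first image, which matches the behaviour described above.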
Title: Re: Site is downloaded But external Image link, CSS linked on pages are not download
Post by: coderfx on July 01, 2020, 01:27:29 AM
<figure class="post-gallery">
<a href="https://*****.com/tips-from-a-botched-survivor/" title="Tips From A Botched Survivor"><img width="740" height="560" src="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg" class="attachment-thevoux-style1-2x size-thevoux-style1-2x thb-lazyload wp-post-image lazyautosizes lazyloaded" alt="" sizes="370px" data-src="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg" data-sizes="auto" data-srcset="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg 740w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-370x280.jpg 370w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-20x15.jpg 20w" srcset="https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-740x560.jpg 740w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-370x280.jpg 370w, https://fv3ga3mwjje21cpeee211zcp-wpengine.netdna-ssl.com/wp-content/uploads/2020/05/blog16-20x15.jpg 20w"></a>
</figure>


So I want to download all images referenced this way, in the data-srcset and srcset attributes.
Title: Re: Site is downloaded But external Image link, CSS linked on pages are not download
Post by: coderfx on July 01, 2020, 01:29:16 AM
I added the following custom attributes:

data-original
data-srcset
srcset
//img/@data-src
//img/@data-dm-image-path
//img/@data-srcset


But this is not helping me download the images.
Title: Re: Site is downloaded But external Image link, CSS linked on pages are not download
Post by: Richard Moss on July 03, 2020, 07:02:10 AM
Hello,

Welcome to the forums and thanks for the post. While WebCopy supports custom attributes (as you have discovered) alongside normal HTML, it currently doesn't support parsing multiple URLs from them and, except for specific cases (e.g. the real srcset attribute), assumes one value per attribute.

This is something I plan to address in version 1.9.
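
To illustrate what parsing multiple URLs from a single attribute involves, here is a minimal Python sketch. It is not WebCopy code and does not reflect how WebCopy is implemented; the function name `parse_srcset` and the example.com URLs are made up for illustration, and the sketch assumes the URLs themselves contain no commas.

```python
def parse_srcset(value):
    """Return the URLs from a srcset-style attribute value.

    Each comma-separated candidate is "URL [descriptor]", e.g. "a.jpg 740w".
    Simplification: URLs that contain commas are not handled.
    """
    urls = []
    for candidate in value.split(","):
        parts = candidate.split()
        if parts:  # skip empty candidates such as a trailing comma
            urls.append(parts[0])
    return urls


# Hypothetical attribute value in the same shape as a data-srcset attribute.
data_srcset = (
    "https://cdn.example.com/blog-740x560.jpg 740w, "
    "https://cdn.example.com/blog-370x280.jpg 370w, "
    "https://cdn.example.com/blog-20x15.jpg 20w"
)
for url in parse_srcset(data_srcset):
    print(url)
```

A tool that treats the whole attribute value as one URL would instead see a single invalid address, which is why a custom attribute such as data-srcset yields nothing to download.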

Regards;
Richard Moss
Title: Re: Site is downloaded But external Image link, CSS linked on pages are not download
Post by: coderfx on July 08, 2020, 02:29:01 AM
Quote from: Richard Moss on July 03, 2020, 07:02:10 AM
Hello,

Welcome to the forums and thanks for the post. While WebCopy supports custom attributes (as you have discovered) alongside normal HTML, it currently doesn't support parsing multiple URLs from them and, except for specific cases (e.g. the real srcset attribute), assumes one value per attribute.

This is something I plan to address in version 1.9.

Regards;
Richard Moss

Thank you for your response. Do you know how I can track version 1.9 and when it will be available?