URL not writing to download folder properly

Started by FrankWard, April 11, 2024, 03:59:41 PM

Previous topic - Next topic

FrankWard

I'm trying to download a blog. I've encountered several errors I was able to mitigate using regex. However, I'm not sure how to address this one.

When downloading the site it pulls various images from domains like.

http://3.bp.blogspot.com/_-sFohRgxOBI/R3natx-8SXI/AAAAAAAABiA/mp-2BeZnnYk/s1600-h/happy-new-year+Woody+Woodpecker.jpg

In the download folder it's not putting the domain as a folder. It's just writing a TON of the urls starting with the "_"character after the domain like so..

_-sFohRgxOBI/R3natx-8SXI/AAAAAAAABiA/mp-2BeZnnYk/s1600-h/happy-new-year+Woody+Woodpecker.jpg

This happens with the following domains, but not others, so it appears it may be a bug.

http://1.bp.blogspot.com/
http://2.bp.blogspot.com/
http://3.bp.blogspot.com/
http://4.bp.blogspot.com/

Is there a way to make the program remap these domains to specific folders in the downloaded site hierarchy?
Any advice? Thanks!