File Links don't work if File Name has # in name

Started by LinuxDan2015, November 22, 2021, 08:43:07 PM

Previous topic - Next topic

LinuxDan2015

I am working on a project capturing web pages from a client's web enabled database. I have 815 pages to pull and each one of them has one or more file attachments. The file attachments are PO records so many of them have names like PO# 999.pdf or Invoice for PO# 888.xls. The WebCopy tool brings them all down to the local drive fine, but in the HTML code it generates, it leaves the # symbol in the anchor tag which breaks the link. In the href= area it should convert # to %23.

Example of code from WebCopy:

a href="ALC- Quote #SAT2510BU2 07.19.13 .xls"

Should be saved as:

a href="ALC- Quote %23SAT2510BU2 07.19.13 .xls"

Is there some switch or option I am missing so that the app will convert these properly?

I have tried using WinHTTrack against this same database and it pulls down the HTML correctly, using %23 in the href tag instead of #. However, WinHTTrack is so, so, so, so, so much slower than Cyotek's WebCopy. I want to use Cyotek's tool! :-)

Thanks!!

Richard Moss

Hello,

Thanks for the bug report. I've logged this as issue #433 and a fix has been committed - it should be in 1.9.817 or higher (a nightly will be available later on today hopefully).

Regards;
Richard Moss
Read "Before You Post" before posting (https://forums.cyotek.com/cyotek-webcopy/before-you-post/). Do not send me private messages. Do not expect instant replies.

All responses are hand crafted. No AI involved. Possibly no I either.