Recent posts

#91
WebCopy / Re: Why are external links pro...
Last post by Manuela - July 14, 2023, 12:04:35 PM
Thank you for your answer.

I guess I misunderstood this option. Now I have it unchecked and will see what happens.
#92
WebCopy / Re: Why are external links pro...
Last post by Richard Moss - July 13, 2023, 09:05:00 PM
Because I can't win.

If external links are excluded, then people complain that resources on CDNs aren't downloaded automatically.

If external links are included, then people complain that external links are being scanned.

As WebCopy has evolved, it tries to have a sensible set of defaults, and the sensible default in this pair is to assume that a user wants to copy linked resources. That means while it won't automatically download or crawl HTML on external sites, it will still try to get the content type in order to determine if it is a resource (such as your PDF) that it will download.

See the Downloading all resources setting for details - this controls whether external URLs are queried for their content type in order to download non-HTML resources.

If there are certain domains you don't care about, then create rules to exclude them.
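
As a rough illustration of the behaviour described above, here is a minimal Python sketch, not WebCopy's actual code. The function names, the HEAD-based lookup and the choice to skip external URLs with an unknown content type are all assumptions made for the example.

```python
from urllib.parse import urlsplit
from urllib.request import Request, urlopen


def is_external(url: str, site_host: str) -> bool:
    """Return True when url points at a different host than the site being copied."""
    return (urlsplit(url).hostname or "").lower() != site_host.lower()


def is_html(content_type: str) -> bool:
    """Return True for HTML media types, ignoring parameters such as charset."""
    media_type = content_type.split(";", 1)[0].strip().lower()
    return media_type in {"text/html", "application/xhtml+xml"}


def query_content_type(url: str, timeout: float = 10.0) -> str:
    """Issue a HEAD request and return the Content-Type header (empty if absent)."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return response.headers.get("Content-Type", "")


def should_download(url: str, site_host: str, content_type: str) -> bool:
    """Hypothetical policy: internal URLs are always copied, external URLs
    only when their content type is known and is not HTML."""
    if not is_external(url, site_host):
        return True
    return bool(content_type) and not is_html(content_type)
```

In this sketch a linked PDF on another host would be fetched, while an external HTML page would only cost the one HEAD request. That request is also where the delay comes from when the linked site is slow or unavailable.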

Regards;
Richard Moss
#93
WebCopy / Re: How deep in the night was ...
Last post by Richard Moss - July 13, 2023, 08:55:39 PM
Hello,

Thanks for the bug report! Turns out the OK button was being localised twice, once for the OK and then again for the Cancel, instead of the actual Cancel button.

Bug is fixed and will be in tomorrow's nightly as today's window has passed - it isn't that deep in the night really  ;D

Thanks again for noticing this!

Regards;
Richard Moss
#94
WebCopy / How deep in the night was the ...
Last post by Manuela - July 13, 2023, 06:00:42 PM
This happens when you want to export the results. The left Cancel button actually does the export.
#95
WebCopy / Why are external links process...
Last post by Manuela - July 13, 2023, 05:34:30 PM
This is very time-consuming, especially when the linked site is not available or a large PDF is linked.
#96
WebCopy / Backing up a MediaWiki wiki
Last post by Manuela - July 13, 2023, 05:00:22 PM
I am still experimenting with WebCopy.

Are there recommended options for creating a backup of a MediaWiki wiki?

Until now I have created a bunch of rules to exclude all unwanted links like editing, history etc.

Additionally I have created a .css file to suppress those links (the wiki does not look like a wiki any longer)

Thank you and greetings

--Manuela
#97
WebCopy / Re: Skip reason is Failed alth...
Last post by Manuela - July 11, 2023, 05:28:06 PM
Thank you, I will give it a try.
#98
WebCopy / Re: Skip reason is Failed alth...
Last post by Richard Moss - July 08, 2023, 07:16:05 PM
Hello,

Thanks for the message. There was a bug introduced in a recent rework of WebCopy rule processing which meant that if you had rules that checked content types or minimum or maximum file sizes, then WebCopy would incorrectly issue a HEAD test to get the content type and length. This was resolved in the later 1.9.1 builds, so if you have a current 1.9.1 build (870 at time of writing) this issue should be resolved.

I'm not 100% sure if this is related to the issue you're having, but it is (or was!) the only occurrence I'm aware of where excluded URLs would be accessed.
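
To illustrate the kind of ordering involved, here is a hedged Python sketch, not WebCopy's real rule engine. The `Rule` type, `is_excluded` and the `fetch_length` callback are hypothetical names. The idea is that rules which only look at the URL are evaluated first, so a URL that is already excluded never triggers a HEAD request.

```python
import re
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass
class Rule:
    pattern: str                     # regular expression matched against the URL
    max_bytes: Optional[int] = None  # optional size limit; checking it needs a HEAD request


def is_excluded(url: str, rules: Iterable[Rule],
                fetch_length: Callable[[str], int]) -> bool:
    """Return True if any rule excludes the URL, avoiding network access where possible."""
    rules = list(rules)
    # Pass 1: rules that only inspect the URL need no network access.
    for rule in rules:
        if rule.max_bytes is None and re.search(rule.pattern, url):
            return True
    # Pass 2: only now query the size, and only for rules that need it.
    for rule in rules:
        if rule.max_bytes is not None and re.search(rule.pattern, url):
            if fetch_length(url) > rule.max_bytes:
                return True
    return False
```

Here `fetch_length` stands in for whatever performs the HEAD request and returns the reported content length.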

Regards;
Richard Moss
#99
WebCopy / Re: Poor image resolution for ...
Last post by Richard Moss - July 08, 2023, 09:52:34 AM
Hello,

This doesn't sound like something WebCopy would be able to help with. A map dynamically loads images of different qualities and scales as one zooms in and out, which pretty much guarantees that JavaScript is involved. WebCopy doesn't execute JavaScript and so generally can't copy these websites.

In addition, maps are interactive, e.g. they only update when you scroll or zoom. These interactions will almost certainly never be supported by WebCopy.

Sorry, but I don't think WebCopy is a good fit for this scenario.

Obligatory documentation link: https://docs.cyotek.com/cyowcopy/current/javascript.html

Regards;
Richard Moss
#100
WebCopy / Re: redirect download folder?
Last post by Richard Moss - July 08, 2023, 09:48:27 AM
Hello,

Thanks for the suggestion. I've logged this as issue #494, should nicely complement some other grouping options that I logged... well, quite a long time ago.

No timeline for implementing this at the moment, unfortunately.

Thanks again for the useful suggestion!

Regards;
Richard Moss