Rule to save HTML files only -- seeking help with that

Started by Jimbo, February 17, 2023, 03:14:23 PM

Jimbo

Hello folks.

I have been playing around with WebCopy and I think it's a beautifully-built app.

However, I'm not much good at stuff like creating rules within it as the syntax structure is foreign to me.

What I would like to be able to do is to save only the HTML (text) files of a given web site, not all of the ancillary files like GIFs, PDFs, JPGs, etc.

Could somebody please help me create a rule to save HTML files only?

Thank you.

Richard Moss

Hello,

Thanks for the question. You don't need to use rules for this, as there is a simpler approach:

  • Open Project Properties (Project | Project Properties)
  • Select Content Types in the left-hand tree
  • Click Include only resources with the content types listed below
  • In Types to include, enter:
    text/html
  • Click OK to apply the changes and close the dialog
  • Copy the website
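
For anyone curious what an include-only content type list amounts to, here is a minimal Python sketch of the idea. It is an illustration only and not how WebCopy is implemented; the function names `media_type` and `should_save` and the sample URLs are made up. It assumes the decision is based on the Content-Type header the server sends, which may carry extra parameters such as a charset.

```python
def media_type(header_value):
    """Return the bare media type from a Content-Type header value.

    'text/html; charset=utf-8' becomes 'text/html'. Parameters after the
    semicolon are dropped, and the result is lower-cased because media
    types are case-insensitive.
    """
    return header_value.split(";", 1)[0].strip().lower()


def should_save(header_value, include_only):
    """Return True if the response's media type is in the include-only list."""
    allowed = {t.strip().lower() for t in include_only}
    return media_type(header_value) in allowed


# Hypothetical responses: (URL, Content-Type header as sent by the server)
responses = [
    ("https://example.com/index.html", "text/html; charset=utf-8"),
    ("https://example.com/logo.gif", "image/gif"),
    ("https://example.com/manual.pdf", "application/pdf"),
]

include_only = ["text/html"]

for url, content_type in responses:
    verdict = "save" if should_save(content_type, include_only) else "skip"
    print(verdict, url)
```

With `text/html` as the only entry, just the first of the three sample responses would be kept.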

Regards;
Richard Moss

Jimbo

Thank you very much for the reply. That worked beautifully.  :)

gary1854

I tried this using application/pdf to download only PDF files from a site, and it downloads nothing at all.

Quote from: Richard Moss on February 23, 2023, 07:37:29 PM

Hello,

Thanks for the question. You don't need to use rules for this, as there is a simpler approach:

  • Open Project Properties (Project | Project Properties)
  • Select Content Types in the left-hand tree
  • Click Include only resources with the content types listed below
  • In Types to include, enter:
    text/html
  • Click OK to apply the changes and close the dialog
  • Copy the website

Regards;
Richard Moss