Cyotek Forums

Products => WebCopy => Topic started by: Jimbo on February 17, 2023, 03:14:23 PM

Title: Rule to save HTML files only -- seeking help with that
Post by: Jimbo on February 17, 2023, 03:14:23 PM
Hello folks.

I have been playing around with WebCopy and I think it's a beautifully-built app.

However, I'm not much good at stuff like creating rules within it as the syntax structure is foreign to me.

What I would like to be able to do is to save only the HTML (text) files of a given web site, not all of the ancillary files like GIFs, PDFs, JPGs, etc.

Could somebody please help me create a rule to save HTML files only?

Thank you.






Title: Re: Rule to save HTML files only -- seeking help with that
Post by: Richard Moss on February 23, 2023, 07:37:29 PM
Hello,

Thanks for the question. You don't need to use rules for this as there is a simpler approach


Regards;
Richard Moss
Title: Re: Rule to save HTML files only -- seeking help with that
Post by: Jimbo on February 25, 2023, 12:23:41 AM
Thank you very much for the reply. That worked beautifully.  :)
Title: Re: Rule to save HTML files only -- seeking help with that
Post by: gary1854 on March 21, 2024, 06:05:58 PM
I tried this using application/pdf to download only pdf files from a site and it downloads nothing at all. 

Quote from: Richard Moss on February 23, 2023, 07:37:29 PMHello,

Thanks for the question. You don't need to use rules for this as there is a simpler approach

  • Open Project Properties (Project | Project Properties)
  • Select Content Types in the left hand tree
  • Click Include only resources with the content types listed below
  • In Types to include, enter
    text/html
  • Click OK to apply the changes and close the dialog
  • Copy the website

Regards;
Richard Moss