Hello folks.
I have been playing around with WebCopy and I think it's a beautifully-built app.
However, I'm not much good at stuff like creating rules within it as the syntax structure is foreign to me.
What I would like to be able to do is to save only the HTML (text) files of a given web site, not all of the ancillary files like GIFs, PDFs, JPGs, etc.
Could somebody please help me create a rule to save HTML files only?
Thank you.
Hello,
Thanks for the question. You don't need to use rules for this as there is a simpler approach
- Open Project Properties (Project | Project Properties)
- Select Content Types in the left hand tree
- Click Include only resources with the content types listed below
- In Types to include, enter
text/html - Click OK to apply the changes and close the dialog
- Copy the website
Regards;
Richard Moss
Thank you very much for the reply. That worked beautifully. :)
I tried this using application/pdf to download only pdf files from a site and it downloads nothing at all.
Quote from: Richard Moss on February 23, 2023, 07:37:29 PMHello,
Thanks for the question. You don't need to use rules for this as there is a simpler approach
- Open Project Properties (Project | Project Properties)
- Select Content Types in the left hand tree
- Click Include only resources with the content types listed below
- In Types to include, enter
text/html - Click OK to apply the changes and close the dialog
- Copy the website
Regards;
Richard Moss