Thanks for the reply.
I've checked all of those, but:
- demoronizer removes the biggest horrors but leaves some behind (e.g. improperly nested tags such as <b><i> ... </b></i>). Maybe I can change the code.
- tidy is the tool of choice (under normal conditions). As I mentioned in CB, the script/website runs on a provider's server where I am not allowed to install software (the customer's poor choice). Besides, I need to drop all CSS stuff, which would require post-parsing tidy's output.
- wvHtml looks interesting, too. I may implement a .doc file upload. However, both restrictions mentioned above (no software installation, no CSS stuff) apply here, too.
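For the "drop all CSS stuff" step, something along these lines might do. This is only a rough sketch in Python, using nothing but the standard library (since nothing can be installed); the same idea should port to a pure-Perl parser. The names `strip_css`, `CssStripper` and `CSS_ATTRS` are made up for illustration, and treating `style`/`class` attributes, <style> blocks and stylesheet <link> tags as "CSS stuff" is my assumption. It does not repair mis-nested tags.

```python
from html import escape
from html.parser import HTMLParser

# Assumption: these attributes are what counts as "CSS stuff".
CSS_ATTRS = {"style", "class"}


class CssStripper(HTMLParser):
    """Re-emit HTML while dropping <style> blocks, stylesheet links and CSS attributes."""

    def __init__(self):
        # Keep entities as written instead of decoding them into text.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.in_style = False

    def _render(self, tag, attrs, close=""):
        parts = [tag]
        for name, value in attrs:
            if name in CSS_ATTRS:
                continue
            # Attribute values arrive unescaped, so escape them again.
            parts.append(name if value is None else f'{name}="{escape(value, quote=True)}"')
        return "<" + " ".join(parts) + close + ">"

    def _is_stylesheet_link(self, tag, attrs):
        rel = (dict(attrs).get("rel") or "").lower().split()
        return tag == "link" and "stylesheet" in rel

    def handle_starttag(self, tag, attrs):
        if tag == "style":
            self.in_style = True
        elif not self._is_stylesheet_link(tag, attrs):
            self.out.append(self._render(tag, attrs))

    def handle_startendtag(self, tag, attrs):
        if not self._is_stylesheet_link(tag, attrs):
            self.out.append(self._render(tag, attrs, " /"))

    def handle_endtag(self, tag):
        if tag == "style":
            self.in_style = False
        else:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.in_style:  # the body of a <style> block is dropped
            self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def strip_css(html_text):
    parser = CssStripper()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out)


sample = '<style>p{color:red}</style><p class="MsoNormal" style="margin:0">Hi <b>there</b></p>'
print(strip_css(sample))  # <p>Hi <b>there</b></p>
```

Going through a parser rather than regexes keeps attribute quoting and entities intact, at the price of a bit more code.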
Thanks for your comments and wisdom. I'll sleep on this for a night (or two) before I go on.
alex pleiner <email@example.com>
zeitform Internet Dienste