Re: Converting Word97 (or later) exported HTML to valid HTML

by projekt21 (Friar) on Nov 06, 2001 at 16:50 UTC ( #123560=note )

in reply to Converting Word97 (or later) exported HTML to valid HTML

Thanks for the reply.

I've checked all of those, but:

  • demoronizer removes the biggest horrors but leaves some behind (e.g. wrongly nested tags such as <b><i> ... </b></i>). Maybe I can change the code.
  • tidy is the tool of choice (under normal conditions). As I mentioned in CB, the script/website runs on a provider's server where I am not allowed to install software (a poor choice on the customer's part). In any case, I need to drop all the CSS, which means post-parsing tidy's output.
  • wvHtml looks interesting, too. I may implement a doc file upload. However, both restrictions mentioned above (no software installation, no CSS) apply here, too.
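
To make the two remaining problems concrete (wrongly nested tags and leftover CSS), here is a minimal sketch of a cleanup pass that needs nothing installed beyond a stock interpreter. It is written in Python purely for illustration and is not how demoronizer or tidy work. The names WordHtmlCleaner and clean_word_html, and the short lists of CSS attributes and void tags, are assumptions made for this example.

```python
from html import escape
from html.parser import HTMLParser

# Attributes that carry styling (an assumed, minimal list).
CSS_ATTRS = {"style", "class"}
# Elements that never take a closing tag (also a minimal list).
VOID_TAGS = {"br", "hr", "img", "meta", "link", "input"}


class WordHtmlCleaner(HTMLParser):
    """Drops CSS and re-nests tags that were closed in the wrong order."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []          # output fragments
        self.stack = []        # tags that are currently open
        self.in_style = False  # True while inside <style> ... </style>

    def handle_starttag(self, tag, attrs):
        if tag == "style":
            self.in_style = True
            return
        kept = "".join(
            f' {name}="{escape(value or "")}"'
            for name, value in attrs
            if name not in CSS_ATTRS
        )
        self.out.append(f"<{tag}{kept}>")
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag == "style":
            self.in_style = False
            return
        if tag not in self.stack:
            return  # stray close tag, e.g. the </i> left over after re-nesting
        # Close everything opened after `tag`, innermost first, then `tag`.
        while self.stack:
            open_tag = self.stack.pop()
            self.out.append(f"</{open_tag}>")
            if open_tag == tag:
                break

    def handle_data(self, data):
        if not self.in_style:
            self.out.append(escape(data, quote=False))


def clean_word_html(html):
    parser = WordHtmlCleaner()
    parser.feed(html)
    parser.close()
    while parser.stack:  # close anything the input left open
        parser.out.append(f"</{parser.stack.pop()}>")
    return "".join(parser.out)


print(clean_word_html('<p class="MsoNormal" style="margin:0"><b><i>text</b></i></p>'))
```

The closing tags are emitted in stack order, so the overlapping <b><i> pair comes out properly nested, and comments and declarations are dropped because the parser's default handlers ignore them. This is only a sketch: it does not know about optional end tags (an unclosed <p> followed by another <p> ends up nested), so real Word output would need more rules.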

Thanks for your comments and wisdom. I'll sleep on this (for a night or two) before I go on.

alex pleiner
zeitform Internet Dienste

Re: Re: Converting Word97 (or later) exported HTML to valid HTML
by hatter (Pilgrim) on Nov 06, 2001 at 18:07 UTC
    If you can run CGIs, chances are you can upload precompiled binaries, or compile your own binaries on their server from CGIs, and then call them from other scripts. Unless they need to approve scripts and they then put them live - in which case, obfuscate anything and see if they put it live when they don't understand it.
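
As a rough illustration of "call them from other scripts", the sketch below pipes HTML through an external program and reads back what it prints. It assumes the provider permits running uploaded executables at all. The function name run_filter, the path ./bin/tidy and the chosen tidy flags are assumptions for this example and should be checked against whatever binary is actually uploaded.

```python
import subprocess


def run_filter(command, html, timeout=30):
    """Pipe `html` through an external program; return (exit code, output)."""
    result = subprocess.run(
        command,
        input=html.encode("utf-8"),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        timeout=timeout,
        check=False,  # the caller decides which exit codes are acceptable
    )
    return result.returncode, result.stdout.decode("utf-8", errors="replace")


# Hypothetical location of a tidy binary uploaded next to the script;
# the flags are assumptions to verify against that binary's own help output.
TIDY_COMMAND = ["./bin/tidy", "-q", "-asxhtml"]
```

A caller would then use something like run_filter(TIDY_COMMAND, word_html) and inspect the exit code before trusting the output, since a filter may report warnings through a non-zero status.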

    /msg me if you want some more specific hints on doing things on shared servers that the admin thought they could stop.

    the hatter

