PerlMonks  
I do this with a file upload and wp2html, which creates really lean HTML and has the added bonus of working with WordPerfect docs too. I'm really happy with this solution: it's fast as heck, the HTML is pretty good, and you have mucho control over the generated HTML.

While you can get the source, there is a 5 pound licensing fee (very reasonable, considering the amount of work that must have gone into this). The author is very responsive, too.

I've tried wvHTML too, but I like wp2html better because it keeps the intent of the document and a good amount of the formatting without trying to stay TOO true to the original layout. Basically, wp2html gets the good stuff, while wvHTML jumps through too many hoops to keep the converted document looking like the original Word doc.

If you can upload a compiled binary to your server, I highly suggest you check it out. It rocks!

-Any sufficiently advanced technology is
indistinguishable from doubletalk.


In reply to Re: Converting Word97 (or later) exported HTML to valid HTML by Hero Zzyzzx
in thread Converting Word97 (or later) exported HTML to valid HTML by projekt21
