Some googling turned up this by Util. ++ to him. This is a cut-down version that could get you started.
#!/usr/bin/perl
use strict;
use warnings;
use Win32::OLE;
use Win32::OLE::Enum;

# Attach to an already running Word instance (undef if there is none).
my $word = Win32::OLE->GetActiveObject('Word.Application')
    or die "Word is not running\n";
my $document = $word->ActiveDocument
    or die "No document is open in Word\n";
my $paragraphs = $document->Paragraphs();
my $enumerate  = Win32::OLE::Enum->new($paragraphs);

while ( defined( my $paragraph = $enumerate->Next() ) ) {
    my $style = $paragraph->{Style}->{NameLocal};
    my $text  = $paragraph->{Range}->{Text};
    $text =~ tr{\n\r}{}d;     # drop the paragraph terminator
    $text =~ tr{\x0b}{\n};    # Word's manual line break (vertical tab) becomes a newline
    printf qq{%s -> ***%s***\n}, $style, $text;
}
It assumes Word is running with a document open. My simple document parsed as:
Heading 1 -> ***Heading 1 text***
Heading 2 -> ***Heading 2 text***
Normal -> ***Normal***
For producing HTML I would consider something like HTML::Element.
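To give an idea of how that might look, here is a minimal sketch that turns (style, text) pairs like the ones printed above into HTML fragments with HTML::Element. The %tag_for mapping and the paragraph_to_html helper are my own invention for illustration, not anything from Util's code, and the pairs are hard-coded so it runs without Word. Treat it as a starting point only.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::Element;    # ships with the HTML-Tree distribution on CPAN

# Hypothetical mapping from Word style names to HTML tags.
my %tag_for = (
    'Heading 1' => 'h1',
    'Heading 2' => 'h2',
    'Normal'    => 'p',
);

# Build one HTML fragment for a (style, text) pair.
sub paragraph_to_html {
    my ( $style, $text ) = @_;
    my $tag     = $tag_for{$style} || 'p';    # unknown styles fall back to <p>
    my $element = HTML::Element->new($tag);
    $element->push_content($text);

    # '<>&' names the characters to entity-encode, undef means no
    # indenting, and {} asks for every closing tag to be written out.
    my $html = $element->as_HTML( '<>&', undef, {} );
    $element->delete;
    $html =~ s/\s+\z//;                       # trim any trailing newline
    return $html;
}

# Stand-in data; in the real script these would come from the Word loop.
my @paragraphs = (
    [ 'Heading 1', 'Heading 1 text' ],
    [ 'Heading 2', 'Heading 2 text' ],
    [ 'Normal',    'Normal' ],
);

print paragraph_to_html(@$_), "\n" for @paragraphs;
```

In the Word script you would call paragraph_to_html($style, $text) in place of the printf. For a whole page you would probably build a single tree (html, body, then the paragraphs) and call as_HTML once at the end instead.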