<?xml version="1.0" encoding="UTF-8"?>
<w:document
  xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main"
  xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
  xmlns:v="urn:schemas-microsoft-com:vml"
  xmlns:ve="http://schemas.openxmlformats.org/markup-compatibility/2006"
  xmlns:w10="urn:schemas-microsoft-com:office:word"
  xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml"
  xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing">
<w:body>
<w:p w:rsidR="00D479B1" w:rsidRDefault="00D479B1">
<w:r>
<w:t>1234</w:t>
</w:r>
</w:p>
<w:p w:rsidR="00D479B1" w:rsidRDefault="00D479B1">
<w:r>
<w:t>5678</w:t>
</w:r>
</w:p>
<w:sectPr w:rsidR="00D479B1" w:rsidSect="00D479B1">
<w:pgSz w:w="11906" w:h="16838" />
<w:pgMar w:top="1440" w:right="1800" w:bottom="1440" w:left="1800" w:header="708" w:footer="708" w:gutter="0" />
<w:cols w:space="708" />
<w:docGrid w:linePitch="360" />
</w:sectPr>
</w:body>
</w:document>
becomes:
<?xml version="1.0" encoding="UTF-8"?>
<w:document
  xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main"
  xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math"
  xmlns:o="urn:schemas-microsoft-com:office:office"
  xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
  xmlns:v="urn:schemas-microsoft-com:vml"
  xmlns:ve="http://schemas.openxmlformats.org/markup-compatibility/2006"
  xmlns:w10="urn:schemas-microsoft-com:office:word"
  xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml"
  xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing">
<w:body>
<w:p w:rsidR="00D479B1" w:rsidRDefault="00D479B1">
<w:r>
<w:t>1234</w:t>
</w:r>
</w:p>
<w:p>
<w:r>
<w:br w:type="page" />
</w:r>
</w:p>
<w:p w:rsidR="00D479B1" w:rsidRDefault="00D479B1">
<w:r>
<w:t>5678</w:t>
</w:r>
</w:p>
<w:sectPr w:rsidR="00D479B1" w:rsidSect="00D479B1">
<w:pgSz w:w="11906" w:h="16838" />
<w:pgMar w:top="1440" w:right="1800" w:bottom="1440" w:left="1800" w:header="708" w:footer="708" w:gutter="0" />
<w:cols w:space="708" />
<w:docGrid w:linePitch="360" />
</w:sectPr>
</w:body>
</w:document>
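If it helps, the before/after change above can be sketched in a few lines of code. This is only an illustration in Python using the standard `xml.etree.ElementTree` module, not the method anyone in this thread actually used. The helper names (`w`, `page_break_paragraph`, `insert_page_breaks`) and the trimmed-down `SAMPLE` document are made up for the example, and it assumes you want a break before every top-level paragraph except the first.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, as declared on <w:document> above.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
ET.register_namespace("w", W_NS)  # keep the "w:" prefix when serializing


def w(tag):
    """Return the namespace-qualified name for a w: element or attribute."""
    return f"{{{W_NS}}}{tag}"


def page_break_paragraph():
    """Build <w:p><w:r><w:br w:type="page"/></w:r></w:p>."""
    p = ET.Element(w("p"))
    r = ET.SubElement(p, w("r"))
    ET.SubElement(r, w("br"), {w("type"): "page"})
    return p


def insert_page_breaks(document_xml):
    """Insert a page-break paragraph before every body paragraph but the first."""
    root = ET.fromstring(document_xml)
    body = root.find(w("body"))
    # Snapshot the paragraphs first, since we modify the body while looping.
    paragraphs = [child for child in body if child.tag == w("p")]
    for para in paragraphs[1:]:
        index = list(body).index(para)
        body.insert(index, page_break_paragraph())
    return ET.tostring(root, encoding="unicode")


# Hypothetical, trimmed-down stand-in for the first listing above.
SAMPLE = (
    f'<w:document xmlns:w="{W_NS}"><w:body>'
    "<w:p><w:r><w:t>1234</w:t></w:r></w:p>"
    "<w:p><w:r><w:t>5678</w:t></w:r></w:p>"
    "<w:sectPr/>"
    "</w:body></w:document>"
)

if __name__ == "__main__":
    print(insert_page_breaks(SAMPLE))
```

On a real .docx you would first have to pull `word/document.xml` out of the zip container and put the modified copy back afterwards; that part is left out here.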
See also the other links already provided in this thread, and their associated links. To be honest, your workflow ('I'm using Perl to scrape text from a JavaScript that printed out one page at a time..') seems somewhat convoluted, but you don't go into much detail.