![]() |
|
go ahead... be a heretic | |
PerlMonks |
Re^2: Extracting text from MS Word files on a Linux boxby afoken (Chancellor) |
on Jun 21, 2018 at 20:18 UTC ( #1217133=note: print w/replies, xml ) | Need Help?? |
Have you tried strings? Always used to do the trick before the MS format changed. docx is just a bunch of zipped XML files and some misc files. strings will fail due to ZIP, but once unpacked, strings will happily dig through the XML files. Alexander
-- Today I will gladly share my knowledge and experience, for there are no sweeter words than "I told you so". ;-)
In Section
Seekers of Perl Wisdom
|
|