in reply to Character coding issues with Spreadsheet::XLSX
If M$ somehow decided that Excel 2007 would not change the way unicode is handled in spreadsheets, then this might help you out: xls2tsv uses the old Spreadsheet::ParseExcel, but if the unicode handling hasn't changed, then you'll find a consistent clue about when you need to "decode()" from UTF-16BE into utf8 to get what you want.
Then again, if M$ did decide to change their unicode handling in Excel, you might need to get some sort of hex-dump picture of the character data in the cells of interest. Save a spreadsheet with known non-ascii characters in selected cells, and you should be able to work out what needs to be done.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Character coding issues with Spreadsheet::XLSX
by suaveant (Parson) on Feb 20, 2009 at 21:00 UTC | |
by Anonymous Monk on Sep 21, 2010 at 16:21 UTC | |
by Anonymous Monk on Sep 22, 2010 at 03:13 UTC |
In Section
Seekers of Perl Wisdom