Pathologically Eclectic Rubbish Lister | |
PerlMonks |
UTF-8 and XML::LibXMLby davies (Prior) |
on Nov 26, 2019 at 11:53 UTC ( [id://11109243]=perlquestion: print w/replies, xml ) | Need Help?? |
davies has asked for the wisdom of the Perl Monks concerning the following question: XML::LibXML seems to be doing strange things to UTF-8 encoded strings.
Some of my output is below. I have removed the lines printing the characters as that would involve more rendering issues.
My real case is reading files, but I am getting the issue demonstrated in this example. The character I have chosen is one that is causing problems (a U with an acute accent), but other characters are being transformed as well. Given that the XML is flagged as being UTF-8, I cannot see anything in the docs indicating why this transformation should take place. What have I missed, please? Regards, John Davies
Back to
Seekers of Perl Wisdom
|
|