http://qs321.pair.com?node_id=1227702


in reply to Handling utf-8 characters when scraping

Assuming that your source file is encoded correctly in UTF-8, then the output you've shown is correct - \x{2026} is U+2026 HORIZONTAL ELLIPSIS. Could you show an SSCCE of code you're having trouble with?