http://qs321.pair.com?node_id=1190977


in reply to Re^3: \b in Unicode regex
in thread \b in Unicode regex

You actually had the opposite problem: You had UTF-8, but the regex engine expects a string of Unicode Code Points[1]. utf8::decode provides the latter from the former.


  1. More specifically, it's \w, \b, \d, etc that are defined in terms of UCP.