in reply to Re^3: \b in Unicode regex
in thread \b in Unicode regex
You actually had the opposite problem: You had UTF-8, but the regex engine expects a string of Unicode Code Points[1]. utf8::decode provides the latter from the former.
More specifically, it's \w, \b, \d, etc that are defined in terms of UCP.
In Section
Seekers of Perl Wisdom