http://qs321.pair.com?node_id=11141637


in reply to Re^2: Splitting multiline string into words, the stuff between words, and newlines
in thread Splitting multiline string into words, the stuff between words, and newlines

That is because \b{wb} matches between those signs.

This seems to solve the issue:

my @fragments = grep length, split /(\b{wb}\w.*?\b{wb}|\n+)/, $book;

But my knowledge of Unicode and the \b{wb} semantics is rather limited so that may have other issues.