in reply to Re^2: Splitting multiline string into words, the stuff between words, and newlines
in thread Splitting multiline string into words, the stuff between words, and newlines
That is because \b{wb} matches between those signs.
This seems to solve the issue:
my @fragments = grep length, split /(\b{wb}\w.*?\b{wb}|\n+)/, $book;
My knowledge of Unicode and of the \b{wb} semantics is rather limited, though, so this may have other issues.
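For anyone who wants to try it, here is a minimal sketch of that split on a small input. The sample text in $book is made up for illustration and is not from the original thread, and \b{wb} needs Perl 5.22 or later. Because the whole pattern sits inside one capture group, split returns the matched words and newline runs along with whatever lies between them, so joining the fragments should give back the original string.

```perl
use strict;
use warnings;
use 5.022;    # \b{wb} is only available from Perl 5.22 on

# Hypothetical sample text, not taken from the thread.
my $book = "Hello, world!\nSecond line.\n";

# A word is a \w character at a word boundary, extended lazily to the
# next word boundary; runs of newlines are the second alternative.
# Everything split() finds between matches is the "stuff between words".
# grep length drops the empty strings split() produces between adjacent matches.
my @fragments = grep length, split /(\b{wb}\w.*?\b{wb}|\n+)/, $book;

# Print one fragment per line, with newlines made visible.
for my $fragment (@fragments) {
    (my $visible = $fragment) =~ s/\n/\\n/g;
    print "[$visible]\n";
}

# Nothing is lost: the fragments concatenate back to the input.
print "round trip ok\n" if join('', @fragments) eq $book;
```

The round-trip check is a cheap way to confirm on your own data that the pattern only partitions the text and does not drop anything.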