in reply to Strange Unicode normalization question
but I have no evidence of NonspacingMark ever being in the normalized string.
There are three in the example you gave:
use strict; use warnings qw( all ); use feature qw( say ); use utf8; use open ':std', ':encoding(UTF-8)'; use charnames qw( ); use Unicode::Normalize qw( NFKD ); my $html = "Lubomír,Bartoňová"; my $decomposedHtml = NFKD( $html ); say charnames::viacode(ord($_)) for $decomposedHtml =~ /(\p{NonspacingMark})/g;
Output:
COMBINING ACUTE ACCENT COMBINING CARON COMBINING ACUTE ACCENT
The code you posted is a hack to find an ASCII "equivalent" to the input.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Strange Unicode normalization question
by mje (Curate) on Aug 16, 2018 at 18:10 UTC | |
Re^2: Strange Unicode normalization question
by Veltro (Hermit) on Aug 16, 2018 at 11:43 UTC | |
by ikegami (Patriarch) on Aug 16, 2018 at 14:43 UTC | |
Re^2: Strange Unicode normalization question
by mje (Curate) on Aug 16, 2018 at 17:35 UTC | |
by Your Mother (Archbishop) on Aug 16, 2018 at 17:46 UTC |
In Section
Seekers of Perl Wisdom