http://qs321.pair.com?node_id=376828


in reply to utf weirdness in regex

Using decode here is very wrong. Decode is if you have a sequence that is in utf8, but perl does not know it. Your's is in latin1 and it does not convert to valid utf8. retry it with
$string1 = Encode::decode(utf8 => $string1, Encode::FB_CROAK);
to convert all to valid unicode, try:
$string1 = Encode::decode(latin1 => $string1, Encode::FB_CROAK); $string2 = Encode::decode(latin1 => $string2, Encode::FB_CROAK); $string3 = Encode::decode(latin1 => $string3, Encode::FB_CROAK);
Boris