comment on

Looks like you want to compute the Hamming distance (number of character positions in which they disagree) between two words. Do you really really need a regex, or will another way of matching be ok?

If you're not strongly committed to using a regex, use something like this (probably will do strange things for non-ascii character sets):

sub hamming {
  my ($x1, $x2) = @_;
  return -1 if length($x1) != length($x2);

  (my $xor = $x1 ^ $x2) =~ tr/\x0//cd;
}

print hamming(@$_), $/
   for [qw[ abcdef abccef ]],
       [qw[ abcdef abc ]],
       [qw[ abcdef abbbbf ]];
__END__
1
-1
3
[download]

However, you could still cram this hamming function inside of a regex, with some trickery/cheating:

# untested
my $target = "abcdef";
my $distance = 1;

my $len = length $target;
my $qr  = qr/ \b(\w{$target})\b (?(??{ hamming($target,$1) < $distance
+ }) | (?!))  /x
[download]

I'm in a bit of a rush at the moment, so my (?(cond)pattern) syntax is probably wrong. Also, it could be optimized greatly to avoid backtracking (say, with (?>pattern) to capture $1). Maybe someone can help me out here with the details, but this should give some idea..

blokhead

In reply to Re: searching for a string w/ a * in any single position? by blokhead
in thread searching for a string w/ a * in any single position? by mdunnbass

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


We don't bite newbies here... much
	PerlMonks