Why does this code run faster?

Gangabass has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks, I need your help again.

I have some code (not mine) that uses the /g and /o matching modifiers. I don't think they are needed, but the code shows a strange result: it runs faster with these modifiers than without them.

Can you explain why?

Here is the code:

#!/usr/bin/perl -W

use warnings;
use strict;
use Benchmark;

my $phrase1 = 'network';
my $phrase2 = 'networK';
my ($t1, $t2);

##########################################
$t1 = new Benchmark;

for (1..10000000) {
    if ($phrase1 =~ /^network$/go) {}
}

$t2 = new Benchmark;

print timestr (timediff ($t2, $t1)), "\n";

##########################################
$t1 = new Benchmark;

for (1..10000000) {
    if ($phrase2 =~ /^networK$/) {}
}
$t2 = new Benchmark;

print timestr (timediff ($t2, $t1)), "\n";

And here is the result:

7 wallclock secs ( 6.40 usr +  0.01 sys =  6.41 CPU)
9 wallclock secs ( 9.06 usr +  0.00 sys =  9.06 CPU)



Re: Why does this code run faster?
by shmem (Chancellor) on Nov 08, 2007 at 17:57 UTC

#!/usr/bin/perl
#
use Benchmark qw( cmpthese );

cmpthese( -2, {
    g => sub { 'network' =~ /^network$/g  },
    c => sub { 'network' =~ /^network$/gc },
    s => sub { 'networK' =~ /^networK$/   },
} );
__END__
       Rate    s    g    c
s 2106781/s   -- -28% -68%
g 2931188/s  39%   -- -56%
c 6602248/s 213% 125%   --

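The /c modifier gives a real boost! Why? While /g is in effect, /c keeps a failed match from resetting the search position (see perlop). So the pattern is tested once from the beginning of the string, and every further invocation of the same match starts testing at the end of the string and fails quickly. With /g alone, every other match fails: a success leaves the position at the end of the string, the next attempt fails from there and resets the position, and the attempt after that succeeds again. A failed attempt is cheaper than a successful one, which would account for g beating s.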
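To make the position handling visible, here is a minimal sketch that records whether each of four consecutive scalar-context matches succeeds. The variable names ($s, @g, @gc) are only for illustration, and a variable is used instead of a literal string so that pos() can be reset between the two runs.

#!/usr/bin/perl
use strict;
use warnings;

my $s = 'network';

# /g alone: a success leaves pos() at the end of the string, the next
# attempt fails from there and resets pos(), so results alternate.
my @g = map { scalar( $s =~ /^network$/g ) ? 1 : 0 } 1 .. 4;
print "g:  @g\n";     # g:  1 0 1 0

pos($s) = undef;      # start the second run from the beginning

# /gc: a failure does not reset pos(), so only the first attempt succeeds.
my @gc = map { scalar( $s =~ /^network$/gc ) ? 1 : 0 } 1 .. 4;
print "gc: @gc\n";    # gc: 1 0 0 0

So in a tight loop the /g version does real matching work only half of the time, and the /gc version only once.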

--shmem

_($_=" "x(1<<5)."?\n".q·/)Oo.  G°\        /
                              /\_¯/(q    /
----------------------------  \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}


Re: Why does this code run faster?
by kyle (Abbot) on Nov 08, 2007 at 17:19 UTC

The /o modifier isn't doing anything. Try running with and without it; when I did, there wasn't any difference. That modifier only affects patterns that have a variable interpolated in them, and your patterns don't.
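For contrast, here is a small sketch of a case where /o does change behaviour, because the pattern interpolates a variable. The names ($word, @with_o, @without_o) are made up for the example and do not come from the original code.

#!/usr/bin/perl
use strict;
use warnings;

my ( @with_o, @without_o );

for my $word (qw( foo bar )) {
    # With /o the pattern is built once, from the first value of $word
    # ('foo'), and later changes to $word are ignored.
    push @with_o,    ( 'bar' =~ /^$word$/o ? 1 : 0 );

    # Without /o the pattern follows the current value of $word.
    push @without_o, ( 'bar' =~ /^$word$/ ? 1 : 0 );
}

print "with /o:    @with_o\n";       # with /o:    0 0
print "without /o: @without_o\n";    # without /o: 0 1

A pattern such as /^network$/ has nothing to interpolate, so there is nothing for /o to freeze.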

The /g modifier seems to be the one that's making the difference, but I don't see why. I'd actually expect it to work the opposite way from what it does.

You might also want to see No More Meaningless Benchmarks! The operations involved are ridiculously fast, so I'm not sure how useful (or accurate) it is to compare them. Consider:

use Benchmark qw( cmpthese );

cmpthese( 10_000_000, { lower => sub { 'network' =~ /^network$/ },
                        upper => sub { 'networK' =~ /^networK$/ } } );
__END__
           Rate lower upper
lower 3174603/s    --   -9%
upper 3496503/s   10%    --

Matching uppercase is faster than lowercase? Seriously?

I tried comparing literally the same subs, and there was still a 1% difference.
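One way to repeat that check is sketched below: two byte-for-byte identical subs are compared, so any gap in the chart is measurement noise. The labels same_a and same_b and the variable $rows are hypothetical, and the numbers will differ from run to run and from machine to machine.

#!/usr/bin/perl
use strict;
use warnings;
use Benchmark qw( cmpthese );

my $s = 'network';

# Run each sub for at least 1 CPU second. Both subs do exactly the same
# thing, so the percentage reported between them is a rough noise floor.
# cmpthese prints the chart and also returns its rows as an array ref.
my $rows = cmpthese( -1, {
    same_a => sub { my $ok = $s =~ /^network$/ },
    same_b => sub { my $ok = $s =~ /^network$/ },
} );

A difference between two real candidates that is no bigger than the gap shown here probably should not be read as meaningful.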

Adding an explicit scalar context brought the difference down a bit, but I'm not sure because the results aren't very consistent. In fact, I'd call them downright erratic.

All that being said, if someone can explain why a /g would make the pattern faster, I'd be very interested to hear. As it stands, I think there isn't a meaningful or consistent difference.