Optimizing existing Perl code (in practise)

JaWi has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.

(jeffa) Re: Optimizing existing Perl code (in practise)
by jeffa (Bishop) on Aug 18, 2002 at 22:21 UTC

I think a very important item to optimize is code maintainabibilty - how easy is it to extend your program and fix bugs that break your code?

So, how do i optimize my Perl code? I generally don't (but i do try to get it right the first time - measure twice, cut once). If i do, it is to replace areas of wheel re-invention with CPAN modules, or to refactor items into classes to improve robustness. If i wanted faster code i would port it to C instead, but since most of what i write relies on database and web servers, Perl is 90% of the time not the bottleneck.

jeffa

L-LL-L--L-LL-L--L-LL-L--
-R--R-RR-R--R-RR-R--R-RR
B--B--B--B--B--B--B--B--
H---H---H---H---H---H---
(the triplet paradiddle with high-hat)

[reply]

Re: (jeffa) Re: Optimizing existing Perl code (in practise)

by cybear (Monk) on Aug 19, 2002 at 10:56 UTC

The fastest script in the world is worthless if a change in your
system's directory structure breaks your code and you can't fix it.

If you bother to optimize for anything, do it for maintainability
But never forget "monitorability".

Unless your script is being called to do huge jobs, or your resources
are very restricted (Sparc Ultra 1 or Intel 486, etc.) optimization for
speed is not usually that big an issue.

However, thorough and correct logging of events, meaningful commentary
in the script itself, reusability of the code; these will all help
with maintainability.

[reply]

Re^3: Optimizing existing Perl code (in practise)

by Aristotle (Chancellor) on Aug 19, 2002 at 11:07 UTC

perrin

impressive eToys success story

Makeshifts last the longest.

[reply]

Re: Re^3: Optimizing existing Perl code (in practise)

by cybear (Monk) on Aug 19, 2002 at 14:01 UTC

Re: Optimizing existing Perl code (in practise)
by atcroft (Abbot) on Aug 18, 2002 at 22:07 UTC

The first thing would be to find where your program spends the most time, and look to see if that can be improved. You might look at the Devel::* modules, such as Devel::DProf, to help with this portion of the process.

You might also wish to consult several of the excellent books available. Some of the books I am aware of (and own) that might be helpful include Programming Perl, Advanced Perl Programming, Mastering Algorithms with Perl, and Perl Cookbook.

[reply]

Re: Optimizing existing Perl code (in practise)
by sauoq (Abbot) on Aug 18, 2002 at 22:50 UTC

Successfully choosing the right algorithm takes careful consideration of the problem. If there is a secret to it at all it's probably choosing the right representation for your data. How to do that is a matter of experience and education. There isn't a cookbook solution to it because it usually depends greatly upon details of the problem you need to solve.

-sauoq
"My two cents aren't worth a dime.";

[reply]

Re: Re: Optimizing existing Perl code (in practise)

by Xanatax (Scribe) on Aug 20, 2002 at 10:56 UTC

The real trick isn't optimizing your code but optimizing your solution.

use

[reply]

Re: Optimizing existing Perl code (in practise)
by derby (Abbot) on Aug 18, 2002 at 22:59 UTC

Effective Perl Programming

-derby

[reply]

Re: Optimizing existing Perl code (in practise)
by semio (Friar) on Aug 19, 2002 at 06:44 UTC

converting hex to char

#!c:/perl/bin/perl -w

use strict;
use POSIX qw(strftime);
my $x;
my $maxint = 200000;
my $start = strftime "%H:%M:%S", localtime;

for ($x=0; $x <$maxint;$x++) {
print unpack "H*", "abc"
}

my $finish = strftime "%H:%M:%S", localtime;

print "$start $finish";
[download]

#!c:/perl/bin/perl -w

use strict;
use POSIX qw(strftime);
my $x;
my $maxint = 200000;
my $start = strftime "%H:%M:%S", localtime;

for ($x=0; $x <$maxint;$x++) {
printf "%x%x%x",ord('a'),ord('b'),ord('c');
}

my $finish = strftime "%H:%M:%S", localtime;

print "$start $finish";
[download]

In this case, unpack is the clear winner, although the performance difference doesn't become apparent until after 100000 iterations. So, in my opinion, being that TIMTOWTDI, I would look for a performance differential between these methods and opt for the one that requires the least amount of execution time.

The second thing I would check to see if any shelling out can be replaced by an available perl function. I recently wrote a program that required that the date/time stamps in a log file be updated. For this, I made the mistake of relying on shelling out

my $time1 = `date '+%H:%M:%S'`;
[download]

my $time1 = strftime "%H:%M:%S", localtime;
[download]

cheers, -semio

[reply]
[d/l]
[select]

Re: Re: Optimizing existing Perl code (in practise)

by grep (Monsignor) on Aug 19, 2002 at 07:11 UTC

You should definately look into Benchmark. I was able to reduce your test down to this, and I get the CPU usage

use strict;
use Benchmark;

timethese(1500000,
          {
              'unpack' => 'unpack "H*", "abc"',
              'sprintf'  => 'sprintf "%x%x%x",ord("a"),ord("b"),ord("c
+")'
              }
          );
[download]

Benchmark: timing 1500000 iterations of sprintf, unpack...
   sprintf:  0 wallclock secs ( 0.17 usr +  0.00 sys =  0.17 CPU) @ 88
+23529.41/s (n=1500000)
            (warning: too few iterations for a reliable count)
    unpack: 10 wallclock secs ( 9.87 usr +  0.01 sys =  9.88 CPU) @ 15
+1821.86/s (n=1500000)
[download]

ACCCK!!!Abigail-II caught me in a latenight brain seizure. I shoulda been tipped off by sprintf winning. :( ++Abigail-II

grep

Mynd you, mønk bites Kan be pretti nasti...

[reply]
[d/l]
[select]

No Benchmark is better than a bad Benchmark

by Abigail-II (Bishop) on Aug 19, 2002 at 11:49 UTC

Another thing that should ring loud bells is that you are doing sprintf() in void context. That's not a natural operation. Perhaps Perl optimizes that away for you - totally screwing up your benchmark. It's a simple test:

    $ perl -MO=Deparse -wce 'sprintf "%x%x%x", ord ("a"), ord ("b"), o
+rd ("c")'
    Useless use of a constant in void context at -e line 1.
    BEGIN { $^W = 1; }
    '???';
    -e syntax OK  
    $
[download]

#!/usr/bin/perl
    
use strict;
use warnings 'all';   
    
use Benchmark;
    
timethese -10 => {
    unpack     =>  '$_ = unpack "H*" => "abc"',
    sprintf    =>  '$_ = sprintf "%x%x%x", ord ("a"), ord ("b"), ord (
+"c")',
}

 
__END__
Benchmark: running sprintf, unpack for at least 10 CPU seconds...
   sprintf: 11 wallclock secs (10.25 usr +  0.00 sys = 10.25 CPU) @ 77
+5053.56/s (n=7944299)
    unpack: 11 wallclock secs (10.48 usr +  0.01 sys = 10.49 CPU) @ 33
+1145.09/s (n=3473712)
[download]

sprintf

    $ perl -MO=Deparse -wce '$_ = sprintf "%x%x%x", ord "a", ord "b", 
+ord "c"'
    BEGIN { $^W = 1; }
    $_ = '616263';
    -e syntax OK
    $
[download]

sprintf

    $ perl -MO=Deparse -wce '($a, $b, $c) = split // => "abc";
               $_ = sprintf "%x%x%x", ord $a, ord $b, ord $c'
    BEGIN { $^W = 1; }
    ($a, $b, $c) = split(//, 'abc', 4);
    $_ = sprintf('%x%x%x', ord $a, ord $b, ord $c);
    -e syntax OK
    $
[download]

#!/usr/bin/perl
 
use strict;
use warnings 'all';
   
use Benchmark;
    
use vars qw /$a $b $c $abc/;

$abc = "abc";
($a, $b, $c) = split // => $abc;

timethese -10 => {
    unpack     =>  '$_ = unpack "H*" => $::abc',
    sprintf    =>  '$_ = sprintf "%x%x%x", ord $::a, ord $::b, ord $::
+c',
}
       

__END__
Benchmark: running sprintf, unpack for at least 10 CPU seconds...
   sprintf: 11 wallclock secs (10.51 usr +  0.01 sys = 10.52 CPU) @ 20
+8379.75/s (n=2192155)
    unpack: 10 wallclock secs (10.10 usr +  0.00 sys = 10.10 CPU) @ 32
+3836.04/s (n=3270744)
[download]

unpack

The moral: no benchmark is better than a bad benchmark.

Abigail

[reply]
[d/l]
[select]

Re: Re: Optimizing existing Perl code (in practise)

by snafu (Chaplain) on Aug 19, 2002 at 15:03 UTC

This one particular piece of advice is very good. A peave of mine is when I see people who write Perl scripts and all the work in them is done by using system() calls. What is the point in writing a Perl script if you're not going to use the Perl functions? You might as well write the thing in shell.

Spawning system calls does take more resources and thus it behooves the Perl programmer to try and code the functionality they want using Perl built-ins and modules.

gj! ++ on this one.

_ _ _ _ _ _ _ _ _ _
- Jim
Insert clever comment here...

[reply]

Re: Optimizing existing Perl code (in practise)

by Abigail-II (Bishop) on Aug 19, 2002 at 15:37 UTC

A peave of mine is when I see people who write Perl scripts and all the work in them is done by using system() calls. What is the point in writing a Perl script if you're not going to use the Perl functions? You might as well write the thing in shell.

Your point of view is quite opposite of the viewpoint of "code reuse". Unix comes with a handy toolkit. There's nothing wrong with using it.

You might as well write the thing in shell.

Spawning system calls does take more resources and thus it behooves the Perl programmer to try and code the functionality they want using Perl built-ins and modules.

Really, what's the point of writing:

    my $text = do {
        open my $fh => $file or die "open: $!\n";
        local $/;
        <$fh>;
    };
[download]

    my $text = `cat $file`;
[download]

    system mkdir => -p => $dir;
[download]

Of course, making use of external programs makes you less portable, but so does making use of modules not coming with the core. And many programs dealing with file names aren't portable anyway. Do you always use File::Spec when dealing with file names? I certainly don't.

I'm not claiming everything should be done with system. Not at all. But I don't thing that everything that can be done in Perl should, and that therefore system should be avoided.

Abigail

[reply]
[d/l]
[select]

Re: Re: Optimizing existing Perl code (in practise)

by snafu (Chaplain) on Aug 19, 2002 at 18:13 UTC

Re: Optimizing existing Perl code (in practise)

by Abigail-II (Bishop) on Aug 20, 2002 at 10:15 UTC

Some notes below your chosen depth have not been shown here

Re: Re: Re: Optimizing existing Perl code (in practise)

by dmmiller2k (Chaplain) on Aug 21, 2002 at 03:11 UTC

Re: Re: Optimizing existing Perl code (in practise)

by sauoq (Abbot) on Aug 19, 2002 at 22:34 UTC

Re^2: Optimizing existing Perl code (in practise)

by Aristotle (Chancellor) on Aug 20, 2002 at 12:25 UTC

Re: Re: Optimizing existing Perl code (in practise)

by smalhotra (Scribe) on Aug 19, 2002 at 16:16 UTC

Re: Optimizing existing Perl code (in practise)

by Abigail-II (Bishop) on Aug 19, 2002 at 16:22 UTC

Some notes below your chosen depth have not been shown here

Re^2: Optimizing existing Perl code (in practise)

by Aristotle (Chancellor) on Aug 19, 2002 at 10:18 UTC

my $time1 = strftime "%H:%M:%S", localtime;

s/localtime/time/

Makeshifts last the longest.

[reply]
[d/l]

Re: Re^2: Optimizing existing Perl code (in practise)

by Hrunting (Pilgrim) on Aug 19, 2002 at 13:35 UTC

You mean s/localtime/time/ of course.

I sure hope he doesn't. From the POSIX perldoc page:

Synopsis:
  strftime(fmt, sec, min, hour, mday, mon, year, wday = -1, yday = -1,
+ isdst = -1)
[download]

[reply]
[d/l]

Re: Optimizing existing Perl code (in practise)
by gmpassos (Priest) on Aug 19, 2002 at 11:47 UTC

To win speed, you can make tests of your code, specially inside loops, peaces that will be runned a lot of times, to find the best way to write it! Here are some tips:

Variables:
Don't use:
$var = $var . "add" ;
The best way is:
$var .= "add" ;
The first way (wrong) will rewrite all the variable in the memory, the second will only add the new data. Use the same idea for: += , -= , *= , /=

For subs use the content of the @_, specially for big data sent to the function. If you want speed use first the @_[0], then if you need to change the data inside @_[0], you use my ($var) = @_ ;, and if you have big data you use the "shift".
Don't use for big data:
sub { my ($var1,$var2) = @_ ; }
The best way is to use the @_[0] it self or the shift:
sub { my $var1 = shift ; my $var2 = shift ; }
* If you use @_[?] you can't modifie it, you need to past to a $scalar.

If you have a loop (while,for,foreach) that will be runned a lot of times, try to not use the my inside it:
Normal way: for(0..10) { my $var = $_ ; }
Faster:
my $var ;
for(0..10) { $var = $_ ; }
* Of course this will only improve speed if you try to make the my outside for all the variables, in other words for bigger codes inside the loop.

Don't use local(), my() is faster! The command local() in the begin of perl was used like my, but now it's only good if you want to make local *HANDLES, not variables.

Try to use the variables in this order: $scalar, @array, %hash. Some thimes we use %h or @a and they aren't needed, but they are more slower than $s and use more memory, specially %h!

About regular expressions (RE), use it only when it's needed! Dont make this: if($var =~ /x/) if you can do if($var eq 'x'). But some times RE can be faster than bigger codes, the best way to chose is test the 2 codes.

But always think that any tip here will improve some microseconds for you. Only spend time improving speed in the peaces of your code that really need! Always try to use the resources of core, don't remake things that can be made by Perl it self.

"The creativity is the expression of the liberty".

[reply]
[d/l]
[select]

Re: Re: Optimizing existing Perl code (in practise)

by RMGir (Prior) on Aug 20, 2002 at 12:18 UTC

For subs use the content of the @_, specially for big data sent to the function:
Don't use:
sub { my ($var1,$var2) = @_ ; }
The best way is to use the @_[0] it self or the shift:
sub { my $var1 = shift ; my $var2 = shift ; }
* If you use @_[?] you can't modifie it! Use shift if you need to write to the var.

I was pretty sure that was wrong when I read it, so I whipped out Benchmark:

#!/usr/bin/perl -w

use strict;
use Benchmark qw(cmpthese);

sub shifter {
    my $a=shift;
    my $b=shift;
    my $c=shift;
    my $d=shift;
    my $e=shift;
    my $f=shift;

    return $a*$b*$c*$d*$e*$f;
}

sub assigner {
    my ($a,$b,$c,$d,$e,$f)=@_;

    return $a*$b*$c*$d*$e*$f;
}

sub direct {
    return $_[0]*$_[1]*$_[2]*$_[3]*$_[4]*$_[5];
}

cmpthese(-5,
        {
             'shifter' => sub {shifter(1,2,3,4,5,6);},
             'assigner' => sub {assigner(1,2,3,4,5,6);},
             'direct'   => sub {direct(1,2,3,4,5,6);},
         }
     );
[download]

$ perl testSubs.pl
Benchmark: running assigner, direct, shifter, each for at least 2 CPU 
+seconds...

  assigner:  0 wallclock secs ( 2.06 usr +  0.02 sys = 2.08 CPU) @ 384
+577.33/s
(n=800690)
    direct:  3 wallclock secs ( 2.04 usr +  0.00 sys = 2.04 CPU) @ 629
+222.22/s
(n=1285501)
   shifter:  2 wallclock secs ( 2.09 usr +  0.00 sys = 2.09 CPU) @ 294
+563.31/s
(n=616521)
             Rate  shifter assigner   direct
shifter  294563/s       --     -23%     -53%
assigner 384577/s      31%       --     -39%
direct   629222/s     114%      64%       --
[download]

[reply]
[d/l]
[select]

Re: Re: Re: Optimizing existing Perl code (in practise)

by gmpassos (Priest) on Aug 22, 2002 at 18:52 UTC

The "shift" options is good to use when you send big data to the function! The process of the command is not fast, because it need to cut the value from the array, reorder the array, and create and save to a scalar variable! "shift" is good to use for big data because you don't leave in the memory the data 2 times! You just move to the scalar! If you want speed use first the @_[0], then if you need to change the data inside @_[0], you use my ($var) = @_ ;, and if you have big data you use the "shift".

"The creativity is the expression of the liberty".

[reply]
[d/l]
[select]

Re: Re: Re: Re: Optimizing existing Perl code (in practise)

by RMGir (Prior) on Aug 22, 2002 at 19:00 UTC

Re: Re: Re: Re: Re: Optimizing existing Perl code (in practise)

by gmpassos (Priest) on Aug 23, 2002 at 03:26 UTC

Some notes below your chosen depth have not been shown here

Re: Optimizing existing Perl code (in practise)
by JaWi (Hermit) on Aug 19, 2002 at 10:05 UTC

Greets to all,

-- JaWi

"A chicken is an egg's way of producing more eggs."

[reply]

Re: Optimizing existing Perl code (in practise)
by feloniousMonk (Pilgrim) on Aug 19, 2002 at 18:02 UTC

my @a = ();
if ( $foo =~ /^(\d+)\s+(\w+)\s*$/ ) {
    @a = ($1, $2);
}
[download]

my @a = split (/\s+/, $foo);
[download]

[reply]
[d/l]
[select]

Re: Re: Optimizing existing Perl code (in practise)

by Anonymous Monk on Aug 19, 2002 at 18:48 UTC

Those two code snippets are not at all similar in function, so benchmarking them is useless.

[reply]

Re: Re: Re: Optimizing existing Perl code (in practise)

by feloniousMonk (Pilgrim) on Aug 20, 2002 at 14:04 UTC

[reply]

Re^4: Optimizing existing Perl code (in practise)

by Aristotle (Chancellor) on Aug 20, 2002 at 15:27 UTC

Re: Re: Re: Re: Optimizing existing Perl code (in practise)

by Anonymous Monk on Aug 20, 2002 at 15:57 UTC

Re: Optimizing existing Perl code (in practise)
by thoglette (Scribe) on Aug 20, 2002 at 12:29 UTC

Write it and optimise only if it needs it
Get your algorithms right first

Case in point - on a recent project with over 1/2Mbyte of script and about 400 'instances' two 'instances' ran far too slowly. Most 'instances' ran in under 10 seconds while these two required 60 minutes, which was unacceptable.
An analysis (See comments on monitoring) showed that we had the following:

while(1)
{
   $thing = new thing;
   $thing->method(getc());
   print $thing->result();
   $thing->DESTROY;
}

new

result

method

So, about 120 lines of code (and about 20 @INC function calls) to do 10 lines of code.

Time for some faster, locally optimised code AND VERY LOUD COMMENTS. Both in the local code and in the class which was being 'broken'. Nett result was a run time about 10 seconds. Which was acceptable for this project.
--
Butlerian Jihad now!

[reply]

Re: Optimizing existing Perl code (in practise)
by pingo (Hermit) on Aug 19, 2002 at 14:06 UTC

For my part, I don't do much optimizing. Instead, I rely on fastcgi to make my perl scripts fast enough (of course, this only applies to cgi).

[reply]


go ahead... be a heretic
	PerlMonks