Beefy Boxes and Bandwidth Generously Provided by pair Networks
Don't ask to ask, just ask
 
PerlMonks  

Re^2: Behavior of File::Find's preprocess and glob

by hsinclai (Deacon)
on Feb 04, 2005 at 02:13 UTC ( #427902=note: print w/replies, xml ) Need Help??


in reply to Re: Behavior of File::Find's preprocess and glob
in thread Behavior of File::Find's preprocess and glob

If you do filter our directory names in the "wanted" function then you'll prevent File::Find from recursing into those directories.
Did you mean filter out the directory names in "preprocess" (as opposed to the "wanted" function) - That is why the recursion stopped! I see..

.. it still doesn't answer the question as to why the "preprocess" glob of *.pl *.txt in my earlier snippet returns the directory name.. but thanks, I do see now how I was blocking File::Find from recursing by removing directories from the what "preprocess"'s return list ..

Although your code rework works perfectly, the same result can be gotten with just doing directory and file-extension pattern matching in "wanted".. without bothering with a "preprocess" call ... so for example what if there were several hundred subdirectories which I knew did not contain *.pl or *.txt - I was trying to find a way that "wanted" could skip the needless processing/recursion..
  • Comment on Re^2: Behavior of File::Find's preprocess and glob

Replies are listed 'Best First'.
Re^3: Behavior of File::Find's preprocess and glob
by edoc (Chaplain) on Feb 04, 2005 at 02:30 UTC

    .. it still doesn't answer the question as to why the "preprocess" glob of *.pl *.txt in my earlier snippet returns the directory name

    Actually it doesn't. Your "wanted" function prints it when it is called with the directory name before "preprocess" is called with the directory contents. The first thing I did with your code was to add 'print "GLOBBING\n";' to the preprocess function. The directory name is printed before GLOBBING.

    Sounds like you want to change the $filespec regex to only exclude directories you don't wish to traverse. Then filter out the rest of the directories in the "wanted" function. You could create a list of directories as keys to a hash and have "preprocess" skip any directories in the list.

    Update: Added code

    #!/usr/bin/perl -w use strict; use Cwd; use File::Find; my $filespec = qr/\.(?:txt|pl)$/; my %dirskip = ( 'path/to/dir' => 1, 'path/to/another/dir' => 1 ); my $dir = $ARGV[0] || getcwd(); find( { wanted => \&find_function, preprocess => \&globber }, $dir ); sub find_function { return if $File::Find::name !~ /$filespec/ || (-d $File::Find::name) +; print $File::Find::name.$/; } sub globber{ my @files; foreach(@_){ push(@files, $_) unless $dirskip{$File::Find::name}; } return @files; }

    cheers,

    J

      Thank you- that is cool! At one point I had a print statement going in the sub - but totally missed it somehow.. thanks for the clarification!!

      I'm throwing "preprocess" out the window now..

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://427902]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (6)
As of 2022-05-16 08:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Do you prefer to work remotely?



    Results (62 votes). Check out past polls.

    Notices?