Pathologically Eclectic Rubbish Lister | |
PerlMonks |
comment on |
( [id://3333]=superdoc: print w/replies, xml ) | Need Help?? |
Yes, but there is a fundamental difference...
The first script will only do MD5 hashes on files if there is more than one file with the same file size, then compares the MD5s for the files of that size. Yours MD5's *everything*, then compares *all* the MD5s. If a file has a unique filesize, it *can't* have a duplicate. Depending on the make up of the files, this can have a dramatic effect:
Results: The second script is four times slower than the first... Admittedly, if all your files were the same size there would be no difference, but in most cases, the first script will win. But hey... In reply to Re: Re: Find duplicate files.
by Anonymous Monk
|
|