Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re^4: Copying a file to a temporary file

by shay (Beadle)
on Jun 16, 2004 at 16:01 UTC ( [id://367285]=note: print w/replies, xml ) Need Help??


in reply to Re^3: Copying a file to a temporary file
in thread Copying a file to a temporary file

That doesn't look like a great way to choose a backup filename - the rename will succeed even for candidate backup filenames that exist (permissions permitting), so the backup has potentially just clobbered another file! (Or have I misunderstood you?)

Anyway, as I said, the backup filename is supplied by the caller of this code if a backup file is required. My real concern is what the best way to achieve the in-place edit via a temporary file is, possibly taking advantage of the given backup filename if one is given.

I like the idea of writing the processed data to a temporary file and then moving that back (either (1) by a rename or (2) by copying the contents), rather than my original idea of moving/copying the file to be edited and then writing the processed data back to it, so that the process can be easily re-run if it failed the first time.

However, both options (1) and (2) above have problems:

Option (1) goes something like this (return values obviously need checking, and there are some chmod games that can be played too, but this is the bare bones of it):

use File::Temp qw(tempfile); my $file = 'test.txt'; my($tmpfh, $tmpfile) = tempfile(); open my $fh, '<', $file; binmode $fh; while (<$fh>) { # Process $_ here print $tmpfh $_; } close $fh; close $tmpfh; rename $tmpfile, $file;

I can see two problems with that. Firstly, tempfile() was not called in scalar context so the temporary file will not be cleaned up if the program is interrupted or killed. (A $SIG{INT} handler could arrange for them to be cleaned up if interrupted, but not if the program is killed.) Secondly, while the rename itself is (normally) atomic, there is a race condition between the close and the rename - somebody else could potentially modify the file inbetween.

Option (2) looks like this (with the same caveats as before):

use Fcntl qw(:seek); use File::Temp qw(tempfile); my $file = 'test.txt'; my $tmpfh = tempfile(); open my $fh, '<', $file; binmode $fh; while (<$fh>) { # Process $_ here print $tmpfh $_; } close $fh; seek $tmpfh, 0, SEEK_SET; open my $fh2, '>', $file; binmode $fh2; print $fh2 $_ while <$tmpfh>; close $fh2; close $tmpfh;

This time, the temporary file's contents are written back to the original file without the temporary file having been closed, so there is no close/rename race condition. Also, tempfile() was called in scalar context so the temporary file will be cleaned up even if the program is killed (on Win32, at least, via the O_TEMPORARY flag that is used when opening the file). However, the process of copying the temporary file's contents back to the original file is no longer atomic, so if the program is interrupted during the final while loop then the original file will be left partially written.

So neither option is perfect. Which is approach is the lesser of the two evils? Is there another approach with none of these pitfalls?

Replies are listed 'Best First'.
Re^5: Copying a file to a temporary file
by BrowserUk (Patriarch) on Jun 16, 2004 at 17:53 UTC
    That doesn't look like a great way to choose a backup filename - the rename will succeed even for candidate backup filenames that exist (permissions permitting),...

    Really? I'm pretty certain that I have never used a filesystem that, regardless of permissions, would allow you to rename one file on top of an existing one. Which filesystem are you using?


    Examine what is said, not who speaks.
    "Efficiency is intelligent laziness." -David Dunham
    "Think for yourself!" - Abigail
    "Memory, processor, disk in that order on the hardware side. Algorithm, algoritm, algorithm on the code side." - tachyon
      I don't think the filesystem is relevant. It is true that on my Windows NTFS filesystem that the shell command "rename" will not rename OLDNAME to NEWNAME if NEWNAME already exists, but we're talking about Perl...

      The perlfunc manpage entry for Perl's built-in rename() function says:

      Changes the name of a file; an existing file NEWNAME will be clobbered.
      and it's quite correct (I just tried it to make sure!).

      Any more thoughts on my temporary file issue?

      - Steve

        I really never knew that. How dumb. Both my assumption in not checking what I knew could never be so and the logic that makes me wrong. You'll have to decide for yourself which is dumber:)

        It will be a while before I stop thinking about the logic that allows a rename function to become a "delete target and then copy over" command.

        You could consider this.

        #! perl -slw use strict; use Win32::API::Prototype; ApiLink( 'kernel32', 'UINT GetTempFileName( LPCTSTR lpPathName, LPCTSTR lpPrefixString, UINT uUnique, LPTSTR lpTempFileName )' ) or die $^E; my $tempFileName = ' ' x 254; my $path = '.'; my $prefix = 'temp0000'; GetTempFileName( $path, $prefix, 0, $tempFileName ) or die $^E; print $tempFileName;

        After the above code has been run, the an empty file with the name returned will have been created. You can then open and use it as you need to.


        Examine what is said, not who speaks.
        "Efficiency is intelligent laziness." -David Dunham
        "Think for yourself!" - Abigail
        "Memory, processor, disk in that order on the hardware side. Algorithm, algoritm, algorithm on the code side." - tachyon

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://367285]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others examining the Monastery: (7)
As of 2024-04-23 20:52 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found