I have 2 files that need to have certain data merged and other data discarded. Each line of the file is 210 characters long and has no separators. Here is the jist of what I am attempting to do:
1. From file_1 select specific character positions and match this in file_2.
2. From the matched characters in file_2 select the whole line and change certain character positions to some predefined values.
3. Output this to file_3.
4. If you encounter a line in file_1 that starts with the number 2, this is a special line that must be copied to file_3 unchanged and in the order it was encountered in file_1.
In my code I do not want to open file_2 in memory as these files will have a few hundred thousand lines. I have tried to use Tie::File and read it one line at a time but I am encountering the following error:
Can't modify negation (-) in predecrement (--) at ./SS7Merge line 62, near "$output_line ="
Execution of ./SS7Merge aborted due to compilation errors.
Here is my code:
use File::Copy;
use Tie::File;
my($input_file1) = $ARGV[0];
my($input2) = $ARGV[1];
my($output_file) = $ARGV[2];
if ( !defined($input_file1) || !defined($input2) || !defined($output_f
+ile) ) {
print "Error: usage: ./SS7Merge input_file1 input_file2 output_fil
+e\n";
}
else {
# -----Backup the input files in case of error-----
copy( $input_file1, $input_file1 . ".bak" ) or
die "Could not backup file 1 $input_file1 to $input_file1.bak:
+ $!\n";
copy( $input2, $input2 . ".bak" ) or
die "Could not backup file 2 $input_file2 to $input_file2.bak:
+ $!\n";
# -----Attempt to open all of the files-----
open( INFILE1, $input_file1 ) || die( "Could not read input file 1
+ ($input_file1): $!" );
open( OUTPUT, "> " . $output_file ) || die( "Could not open output
+ file ($output_file): $!" );
# We are going to read file2 into an array. The file will not be loade
+d into memory which will improve processing of large files.
tie @input2, 'Tie::File', \*FH, or die "Problem tying file $input2: $!
+";
while (<INFILE1>) {
my $line = $_;
chomp($line);
# -----A line starting with a '2' is a header and is left unch
+anged
if ( $line !~ m/^2/ ) {
foreach $line2 (@input2) {
$date = substr( $line, 6, 6 );
$number_dialed = substr( $line, 29, 10 );
if ( index( $line2, $date ) != -1 and index( $line2, $
+number_dialed ) != -1 ) {
$record_type = substr( $line, 5, 2 );
# -----From File2-----
$carrier_info = substr( $line2, 44, 5 );
$destination_number = substr( $line2, 122, 10 );
$connect_time = substr( $line, 54, 6 );
$send_to_OCN = substr( $line, 186, 4 );
$record_type = "25";
$send_to_OCN = "2604";
-----Generate the output string-----
$output_line = substr( $line, 0, 4 ) . $record_typ
+e . $date . substr( $line, 12, 17 )
. $number_dialed . substr( $line, 39, 5 )
. $carrier_info . substr( $line, 49, 5 )
. $connect_time . substr( $line, 60, 62 )
. $destination_number . substr( $line, 132, 54
+ )
. $send_to_OCN . substr( $line, 190, 20 ) . "\
+n";
# -----Debug code. Add in if you are experiencing p
+roblems-----
# print OUTPUT $output_line;
# print STDOUT "Output " . ++$outputcount . "\n";
last;
}
}
}
else {
print OUTPUT $line . "\n";
}
}
# Untie the array before closing the file use
untie @input2;
# -----Close all of the files-----
close( INFILE1 );
close( OUTPUT );
}