Good Day bros. I was hunting around for some resources to parse Apache log files, and I ran across
this script which contains the following statement
my ($host,$date,$url_with_method,$status,$size,$referrer,$agent) = $
+line =~
m/^(\S+) - - \[(\S+ \+\d{4})\] "(\S+ \S+ [^"]+)" (\d{3}) (\d
++|-) "(.*?)" "([^"]+)"$/;
in an attempt to split apart a $line from the log file like
76.172.202.159 - - [31/Aug/2007:15:58:15 -0600] "GET / HTTP/1.1" 200 2
+9692 "http://www.paperbackswap.com/forum/view_topic.php?t=70235&ls="
+"Mozilla/5.0 (Macintosh; U; Intel Mac OS X; en-US; rv:1.8.0.12) Gecko
+/20070508 Firefox/1.5.0.12"
This didn't look right to me since the
$line =~ m/.../ part would evaluate to 1 rather than a list. I tried it in the debugger sure enough it didn't work.
This got me to thinking about whether there is a way to make it work. I looks like intent of it is for the parens in the regexp to capture the material of interest, but this would wind up in the special vars $1, $2, etc., rather than a list. Does anyone know how this could be made to work?