Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight
 
PerlMonks  

Re^2: Extracting information from file to Hash

by bart (Canon)
on Jan 18, 2012 at 22:07 UTC ( [id://948632]=note: print w/replies, xml ) Need Help??


in reply to Re: Extracting information from file to Hash
in thread Extracting information from file to Hash

It would be nice if the regular expression could be made less specific, but some features of your data format make that tricky (e.g. the fact that the value following "=>" can be a zero-length string).
However, I think it's pretty much guaranteed that there will be whitespace between the key/value pairs. Yet, you use /\s*/. Also, nowhere do I see specified that there even may be whitespace around the "=>" — you just made that up. As this file looks to be computer generated, I sincerely doubt that this will ever be the case. Finally: the only place do I see whitespace inside a column value, is in the final column of the line: the timestamp.

In short: I think this regex will do:

/^Collection=>(\S*) \s+ ImageCount=>(\S*) \s+ Status=>(\S*) \s+ Missing=>(\S*) \s+ Modified=>(.*\S) /x

And if you do this:

my %r = /^ (Collection)=>(\S*) \s+ (ImageCount)=>(\S*) \s+ (Status)=>(\S*) \s+ (Missing)=>(\S*) \s+ (Modified)=>(.*\S) /x;
you even get a nice hash record out of it, even though it is restricted to one match per line (otherwise, when using /g you'd get list context, with a different behavior as a result.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://948632]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (3)
As of 2024-04-25 14:48 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found