Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Re: text extraction question

by throop (Chaplain)
on Dec 05, 2006 at 21:19 UTC ( [id://587983]=note: print w/replies, xml ) Need Help??


in reply to text extraction question

I'm going to assume that everything but the anglebrackets is reliably alphanumeric. I shoved $inputexample into $_ to unclutter the code.
use strict; use warnings; my $templateformat = 'w<NM>b<NM>cm<CH>sw<SW>'; $_ = 'w8b8cm512swno'; my(@values, @names, $val); # In the example, @h will be ('w', 'NM', 'b', 'NM', 'cm', 'CH', 'sw', +'SW') my @h = $templateformat =~ /(\w+)<(\w+)>/g; for (my $ix=0; $ix + 1 < @h; $ix += 2){ push(@names, $h[$ix +1]); # Match up to the next piece of template if($ix + 2 < @h){ ($val, $_) = /$h[$ix](.+)($h[$ix +2].+)/ or die 'bad middle'} # or match to the end if there's no next piece else{ ($val) = /$h[$ix](.+)/ or die 'bad end'}; push(@values, $val)}; local($,, $\ ) = ("\t", "\n"); print(@names); print(@values);
gives
NM NM CH SW 8 8 512 no
You might want to add more robust error-checking. I could have shoved the push into @values inside the if/else. I'd have saved creating the $val variable but I'd have duplicated the push.

throop

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://587983]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others taking refuge in the Monastery: (2)
As of 2024-04-24 23:46 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found