Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight
 
PerlMonks  

comment on

( #3333=superdoc: print w/replies, xml ) Need Help??
I am wanting to process the following tagged text file to use with Adobe Indesign CS. The data is about a given horse race.
1 p FRODO HEST q 1r Fronts C J Campbell,Mrs M J Pault B G 3yrs Earl-Amy Hest Driver:...........u Trainer: Phil Williamson, Oamaruvx Black, Blue & White Chequered Sashw oFirst Start at a Race Meeting. Qualified: 25/04/2006y y 2 p SMOKEY MICKPOT q 2r Front s (1 Starts: 0(0) - 0 - 0 - Lt$0($0) - W$0($0))t W R Low,E T Murphyt BR G 3yrs Wrestle-Tough Whiz Driver:...........u Trainer: Wayne Low, Waimatev (Last Driver:Wayne Low)x Green, Brown & White Striped Braces, White Sleevesw 0 o15Jun06 Forbury Pk 2200 Std Ft 11 of 14 Wnr:Daddy Warbucks +y  y
I want to read this data into variables so I can create a new tagged text file to be imported into InDesign. InDesign has its own tagged format.

Desired result
$horse = "FRODO HEST";
$postion = "1";
$detail = "FRONT";
$history = "1 Starts: 0(0) - 0 - 0 - Lt$0($0) - W$0($0))";
$
etc Heres a sample of my previous program I used to process each line
if ($line =~ /v/) { # B M 5yrs Straphanger-Amy Hest D +river:...........u chomp $line; my ($trainer, $last_driver) = split //, $line; # Remove whitespaces from start $trainer =~ s/^\s+//; # Remove the first chars of variable $last_driver = substr($last_driver, 1); # Remove whitespaces from start $last_driver =~ s/^\s+//; print OUTPUT "<ParaStyle:><pHyphenationLadderLimit:0><pHyp +henationZone:22.700000><pTabRuler:28.350000\,Left\,.\,0\,\;251.050000 +\,Right\,.\,0\,\;><pMaxWordSpace:1.500000><pMinWordSpace:0.750000><pM +axLetterspace:0.250000><pMinLetterspace:-0.050000><pKeepFirstNLines:1 +><pKeepLastNLines:1><pRuleAboveColor:Black><pRuleAboveTint:100.000000 +><pRuleBelowColor:Black><pRuleBelowTint:100.000000><cSize:5.500000><c +BaselineShift:12.000000><cLeading:5.500000><cFont:Switzerland> $tr +ainer $last_driver <cSize:><cBaselineShift:><cLeading:><cFont:><pHyphenationLadderLimit:> +<pHyphenationZone:><pTabRuler:><pMaxWordSpace:><pMinWordSpace:><pMaxL +etterspace:><pMinLetterspace:><pKeepFirstNLines:><pKeepLastNLines:><p +RuleAboveColor:><pRuleAboveTint:><pRuleBelowColor:><pRuleBelowTint:>" +; }
The "OUTPUT" is to the other text file I mentioned earlier. One of the problems I am facing is that the source tag file can often change (E.g. an extra field added). I am wanting some advise on a better approach to handle the processing. Can any one help me with a clear way to subtract this data into appropriate variables or arrays using pattern matching. Will I always require some hard coding for the type of tags used. I am wanting to be able to have these variables at my finger tips, so I can prompt the user to choose an approprate layout.
I think i've blabbed on enough by now
Thanks

In reply to Processing tagged text by diggernz

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others browsing the Monastery: (4)
As of 2022-01-17 23:02 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    In 2022, my preferred method to securely store passwords is:












    Results (52 votes). Check out past polls.

    Notices?