comment on

I'm writing a parser for a specified format (so I'm stuck with the format). I have no doubt this will lead to many questions, but here's my first:

Given a string of comma separated elements, where an element can contain a function, and functions can have commas in their arguments, how do I best grab the elements?

After looking over Merlyn's nested C comment parser and The CSV parser from Mastering Regex, I have a working solution. I'm not convinced, however, that this is the easiest/best way to do it. Comments?

#!/usr/bin/perl

$teststr="blah,blah(blah,blah(blah,blah(blah))),blah";

#This is three elements: 
# blah
# blah(blah,blah(blah,blah(blah)))
# blah
# I don't have to worry about escaped parens, the file format forbids 
+it.

foreach (&parse_comma($teststr)){
  print "$_\n";   #This just proves that it works
}

sub parse_comma{ 
  my $commastr=shift;
  my @tags;
  my $count=0;
  my $carrystr="";

  foreach (split(/,/, $commastr)){
    $_=$carrystr.",".$_ if $carrystr;
    $count=s/\(/(/g;
    $count-=s/\)/)/g;
    if($count){
      $carrystr=$_;
    }else{
      $carrystr="";
      push @tags, $_;
    }
  }
  return @tags;
}
[download]

In reply to Balancing Parens by swiftone

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


Think about Loose Coupling
	PerlMonks