
You can grab all the possible values for intron and exon with your regex and then split them up.

Consider replacing your intron/exon elsif blocks with this:

    # new intron elsif block
    elsif (/\s+\/intron="(.+)"\n/) {
        # $1 holds everything between the quotes; split it on semicolons
        foreach my $item (split /;/, $1) {
            print OUT "Intron\t $item\n";
        }
    }

I replaced all the *s with +s. As I understand it, + requires at least one character where * also accepts none, so the pattern is stricter about what it matches (I'm no regex guru, though :). The regex captures everything between the double quotes in $1.
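To see what the switch from * to + changes, here is a small sketch. The sample lines are my guess at what the data looks like (they are not taken from your file), and capture_with is just a made-up helper for the comparison:

```perl
use strict;
use warnings;

# Hypothetical helper: return the first capture group, or undef if no match.
sub capture_with {
    my ($re, $line) = @_;
    return $line =~ $re ? $1 : undef;
}

# Assumed sample lines; the second one has an empty value.
my @lines = (
    qq{     /intron="1-48;334-385"\n},
    qq{     /intron=""\n},
);

my $plus = qr/\s+\/intron="(.+)"\n/;   # needs at least one character in the quotes
my $star = qr/\s*\/intron="(.*)"\n/;   # also accepts an empty value

for my $line (@lines) {
    for my $pair ([ '+', $plus ], [ '*', $star ]) {
        my ($name, $re) = @$pair;
        my $got = capture_with($re, $line);
        print "$name: ", (defined $got ? "captured '$got'" : "no match"), "\n";
    }
}
```

With +, the line with the empty value does not match at all, so the elsif block is skipped for it; with *, it matches and $1 is the empty string.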

This will print out, based on your input data:

Intron   1-48
Intron   334-385

Now that they are separated, you can do whatever you want with them.
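For example, here is a rough sketch of one thing you could do with them: break each range into its start and end and work out a length. I'm assuming the values look like "1-48;334-385" (that is what the output above suggests) and that the coordinates are inclusive; intron_ranges is a name I made up, not something from your script.

```perl
use strict;
use warnings;

# Hypothetical helper: turn one feature line into a list of [start, end] pairs.
sub intron_ranges {
    my ($line) = @_;
    return unless $line =~ /\s+\/intron="(.+)"\n/;

    my @ranges;
    for my $item (split /;/, $1) {
        # Skip anything that is not a plain "start-end" pair.
        next unless $item =~ /^\s*(\d+)-(\d+)\s*$/;
        push @ranges, [ $1, $2 ];
    }
    return @ranges;
}

# Assumed sample line, modelled on the output shown above.
my @ranges = intron_ranges(qq{     /intron="1-48;334-385"\n});

for my $r (@ranges) {
    my ($start, $end) = @$r;
    my $length = $end - $start + 1;    # inclusive coordinates assumed
    print "Intron\t$start-$end\tlength $length\n";
}
```

Returning array references keeps start and end together, so you can sort or filter the ranges later without parsing the strings again.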

Ryan

In reply to Re: about regular expression by ryan
in thread about regular expression by agustina_s



There's more than one way to do things
	PerlMonks