Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re^4: intron length

by bioinformatics (Friar)
on Feb 28, 2013 at 04:46 UTC ( [id://1020989]=note: print w/replies, xml ) Need Help??


in reply to Re^3: intron length
in thread intron length

Ok, I missed parts at the bottom of the first post. Does your fasta file have transcript sequences? I'm assuming then you have transcript sequences that contain intronic regions that aren't spliced out? Or are these simply splicing variants and alternate exons? Either way, you can use regular expressions to grab the exon lengths in the cufflinks file, and you can compare this to the sequence length from the fasta file or other transcipt information files from the UCSC genome browser (or the GTF file that you are likely using from Illumina via the cufflinks webpage to annotate transcripts), etc. Again, you can store the transcript information in a hash and use the gene symbol or accession number as the key so that you are comparing the correct things.

Sorry for missing that, I need more coffee it seems :).

Bioinformatics

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1020989]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others romping around the Monastery: (4)
As of 2024-03-28 22:28 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found