
--
You received this message because you are subscribed to the Google Groups "pasapipeline-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pasapipeline-us...@googlegroups.com.
To post to this group, send email to pasapipel...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/pasapipeline-users/f87a7cd4-7819-4244-b9be-86eee4ee3ebe%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hi Nick,From what I could tell, the TAIR10_GFF3_genes.gff should have loaded ok with the genome 'TAIR10_chr_all.fas'.In the image you provided, the top 2 entries have names 'GO104.2' and '.GO104.1'. Where did these identifiers come from?Perhaps try reloading the TAIR10_GFF3_genes.gff file and rerunning the annotation comparison step.~brian--On Tue, Apr 21, 2015 at 5:48 PM, Nick Schurch <nicks...@gmail.com> wrote:Sure, this is they:--On 21 April 2015 at 20:30, Brian Haas <bh...@broadinstitute.org> wrote:Hi Nick,The pasa gene validation script needs to be updated. Also, I should make sure that PASA is compatible with both of your gff3 files. Can you send me URLs to the gff3 files you're using? I'll aim to resolve this.best,~brianCheers,
Nick Schurch
just poking this back up to the top. It'd be great to get peoples thoughts on these questions...
On Thursday, 23 April 2015 09:52:17 UTC+1, Nick Schurch wrote:Thanks for updating the gff utilities Brian, I'll give them a go. Do I need to rerun pasa completely, or just the annotation step?
As I said, originally I used the ensembl genome and gff and these don't have the 'Chr' on the start of the chromosome ids - that's why I adjusted the gff file from NCBI to reflect this. I'll give it a try again with the ensembl genome and gff and we'll see if that works.On the ncRNA issue, surely the first stage of PASA - the merging of compatable transcript models - will work just fine for any and all the transcript models from the transcriptome assembly? I can see how that wouldn't be the case for the annotation update stage though. Any idea on when/if PASA will be updated to increase support for ncRNAs?
The other thing I'd like to get an opinion on is how you thing ribo-minus data, as opposed to polyA-pulldown data will effect its performance. I have the former because we wanted to see ncRNAs - but it also means we see a considerable minority of gene expression as pre-mRNAs with incomplete splicing. As such, we get quite a few transcript models that cover annotated introns. Do you anticipate that these will confuse PASA at either stage? How wary should I be of the merging process on these?