Hi,
The transcript errors you are seeing are due to the gff3 estructure. Majiq assumes that the gff3 is a tree like structure where
Gene
/ … \
Transcript1 transcript N
/ … \ / … \
Exon 1.1 exon1.m exonN.1 exon N.p
That is specified by the attributes Id and parentID in each line of the gff3.
If an exon row is found with a parentID that has not been found before, that error appears. Is not a big deal, it is just those exon definition are discarded. Take in care that some gff3 like the older versions of ensembl include the type in the name, like transcript:ENST00000431853. Check that the transcript keyword is included in the transcript ID row as well. We found some of these cases happening in ensemble, but they are few and it will not affect the overall run.
Jordi Vaquero
--
You received this message because you are subscribed to the Google Groups "majiq_voila" group.
To unsubscribe from this group and stop receiving emails from it, send an email to majiq_voila...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/majiq_voila/45d75478-c154-45e2-8c77-8f192745bd01n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/majiq_voila/8bf10a0d-afcc-461a-a05c-267abd9f012dn%40googlegroups.com.