Hi Alex,
I have received an updated GFF files for my organisms, but no changes in ref genome. So, I am having some debate if I should rerun my analyses with new GFF (I had used GTF in previous run). I had some questions and have answered it from my experience on bioinformatics. But, could you verify and give some suggestions:
1) Is there any differences in outcome when using GTF vs. GFF.
I think there will be no difference in alignement, since STAR aligns first to the reference genome and if transcriptomeBAM is need it will then search for corresponding alignement (from genome) with gene/exon boundries in gtf/gff files. But, would there be any advantage in using GTF vs. GFF files.
2) Also, how much alignment differences in the genome could we expect if we use two different version of GTF/GFF with the same reference genome and same RNAseq data.
- Because STAR first aligns the data with ref genome first, I don't expect to see (or very less difference) when using same ref genome, RNAseq data but different gtf.
- But, I expect to see changes in gtf/gff boundries if 'SJ.out.tab' files are supplied during 2nd-pass mapping. But, still the final SJ.out.tab boundries should be the same, because its same RNAseq data.