no exon lines in the GTF file

1,122 views
Skip to first unread message

Janet

unread,
Jun 5, 2017, 12:33:37 PM6/5/17
to rna-star
Hi,

I am trying to do STAR indexing of my reference genome (only including 72 contigs from NCBI). I downloaded a gbff file from NCBI as well. I converted gbff to GTF with a python script. But there are no exon lines in the GTF file, only gene and CDS lines. So I got the error message "no exon lines in the GTF file".

My reference is only assembled into contigs, and annotated into proteins. I do no have any other information. So my questions are:
1) can STAR consider CDS lines? If not, can I simply replace CDS to exon in my GTF file?
1) If still no, can I use STAR at all? I would like to generate a gene table where the numbers of reads mapped to reference genes are presented.

Thank you!

Best,
Janet

Janet

unread,
Jun 5, 2017, 12:36:55 PM6/5/17
to rna-star
I attached my reference genome and gtf files here for your information. Thanks!

-Janet
Escherichia_coli_ms107-1.ADWV01.1.contigs_new.fasta
Escherichia_coli_ms107-1.ADWV01.1.gtf

Alexander Dobin

unread,
Jun 5, 2017, 5:09:34 PM6/5/17
to rna-star
Hi Janet,

you can rename the "CDS" into "exon", or you can use --sjdbGTFfeatureExon CDS option to tell STAR that your exons are called "CDS".
When converting GFF to GTF, please make sure that each of the CDS lines has the "transcript_id" and "gene_id" tags.

Cheers
Alex

Janet

unread,
Jun 6, 2017, 10:38:31 AM6/6/17
to rna-star
Hi Alex,

It works very well! Thank you!

Best,
Janet
Reply all
Reply to author
Forward
0 new messages