Hi Behrooz,
I have another question: did you align the reads against the reference genome (or DNA contigs)?
Hi,morex_contig_2669042 protein_coding exon 2 251 . - . gene_id "MLOC_42952"; transcript_id "MLOC_42952.1"; exon_number "1"; seqedit "false";
I check the file Hordeum_vulgare.sam, the alignment is still against the cDNA sequences.
If you check the gtf file, you will see the following lines:
morex_contig_2669042 protein_coding CDS 16 249 . - 0 gene_id "MLOC_42952"; transcript_id "MLOC_42952.1"; exon_number "1"; protein_id "MLOC_42952.1";
morex_contig_2668822 protein_coding exon 118 275 . + . gene_id "MLOC_42947"; transcript_id "MLOC_42947.1"; exon_number "1"; seqedit "false";
morex_contig_2668822 protein_coding CDS 118 273 . + 0 gene_id "MLOC_42947"; transcript_id "MLOC_42947.1"; exon_number "1"; protein_id "MLOC_42947.1";
morex_contig_2668583 protein_coding exon 3 231 . + . gene_id "MLOC_42936"; transcript_id "MLOC_42936.1"; exon_number "1"; seqedit "false";
In which morex_contig_2669042 is a DNA sequence, MLOC_42952.1 is a cDNA sequence. However, the SAM header looks as follows:
@HD VN:1.0 SO:unsorted
@SQ SN:MLOC_7.2 LN:255
@SQ SN:MLOC_16.1 LN:474
@SQ SN:MLOC_30.1 LN:444
@SQ SN:MLOC_31.1 LN:198
@SQ SN:MLOC_44.1 LN:402
which means that you aligned against the cDNAs, not the DNA sequences.