Hi Nicolas,
I believe this is a problem with HTseq not recognizing the latest SAM specifications, which introduced the the "B" type of the SAM attributes.
Note, that you would need the latest version of samtools, 0.1.18, for dealing with these attributes. The latest version of Picard also validates these files without
Cufflinks has the same problem with these attributes.
If you are planning to use the same files with HTseq or cufflinks,
do *not* use the option --outSAMattributes All .
Note that these extra-attributes are not required by Cufflinks or HTseq. They may be convenient, but, in principle, this information can be extracted from the SAM alignments and genome sequence:
jM:B:c,M1,M2,… Intron motifs for all junctions (i.e. N in CIGAR): 0: non-canonical; 1: GT/AG, 2: CT/AC, 3: GC/AG, 4: CT/GC, 5: AT/AC, 6: GT/AT. If splice junctions database is used, and a junction is annotated, 20 is added to its motif value.
jI:B:I,Start1,End1,Start2,End2,… Start and End of introns for all junctions (1-based)
I will try to contact Cufflinks and HTseq authors to request that they comply with the latest SAM specifications.
Cheers
Alex