757773 147 RG:Z:ref.CAb15_8Sep_E1
157421 147 XS:A:-
6176 147 XS:A:+
8569543 163 RG:Z:ref.CAb15_8Sep_E1
714631 163 XS:A:-
1589699 163 XS:A:+
7144052 339 RG:Z:ref.CAb15_8Sep_E1
830819 339 XS:A:-
1641485 339 XS:A:+
1049871 355 RG:Z:ref.CAb15_8Sep_E1
10422 355 XS:A:-
7391 355 XS:A:+
1049871 403 RG:Z:ref.CAb15_8Sep_E1
10422 403 XS:A:-
7391 403 XS:A:+
7144052 419 RG:Z:ref.CAb15_8Sep_E1
830819 419 XS:A:-
1641485 419 XS:A:+
8569543 83 RG:Z:ref.CAb15_8Sep_E1
714631 83 XS:A:-
1589699 83 XS:A:+
757773 99 RG:Z:ref.CAb15_8Sep_E1
157421 99 XS:A:-
6176 99 XS:A:+
My library in tophat terms is fr-first stranded. From these results, for my data, I don't think it is calling strands accurately.
Here is how I called STAR (@PG) from SAM header:
@PG ID:STAR PN:STAR VN:STAR_2.4.2a CL:STAR --runThreadN 8 --genomeDir /data/leelab/projects/common/mtrDmm10 --readFilesIn E1_R1.00.fastq E1_R2.00.fastq --outSAMstrandField intronMotif --outSAMattributes All --outSAMattrRGline ID:ref.CAb15_8Sep_E1 SM:p.all.nb --outFilterMismatchNmax 36 --clip3pNbases 100
I am stating the above because I need strand specific (XS:A:) fields in all my SAM reads. I was thinking of removing the ones that were generated and replacing them based on knowing library type and if the mapping of the reads were rc or not (along with which read). Your thoughts are welcome!
Regards,
Barry