Hi Alex,
Thanks for this nice feature, it saves a very time consuming call to Picard. This introduces an issue for me though where FastQC will not parse the header correctly. It might be an issue with FastQC but I thought I would post it here as well:
Here is a header that is broken:
@PG ID:STAR PN:STAR VN:STAR_2.3.1z_r395 CL:/n/hsphS10/hsphfs1/chb/local/bin/STAR --runThreadN 1 --genomeDir /n/home05/kirchner/cache/bcbio-nextgen/tests/data/automated/tool-data/../../genomes/mm9/star --readFilesIn /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_1_fastq_trimmed.txt /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_2_fastq_trimmed.txt --outFileNamePrefix /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/align/Test1/1_110907_ERP000591 --outReadsUnmapped Fastx --outSAMstrandField intronMotif --outSAMunmapped Within --outSAMattrRGline ID:1 PL:illumina PU:1_110907_ERP000591 SM:Test1 --outFilterMultimapNmax 10 cl:/n/hsphS10/hsphfs1/chb/local/bin/STAR --genomeDir /n/home05/kirchner/cache/bcbio-nextgen/tests/data/automated/tool-data/../../genomes/mm9/star --readFilesIn /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_1_fastq_trimmed.txt /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_2_fastq_trimmed.txt --runThreadN 1 --outFileNamePrefix /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/align/Test1/1_110907_ERP000591 --outReadsUnmapped Fastx --outFilterMultimapNmax 10 --outSAMunmapped Within --outSAMattrRGline ID:1 PL:illumina PU:1_110907_ERP000591 SM:Test1 --outSAMstrandField intronMotif
The command line looks repeated twice. If I remove the second repetition (by deleting everything after the cl:):
@PG ID:STAR PN:STAR VN:STAR_2.3.1z_r395 CL:/n/hsphS10/hsphfs1/chb/local/bin/STAR --runThreadN 1 --genomeDir /n/home05/kirchner/cache/bcbio-nextgen/tests/data/automated/tool-data/../../genomes/mm9/star --readFilesIn /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_1_fastq_trimmed.txt /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/trim/1_110907_ERP000591_2_fastq_trimmed.txt --outFileNamePrefix /n/home05/kirchner/cache/bcbio-nextgen/tests/test_automated_output/align/Test1/1_110907_ERP000591 --outReadsUnmapped Fastx --outSAMstrandField intronMotif --outSAMunmapped Within --outSAMattrRGline ID:1 PL:illumina PU:1_110907_ERP000591 SM:Test1 --outFilterMultimapNmax 10
It works fine. I'm not sure what it is about the new option, if I run it without the --outSamAttr option it works fine as well. Is there any way that second cl: entry can get dropped in the final header?
Thanks, and thanks for keeping up with maintaining STAR.