Hi,
I
am want to do a 2Pass method alignment to human genome(Primary
assembly fasta). I have multiple samples and I'm confused(cant understand) with the help document/other sites on the steps involved.
This is what I understand
1) Create a Star index of the reference fasta file
2) Align each sample individually to the above index and obtain the SJ.out.tab
STAR --genomeDir Index/ --readFilesIn Sample1/R1.fastq Sample1/R2.fastq --outFileNamePrefix Sample1
:
:
:
STAR --genomeDir Index/ --readFilesIn SampleN/R1.fastq SampleN/R2.fastq -outFileNamePrefix SampleN
3) Creating new index with previous reference fasta and SJ.out.tab from all samples
STAR --runMode genomeGenerate --genomeDir NewIndex/ --genomeFastaFiles Reference.fa --sjdbFileChrStartEnd Sample1/SJ.out.tab ... SampleN/SJ.out.tab
4) Remapping to the New Index
STAR --genomeDir NEW_Index/ --readFilesIn Sample1/R1.fastq Sample1/R2.fastq --outFileNamePrefix Sample1
:
:
:
STAR --genomeDir NEW_Index/ --readFilesIn SampleN/R1.fastq SampleN/R2.fastq -outFileNamePrefix SampleN
Please do let me know if my above steps is correct?