Hi,
I am trying to generate an index for the human genome (GRCh38) in which all As have been converted to Gs with an ensembl annotation gtf file.
The command I use is as follows:
STAR --runMode genomeGenerate --runThreadN 48 --genomeDir "STARIndex" --genomeFastaFiles Homo_sapiens.GRCh38.dna.primary_assembly_AtoGconverted.fa --sjdbGTFfile Homo_sapiens.GRCh38.108.gtf --sjdbOverhang 100
the last line in the log file is: Finished inserting junction indices. After this it gets stuck (even after running for more than 12 hours).
The following command works fine on the original un-AtoG-converted genome fasta:
STAR --runMode genomeGenerate --runThreadN 48 --genomeDir "STARIndex" --genomeFastaFiles Homo_sapiens.GRCh38.dna.primary_assembly.fa --sjdbGTFfile Homo_sapiens.GRCh38.108.gtf --sjdbOverhang 100
This completes in about 1 hour.
The following without the gtf file also works fine:
STAR --runMode genomeGenerate --runThreadN 48 --genomeDir "STARIndex" --genomeFastaFiles Homo_sapiens.GRCh38.dna.primary_assembly_AtoGconverted.fa
Any suggestions as to why this might not work?
Thanks,
Harry