Hi Alex,
I have the same problem. My data is human-herat failure PE sequences with four separate .fastq files for the first read and four .fastq files for th second read for each patient:
P1.1.R1.fastq.gz
P1.2.R1.fastq.gz
P1.3.R1.fastq.gz
P1.4.R1.fastq.gz
P1.1.R2.fastq.gz
P1.2.R2.fastq.gz
P1.3.R2.fastq.gz
P1.4.R2.fastq.gz
I’ve run the following script for all patients’ fastq files and it’s just working fine except for one patient:
STAR\
--runThreadN 32\
--genomeDir /HF2-STAR/genome\
--sjdbGTFfile /Homo_sapiens.GRCh38.79.gtf\
--readFilesIn P1.1.R1.fastq.gz, P1.2.R1.fastq.gz, P1.3.R1.fastq.gz, P1.4.R1.fastq.gz P1.1.R2.fastq.gz, P1.2.R2.fastq.gz, P1.3.R2.fastq.gz, P1.4.R2.fastq.gz\
--readFilesCommand zcat\
--outFilterMultimapNmax 20\
--outSAMtype BAM SortedByCoordinate\
--quantMode TranscriptomeSAM GeneCounts\
--outFilterMatchNminOverLread 0\
--outFilterScoreMinOverLread 0\
--outFileNamePrefix "p1"\
For one patient I got the following error:
EXITING because of FATAL ERROR in reads input: quality string length is not equal to sequence length
@NB501373:8:HTTKYBGXX:4:22403:20084:1317
GCGATCTTTTTCAATACAATTTACACCCTCATCCCCATTTCCAGTCTGATTATACAAGTGCTAAGTGGCAGAA
@NB501373:8:HTTKYBGXX:4:13504:19963:7094 47732852 N 3
SOLUTION: fix your fastq file
I tried to remove these lines from fastq files but then I got another error message:
EXITING because of FATAL ERROR: Read1 and Read2 are not consistent, reached the end of the one before the other one
SOLUTION: Check you your input files: they may be corrupted
I checked the number of reads in two files with $ wc -l r1.fq r2.fq and was the same (both were 56759618—not divisible by 4 though).
Right now I am stuck with this patient. Any help is much appreciated.
Thank you,
Nasima