It would be so grateful if someone can give me an insight.
So what I am doing in here is running bcl2fastq in nextflow and pass the result fastq.gz files to fastqc process.
However, I keep getting an error message that missing output files even though I can see that files were generated correctly after bcl2fastq.
-------------------------------------------------------------------------------------------------------------
Error message:
[cf/0f2116] process > bcl2fastq [100%] 1 of 1, failed: 1 ✘
[- ] process > fastqc -
Something when wrong, look up the log
Error executing process > 'bcl2fastq'
Caused by:
Missing output file(s) `*.fastq.gz` expected by process `bcl2fastq`
-------------------------------------------------------------------------------------------------------------
Nextflow script:
$params.seq_dir = some directory (/net/vol9/seqdir/BRCA1/nameofseqdir)
$params.out_dir = some directory (/net/vol1/data/nextflow_cluster_test/nobackup)
$params.sample_sheet = some directory = (/net/vol1/data/samplesheet.csv)
(These three parameters are defined in params.config)
process bcl2fastq {
publishDir "$params.out_dir/bcl2fastq/", mode: 'copy'
echo "bcl2fastq is running"
output:
path "*.fastq.gz", emit: bcl2
"""
#!/usr/bin/env bash
bcl2fastq \
-R $params.seq_dir -o $params.out_dir/bcl2fastq \
--sample-sheet $params.sample_sheet --interop-dir $params.out_dir/bcl2fastq \
--no-lane-splitting --use-bases-mask Y*,I*,I*,Y* \
--minimum-trimmed-read-length 0 --mask-short-adapter-reads 0
"""
}
process fastqc {
publishDir "$params.out_dir/bcl2fastq/fastqc_out", mode: 'copy'
input:
path bcl2
"""
fastqc \
$bcl2 -o $params.out_dir/bcl2fastq/fastqc_out -t 4
"""
}
workflow {
main:
bcl2fastq()
fastqc(bcl2fastq.out)
}
-------------------------------------------------------------------------------------------------------------