params.index='s3://bucket-name/genome/index/path'
params.library='s3://bucket-name/library/file/path/library.csv'
params.featureref='s3://bucket-name/featureref/file/path/feature-ref.csv'
process cellranger_count {
publishDir "$params.outdir/count" , mode : 'copy' , overwrite : false
input:
path genome from params.index
path libraryfile from params.library
path featurefile from params.featureref
output:
file '*'
file '*' into seurat_ch
"""
echo ${libraryfile}
/docker_main/cellranger-5.0.1/cellranger count \
--id=${
params.id} \
--transcriptome=${genome} \
--libraries=${libraryfile} \
--feature-ref=${featurefile} \
--expect-cells=1000
"""
}
The structure of the library file (library.csv)
fastqs,sample,library_type
s3://fastqs/path1,s1,antibody
s3://fastqs/path2,s2,Gene expression
The problem in the script when I try t o run a AWS-Batch jobs:
The count process is unable to read the s3:// paths as it takes absolute paths starting with '/' in the column fastqs inside the library.csv file.
Cellranger reference link:
Thanks,
Tanashree