Hi all!
I would like to suggest some extensions to the YAML file even though I am not sure if it is already handle by the pigx pipeline.
Is it possible to open a new section in the YAML file
#Patterns detection:
ptrn1: _R1
ptrn2: _R2
ext: .fq
The reason I am suggesting the following is that while I was trying to setup myself a pipeline using snakemake, I end with issues with the files extensions *.fq and *.fastq. I don't remember exactly which tool (some of the one used in pigx) could not detect one of the extensions. I had to rename it manually, it was ok because they were few. This however can be a extremely tedious work and prone to error.
Additionally, I notice that the _R1, _R2 pattern is hard coded (at least in pigx_rnaseq.py). I also had issues by assuming the same. The file patterns were _1, _2.
My solution was to setup these as arguments in the beginning of my snakefile:
#File pattern detection
ptrn1 = "_1"
ptrn2 = "_2"
#File extention detection
ext = ".fq"
Usage:
rule trim_galore_pe:
input:
[path_reads + "{sample}" + ptrn1 + ext + ".gz", path_reads + "{sample}" + ptrn2 + ext + ".gz"]
.....
Hope this is useful for improving the usage of the pipeline.
Paulo