Hello,
I have Illumina Miseq paired end sequencing reads of 51 samples (102 fastq files; 1 forwards+1 reverse for each sample).
Initially I was using fastq-join to join each sample's forward and reverse strands but this is taking a lot of time. So instead I want to use multiple_join_paired_ends.py and this was my command
multiple_join_paired_ends.py -i ../0_data/ -o ../1_fastq_join/ -p fastqjoin_params.txt
where all the forward and reverse sequence (2* .fastq files per sample) of every sample is present in the 0_data folder
and the parameters file contains this information
join_paired_ends:min_overlap 50
join_paired_ends:perc_max_diff 8
the command runs successfully however I find that the output is a folder for every sample that contains fastqjoin.join.fastq, fastqjoin.un1.fastq and fastqjoin.un2.fastq
files.
I want to edit the script of multiple_join_paired_ends.py in such a way that the output are three files for each sample but the output filename will have 'filename'.join.fastq instead of fastqjoin.join.fastq. And if this happens, I dont need the outputs to be in separate folders in the output folder, they can be just output fastq files with individual filename.
since I am not good with programming or scripting, can anyone help me how to edit the script to achieve this above objective ?
Thank you in advance.