Hi,
Thank you for the great response on this forum! I have three questions regarding multiple fastq files which describes different scenarios:
1) If different pairs of fastq files are from the same library, then I should place all these fastq pairs into the same fastq folder to run juicer.sh at the same time and it should combine all these fastq files from the same library into one result?
2) If one library only has one pair of fastq files, the folder should only contain this pair (if Q1 is correct). Although one biological replicate can be used to generate multiple libraries, we should not mix fastq files from multiple libraries of the same biological replicate. After running juicer.sh on each library, we can then concatenate the merged_nodups.txt from different libraries generated from the same biological replicate to merge all the libraries, is this correct?
3) If a cell line has multiple biological replicates, we can use mega.sh to merge all its biological replicates to form the final per cell line hic data, is this true?
So, if we have one cell line with biological replicate 1 and 2, each of these replicates has 3 libraries, my workflow would be: 1) for each library (6 in total), run juicer.sh (will either only have 1 pair of fastq files, or multiple pairs), 2) concat the output of 3 libraries for biological replicate 1 and 2 respectively, getting 2 merged files in total, 3) run mega.sh on these 2 merged files so that we have one file per cell line.
Best wishes,
Savio Chow