I'm testing DeepTools since a few days and I was wondering if you have advices to give on the normalisation step.
I mean for example when you have done ChipSeq for one histone mark.
You have 2 replicates and one input.
All are in bam format.( from bwa alignment for example)
And finally you want only one wiggle "normalised" to display on viewer or to make some plots to profile the signal around several coordinates.
You want at the end one wiggle file normalised in RPKM or RPM.
This is what I am doing ,(there is a step where i'm using wiggleTools from Ensembl to mean replicates wig for treatment and remove input signal) and I am wondering something in computeMatrix.
1-/ For compute matrix, what are the defaults for binSize & averageTypeBins.
And If you have compute bamCoverage with binSize of 10, it would be appropriate to use the same binSize, no. And is the signal not already set to its mean for this binsize by bamCoverage ?
So then when you use plotProfile , it will compute the stat set in averageTypeBins for the binsize set here. But if averageTypeBins or binsize is not set, what will it do by default ?
If set, reads that have the same orientation and start position will be considered only once. If reads are paired, the mate’s position also has to coincide to ignore a read.
Thanks
echo "Create bigwig rep1 "
bamCoverage --bam ${rep1BamPath} --normalizeUsingRPKM --outFileName ${OUT}${name}.rep1.new.bw --ignoreDuplicates --outFileFormat bigwig --binSize 10 --extendRead ${fragSize} --minMappingQuality 20
echo "Create bigwig rep2 "
bamCoverage --bam ${rep2BamPath} --normalizeUsingRPKM --outFileName ${OUT}${name}.rep2.new.bw --ignoreDuplicates --outFileFormat bigwig --binSize 10 --extendRead ${fragSize} --minMappingQuality 20
echo "Create bigwig control "
bamCoverage --bam ${controlBamPath} --normalizeUsingRPKM --outFileName ${OUT}${name}.control.new.bw --ignoreDuplicates --outFileFormat bigwig --binSize 10 --extendRead ${fragSize} --minMappingQuality 20
echo "wiggletools mean Replicates and substract new"
wiggletools write ${OUT}${name}.mean.normalised.wig diff mean ${OUT}${name}.rep1.new.bw ${OUT}${name}.rep2.new.bw : ${OUT}${name}.control.new.bw
echo "WigGoBigWig"
wigtoBigWig -clip ${OUT}${name}.mean.normalised.wig ${chromLengthIndex} ${OUT}${name}.mean.normalised.bw
echo "computeMatrix"
computeMatrix reference-point --regionsFileName ${joinedPathToBed} --scoreFileName ${OUT}${name}.mean.normalised.bw --outFileName ${OUT}${name}.mean.normalised.gz -a ${winInExon} -b ${winInIntron}
#-binSize ( Already binsize was set to 10 in bamCoverage , so no need to set it again)
#-averageTypeBins (what is used by default ?)
echo "plotProfile"
plotProfile -m ${OUT}${name}.mean.normalised.gz -out ${OUT}${name}.mean.normalised.png --plotTitle ${name}