Hi,
I am interested in using bamCompare in order to compare two samples with and without treatment, as it seems like a very useful tool for our purposes. The reason we want to use bamCompare is because it looks like the two samples we are comparing have very
different signal-to-noise ratios. Am I correct in my understanding that using the SES method to determine the scaling factor can correct for this discrepancy?
Would it be correct in my case to use the flags
--scaleFactorsMethod SES --ratio subtract
I.e. will bamCompare scale each sample separately in this case? Or is it better to use bamCompare separately on each sample vs. input first, and then compare log2 ratios of each sampe vs. input to find regions where there is a significant difference between sample 1 and sample 2?
Also, is the flag
--scaleFactorsMethod SES
a complete normalization on its own, or do I need to use the flag combination
--normalizeUsingRPKM --scaleFactorsMethod SES
to account for both differences in signal-to-noise ratio, and overall sequencing depth?
Thanks alot for your help. i look forward to hearing from you and seeing what results I get with bamCompare!
Sincerely,
Isaac Kremsky
Postdoc in Victor Corces' Lab
Hi Devon,
Great thanks for your help. That clears up alot! I have a few more questions though.
So if II understood what you said correctly, in the case where I use "--scaleFactorsMethod SES --ratio subtract --normalizeUsingRPKM", it won't ignore the flag "--normalizeUsingRPKM"? Does that mean it will apply both RPKM and SES normalization? Or are the 2 methods mutually exclusive? If not would it make sense to use both if the two samples have a large difference of total reads as well as differences in IP efficiency?
Thanks!
Isaac
It will SES normalize the samples and then RPKM normalize the resulting difference between the samples. SES is nice when you have decent IPs and want to make the enriched regions pop out a bit more. This will help a bit in accounting for IP efficiency differences, but there's nothing it can do to fully account for such differences.
Devon
-- Devon Ryan, PhD Bioinformatician / Data manager Bioinformatics Core Facility Max Planck Institute for Immunobiology and Epigenetics Email: dpry...@gmail.com
Oh I see, so the RPKM normalization wouldn't affect the direction of change then because its normalizing after taking the difference between samples, right? That would be more for making differences between two sets of samples comparable to each other I guess? Does it just normalize based on the sumof the reads from the 2 samples?
Thanks again for all your help!
Isaac