Hi Alex,
Thank you for your reply.
I reached out to a few internet resources, I used Bismark.coverage from Bismark extractor into methylkit. I want to use CHH, CHG content also, so, how do I define them in methread function, as bismark.cov report does not have any context information (if I use context = CHH/CHG, in methread() input does not have that information). Therefore, I used another method, by using --CX report and CpG/CHH/CHG context report, and made a methylkit supported format for all three context separately. But when I used this converted file issue come up in memory, to resolve it I did remove the duplicates and it worked fine for me (consulted thread "
https://github.com/al2na/methylKit/issues/63").
But further getting deep into CpG context 28Million Cs were there before duplicate removal, but after removal, I only got 7M, is it correct I am losing 3/4th of the Cs?. Second I was wondering where the duplicate arise when we ran the deduplication step to remove PCR duplicates, in bismark.
I still want to figure out what would be the correct approach to have an analysis done from all three (CpG/CHH, CHG context).
I will appreciate your suggestion
Thank you
_
Muktesh