Hi,
From what I see, the output of MLML data is not formatted in a way to be useful for Radmeth given that the data is split across 3 columns: hmC, mC, and C. Would it be possible and appropriate to create pseudo read counts and coverage of hmC for multiple files, combine those data files with methcounts merge, and then utilize radmeth regression?
I expect this would be possible, however, doing so would need some guidance on the levels of pseudo hmC. If it were 20% hmC, would that be best represented as 8 coverage, 2 hydroxymethylated reads, or 80 coverage and 20 hmC reads? This example is simple, but more diverse estimations like 24% or 35% would be poorly served by approximating them with values less than 10.
Thank you for all the help,
Chris