Hi Charlene,
It is possible to adapt Moccasin to other situations for confounding-factor adjustment with ratios, as for your Poly-A case. My disclaimer is that, while the code and math should work, I haven’t evaluated how it will perform for Poly-A ratio adjustments.
As to your specific questions:
For MAJIQ/LSVs, coverage_data is a matrix per input sample. Each row is a splice junction and each column is a bootstrap replicate index (default 30 for MAJIQ). The row i, column j entry is the read rate calculated for the i’th junction and j’th bootstrap. It is not normalized; Moccasin does normalization as part of its modeling, and then returns adjusted read rates. (That said, the calculation of read rates itself goes through some quality control steps upstream.) The MAJIQ Quantifier considers variance in the bootstrap read rates (i.e., across columns) to model the posterior PSI distribution. It’s possible your scenario would not use bootstraps, in which case you would have just one column. You could then convert the group values to ratios (normalizing within groups) to get the final Poly-A ratios per gene. Alternatively, if you had bootstrap replicates, you could consider variance across the bootstrap values per Poly-A site, analogous to what the MAJIQ Quantifier does for PSI calculations.
For MAJIQ/LSVs, grouping_data is an array of LSV (group) start indexes. For example if this starts [0, 3, 5…] it means coverage rows 0, 1, 2 are the junctions of the first LSV; 3, 4 are the junctions of the second LSV; etc… The order is assumed to be shared across the coverage_data matrices for all input samples. So, yes, I think this is what you wrote for grouping_data, with genes instead of LSVs.
Best Regards,
Barry
--
You received this message because you are subscribed to the Google Groups "Biociphers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to majiq_voila...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/majiq_voila/796e375b-dc76-453c-82d5-8e39bae57594n%40googlegroups.com.