Hi Alessandra,
Thank you for raising this question. Today, as you said, Moccasin adjusts the read rate data in .majiq files. The adjusted read rates are used by the Quantifier to calculate PSI. However, Moccasin does not update the splicegraph, which still uses the original (non-adjusted) read counts. Voila therefore uses the original read counts. In the future we plan to update Voila to additionally display read rate. This will reflect adjustments made by Moccasin if Moccasin was used.
I am curious if your results are otherwise unexpected. One thought is that you might include an "intercept" column (value 1 for each sample) in the model matrix. Does a low-dimensional clustering (e.g. two PCA components) with the non-adjusted vs adjusted sample data reveal surprising changes?
Barry
Hi Alessandra,
Today, I have no recommended alternative to Voila. In the future we plan to update Voila to additionally display read rate. This will reflect adjustments made by Moccasin if Moccasin was used. To be clear, Voila already does display adjusted PSI and dPSI, together with the original (non-adjusted) read counts.
If you really wanted to dig into it-- the .majiq files themselves include the read rates and genomic coordinate information for each LSV in each RNA-seq experiment. You could extract this data and transform it to use with another visualization tool such as IGV or Matplotlib. We don't formally support extracting from internals in this way, but it does contain the data you're interested in, so let me know if you want to go this route and I can share more details.
For your use-case, it should not make a difference whether you use the -F flag or not. The -F flag tells Moccasin to use only the named confounding factor columns and "intercept" column (if included by the user, with "intercept" spelled that way) from the model matrix for linear modeling. In other words, the -F flag will cause Moccasin to ignore all other (non-confounder / covariate) columns from the model matrix. But since you aren't including additional covariate columns, it should not make a difference for you.
Barry