Hi Hannah!
I am reaching out to improve my understanding of how Cicero works.
For the graphical LASSO, the observations are formed by groups of similar cells (metacells). There may be some overlap between these metacells based on the overlap filtering parameter (90% from the manuscript methods). Therefore, there are likely some metacells that still share cells (in other words, one scATAC-seq cell could be present in multiple metacells), meaning that the observations are not technically independent of one another.
To my understanding, one of the assumptions of regression-based methods is independence of observations, meaning that observations can only be counted once.
Since we have overlapping metacells, would this technically violate one of the assumptions of the graphical LASSO? Or does the 90% overlap filtering step address/dampen this concern?
Please let me know if I misunderstand something and/or feel free to share your "rebuttal statement" for a hypothetical reviewer.
I ask so that I can be more prepared for the scientific review process and for my thesis committee meetings. Thank you!