Hi,
Thanks to the team for the guidance with Moccasin code usage. I have a small technical doubt which I would like to clarify please. I have .majiq files of samples from three batches to be corrected:
Controls (3) batch1
Disease (2) batch2
Disease(1) batch3
Since the sample size is highest in batch1, it would be ideal to have the batch2 and 3 adjusted based on batch1. Right? (hoping this does not correct for the biological differences?)
So is it okay that, during the model matrix generation step, model A could be removed to make it full rank, and the effects of Batch 1 are captured by the intercept term in the matrix? Then, technically for the confounding factor selection, I could just "--confounding_factors batch2 0 batch3 0" ? in the end, I would get the two batches 'batch-less' but they will be anyway adjusted to Batch 1? Hope I understood it correctly, please correct me.
I was also wondering if the intercept step could be avoided and only the confounding factors can be mentioned as "batch1 1 batch2 0 batch3 0".
Thank you!
Kind regards,
Swethaa