Hi
This pipeline uses contrasts, which are defined in multiple ways
1. Contrasts file
My understanding that if it is defined this way, I can use the contrasts and comparison to identify the groups, and then compare both of them.
If that's the case, and I have 2 groups (WT/KO) with 3 samples each WT1, WT2, WT3, KO1, KO2, KO3
I should run something like
rmats.py --task prep -b1 WT1_bam.txt
... rmats.py --task prep -b1 KO3_bam.txt
rmats.py --task post -b1 WT1_bam.txt --statoff
..rmats.py --task post -b1 WT1_bam.txt --statoff
rmats.py --task stats -b1 group1_bam.txt -b2 group2_bam.txt
I should NOT use the paired statistical model in this case.
The rMATS model (as I understand it) doesn't support blocking, so just ignore that bit in modeling?
Have I got this correctly?
2. Formulas, which are more complicated
Formulas can have lots of interaction terms
Is there anyway to do a statistical analysis based on formulas? Is that something you might be considering in the future?
Thank you,
Uri David