Hi biociphers team,
while running my splicing analysis, on the heterogen step I stumbled upon a doubt regarding the .psicov files and the moccasin correction. To put some context, I have 725ish samples to perform splicing analysis, so as they are case-control samples, I did the psi-coverage step to group these samples into 2 groups (case and control).
Before performing the quantification (in my case heterogen) step, I wanted to do some correction with moccasin, so as prefix in the model matrix I used the names of the .sj files, but the input files where the 2 .psicov files generated on the previous step. Is this correct to do? Will moccasin actually be able to correct the data as they are grouped in 2 .psicov files? (For more info, i can't do a matrix for just case.psicov and control.psicov as the samples forming each group have different confounding factors).
If the data is actually corrected, I just obtain a single corrected.psicov file so, how should the different groups be indicated? By using the --select-grp1-prefixes/--select-grp2-prefixes on the heterogen step?
If the data isn't actually corrected, should I go back, do a .psicov file for each sample, correct each sample with moccasin and finally fuse the .psicov files into 2 .psicov files grouped by status (case/control)? If so, how can I do this final fusion?
Please let me know if something is unclear and thank you again for your help.
Best regards,
Miriam Martínez