Hi,
Does rMATS-turbo expect the input fastQ file to be trimmed prior to alignment?
If running STAR separately (as recommended in the paper), is it necessary to run cutadapt /other trimming softwares or some other QC step before?
Another related question- I ran rMATS from raw fastq files, combining prep and post steps with --task-both and enabling soft clipping and variable-reads length.
This is what I ran:
rmats.py \
--s1 ./header/ctl.txt \
--s2 ./header/treatment.txt \
--gtf ./genome/gencode.v49.annotation.gtf \
--bi ./genome/star_index \
-t paired \
--readLength 101 \
--nthread 60 \
--od ./rmats_output/ \
--tmp ./rmats_tmp/ \
--variable-read-length \
--allow-clipping \
--task both
The JCEC output file shows some strange features - I'm trying to figure out why:
a) several splicing event types, but they refer to the same genomic coordinates (false positives).
b) When I ran it without variable reads-length it gave me empty output files. I had set the read length as 101. The average read length of my control samples is 101, and the treatment was 98. Does this have to do with the repetitive events?
c) Events created on unannotated isoforms (I used the latest gencode gtf file for annotation).
Or maybe, does this have to do with the rMATS default STAR aligner allowing --twopassMode Basic? Meaning, will there be more likelihood of non-annotated isoforms with this? Should it be omitted in a manual STAR alignment prior to rMATS?
Let me know if you need me to clarify any part of these questions. Thanks!