Dear all,
I'm using rMATS to detect alternative splicing upon drug treatment.
I want to compare splicing changes in vehicle-treated vs increasing drug doses (75, 250 and 500 nM drug).
For this, I'm performing three rMATS runs where I compare each of the drug doses to vehicle.
I'm running rMATS 4.1.0 with the following parameters:
python rmats.py --b1 b1.txt --b2 b2.txt --gtf gencode.v31lift37.annotation.gtf --od out --tmp tmp -t paired --readLength 150 --cstat 0.0001 --libType fr-secondstrand --nthread 30 --novelSS
The number of significantly different splicing events in the output files are:
75 nM vs vehicle:
A3SS.......... 160,803 events
A5SS.......... 111,933 events
SE.......... 348,737 events
RI.......... 43,521 events
MXE.......... 170,759 events
250 nM vs vehicle:
A3SS.......... 177,807 events
A5SS.......... 120,127 events
SE.......... 394,799 events
RI.......... 47,434 events
MXE.......... 212,819 events
500 nM vs vehicle:
A3SS.......... 252,998 events
A5SS.......... 164,492 events
SE.......... 0 events
RI.......... 60,115 events
MXE.......... 488,539 events
Note that the number of splice events in each category increases with increasing drug concentration, except for SE events, where rMATS does not detect any events for the 500 nM drug treatment.
To make sure that the lack of SE events is not dependent on the cutoff, I have also changed cstat to 0.01 and 0.1, but still no SE events detected.
I have also rMATS-compared 500 nM vs. 75 and 250 nM, still with no SE events detected.
Running rMATS directly on original fastq files leads to the same result.
Other tools such as salmon or miso give good results with this data set.
Thus I don't believe that there is an issue with the raw data.
However, during the "processing of the count files" step, I get the following error
==========
Done processing each gene from dictionary to compile AS events
Found 700244 exon skipping events
Found 578900 exon MX events
Found 466469 alt SS events
There are 281872 alt 3 SS events and 184597 alt 5 SS events.
Found 64447 RI events
==========
ase: 69.2721428871
count: 583.429327965
Processing count files.
Traceback (most recent call last):
File "/home/mauer/software/rmats-turbo/rMATS_P/FDR.py", line 45, in <module>
ifile=open(sys.argv[1]);title=ifile.readline();
IOError: [Errno 2] No such file or directory: 'tmp/JC_SE/rMATS_result_P-V.txt'
paste: tmp_500.bam/JC_SE/rMATS_result_FDR.txt: No such file or directory
Traceback (most recent call last):
File "/home/mauer/software/rmats-turbo/rMATS_P/FDR.py", line 45, in <module>
ifile=open(sys.argv[1]);title=ifile.readline();
IOError: [Errno 2] No such file or directory: 'tmp/JCEC_SE/rMATS_result_P-V.txt'
paste: tmp_500.bam/JCEC_SE/rMATS_result_FDR.txt: No such file or directory
The run finishes without any additional errors.
So rMATS does not generate the rMATS_result_P-V.txt file for SE events and I was wondering what the reason for this could be. I find it highly unlikely that there are really no SE events in the highest drug dose as in all my previous rMATS run, I always got numerous SE events, even when comparing much more similar conditions than this one.
Any feedback or speculation on why this happens would be very much appreciated.
Thanks, Jan