Sorry for the delay! I have realised that my reply from last week was not saved.
Thank you very much for your helpful reply!
To answer your questions:
The RNA-seq data is stranded. I’ve updated the configuration file accordingly to include the strandness information (see attached). However, the results look very different. Do you mind clarifying what is the correct way to specify strandness in the configuration file?
I used gencode.v49.annotation.gff3 (the comprehensive gene annotation, chr version, from
gencodegenes.org/human) Is this the correct annotation file, or would you recommend a different one?
I was also wondering what it means when splicing events in the splicegraph don’t show any read counts (for example, exon 1 and exon 2 in the viola_1 file I initially attached)?
Regarding the red junctions, here are the exact coordinates: chr19:17,292,854–17,296,716. I’ve attached a screenshot of the coordinates to confirm we’re referring to the same region.
Sorry for all the questions, but could you please let me know where I can find more information on how the final list of genes is generated (since it only includes around 200 genes/LSVs)? Also, which output file should contain the splicing events that show the most statistically significant differences between the two groups?
It also doesn’t seem to work when I use Majiqlopedia. Here’s a possible link
https://tools.biociphers.org/majiqlopedia_normal/generate_ucsc_link?gene_id=gene%3AENSG00000158467Thank you so much for all your help! I really appreciate it!
All the best,
Alex