Dear Samantha,
The minimum overhang equal to 8 is hard-coded both in MAJIQ V2 and our coming release, MAJIQ V3.
The direct answer to the filtering/removal question is no. However, there is more to say about retained intron detection in MAJIQ V2 (I am not sure whether retained introns are relevant to your concern-- if so, read on!). MAJIQ V2 parses “annotated introns” from the input gff3 as annotated exonic regions which connect two other exons. Then, by default, retained introns (annotated and denovo) must pass coverage thresholds to be included in LSV definitions (args: --annotated_ir_always, --disable-denovo-ir). To determine whether an intron passes these thresholds, MAJIQ divides an intron into segments (“bins”) and checks that at least a certain proportion of bins (default 0.5, half of them; arg --irnbins) have a certain length-normalized read coverage level (default 0.01; arg --min-intronic-cov). So, it is possible to have some reads splitting an exon-intron boundary and yet the intron will not be included in LSV quantifications if detection coverage thresholds are not met. Moreover, MAJIQ V2 uses a heuristic to evaluate the above coverage criteria which can result in stricter-than-documented thresholds applied for relatively short introns. MAJIQ V3 replaces this heuristic and also does not require annotated introns to pass coverage thresholds for inclusion in LSVs. Our coming MAJIQ V3 preprint update will include more information on this.
Please feel free to ask any additional questions.
Best Regards,
Barry
Dear Samantha,
MAJIQ-CLIN is designed for the use-case you described. MAJIQ-CLIN is not yet generally available, however:
(1) If you are non-commercial (e.g. part of a university lab) and want to use MAJIQ-CLIN now, please have your lab’s PI contact Yoseph Barash at yos...@upenn.edu with a description of the use-case and request for access. We will provide early access for academic use-cases intended to identify aberrant splicing between patients and controls.
(2) MAJIQ-CLIN will be generally available for academic and commercial licensing after the paper is published, which we hope will be soon.
Best Regards,
Barry