Belatedly responding on this:
Thanks for the feedback Marjin, but I must disagree with this section that you wrote:
When considering the SNP that you mention it is good to specify that for
insertion deletions, given that they start at the same basepair, they
cannot really be ambiguous.
In the example that you give, both
variants are exactly the same, the only difference is what you consider
to be the reference. The links in the GnoMAD browser for example display
the same rsID for both variants.
I would first clarify that an rsID does not uniquely identify a (biallelic) variant - it really identifies a location, at which there could be any number of biallelic variants, each one being a different variance
from the genome reference. In my example (the two links to GnomAD), examining the population frequencies should make clear that these are two very different variants. If you click through the RS# hyperlink and head to the dbSNP resource, you'll find it got merged to another RS#, but eventually you'll see
here that that RS# actually covers 14 (if I counted correctly) different variants!
I suspect the confusion stems from your referring to "
what you consider
to be the reference". The 'reference' allele (never to be confused with the 'effect' allele) is determined by the published genome reference - it's not something I decide. I think
it is illustrative to click on the UCSC link on either of those GnomAD pages, and you can see the human genome reference sequence at top which shows, starting at position 69908498, an 'A' followed by 16 Ts. (if I counted correctly). And thus:
the variant: 2-69908498-AT-A results in a person having an A followed 15 Ts (one less T) the variant: 2-69908498-A-AT results in a person having an A followed 17 Ts (one more T)
which is why these are different variants, and the allele order is critical, when indels are involved.
So circling back to my original question: is it possible to allow (perhaps optionally) the disabling of the allele re-ordering in the FUMA output? If that were allowed, then a user of FUMA would know whether they supplied input data with Effect==REF or Effect==ALT, and thus it would possible to unambiguously determine the variants from the FUMA output. Otherwise, for the reasons I've stated, I'm afraid all Indels becomes ambiguous and cannot be used.
Thanks
-Matt