Dear JASPAR Team,
Hope you're doing well!
First of all, thank you so much for maintaining such an amazing database - it's been incredibly helpful for my research.
I'm writing to report a small issue I noticed while working with the JASPAR CORE PFMs (the non-redundant text file version). It looks like there might be a duplicate entry in the database.
I found that these two transcription factors have exactly the same PWM matrix:
MA2255.1 (Max)
MA2215.1 (dm)
Here's what the data looks like in the file:
>MA2255.1 Max
A [ 541 0 0 1000 0 0 0 0 ]
C [ 208 833 1000 0 958 0 0 0 ]
G [ 41 125 0 0 0 1000 0 1000 ]
T [ 208 41 0 0 41 0 1000 0 ]
>MA2215.1 dm
A [ 541 0 0 1000 0 0 0 0 ]
C [ 208 833 1000 0 958 0 0 0 ]
G [ 41 125 0 0 0 1000 0 1000 ]
T [ 208 41 0 0 41 0 1000 0 ]
As you can see, the numbers are identical for both entries. This caused some confusion in my analysis since I was expecting each TF to have its own unique binding pattern.
Could you please take a look when you have time? Maybe it's just a simple copy-paste error in the database.
Thanks again for all your great work! The database is really wonderful.
Best regards,
Xuefen