Hi there,
I am new in this area of research, so I was looking for some help in utilizing MEME Suite to solve my problem.
I have a murine and a homolog human data set of circa 1000 sequences of 5000bp upstream genes' regions each, which are known to be, from experiments of CHIP-CHIP, regulated by the same gene. We know that this gene most probably recognizes certain motifs, which are actually not stored in any of those motifs databases out there, and we know also that the regulation happens upstream (that is why I collected their upstream positions). I "manually" found (using fuzznuc command of EMBOSS) the positions of my motifs in all of the sequences of the two organisms and I "merely" compared the positional information between mouse and humans. Surprisingly (or not), I collected a bunch of motifs that have a very close positional information (as genetic distance from the start of transcription of every single gene) in homologous genes in both the species data sets, focusing my attention from -1 to -1000 of my upstream regions. Would you suggest me a tool which could validate, statistically, these occurances that I found, keeping in mind that my motifs are not known any published motifs database?
Thanks a lot in advance.
Mariano