Hi,
I performed motif discovery in a rather large sequence set of defined open chromatin regions with the command line version of MEME-chip and used the parameters -ccut 0 and -spamo-skip, otherwise default settings.
I am interested in where in my input sequence the significant motifs are located. As I understand it, this is provided in the FIMO output files generated by MEME-Chip for each significant motif.
I do however wonder why the FIMO output generated by MEME-chip gives a very different number of motif matches compared to when I use the FIMO web application? As input to the FIMO web application I use the same sequence as was used as input to MEME-chip and a selection of the significant motifs identified by analysis with MEME-chip. FIMO web application give a much higher number of matches for many of the motifs. I use default p-val threshold.
Does this have to do with the background sequence used by MEME-chip when scanning the sequence for individual matches with FIMO?
My set of open chromatin regions is larger than 1 Mb but the FIMO web application accepts the file. Could the size of the input file be the problem?
All in all, how do I best identify putative motif sites for the identified enriched motifs in my set of open chromatin regions to use in further analyses?
Victoria