I need to detect all the promoters with transcription factor motifs using FIMO, there are two methods:
fimo [options] <motif file> <sequence file>
The first method is putting only one promoter sequence in the sequence file in a single run for each time, and run the program for multiple times.
The second method is putting all the promoter sequencs (about 60,000) in the sequence fileand run them together.
Currently I use the second method, and I found that the running speed is very slow. In addition, it generates huge GFF files (fimo.gff), currently ~629G.
I want to know the difference of the two methods, and how to deal with the huge file properly. I only want fimo.txt, not fimo.gff