The range of possible log-likelihood scores varies with the length of the motif, the background model, and the information content of the motif. For a given motif and background model you can use the log likelihood score to compare the
relative quality of two matches, but a negative score in itself doesn't indicate a "poor" match. It may be that the range of possible log likelihood scores for a motif runs from -1e-4 to -1e-10, and achieving a score as high as -1e-10 by chance is extremely unlikely.
For each motif FIMO estimates the distribution of possible scores, and uses the computed distribution to compute measures of statistical significance for each observed score. The p-value or q-value is what you should threshold on, not the raw score.