Explanation for STREME site_distr

17 views

Skip to first unread message

alan.tou...@gmail.com

unread,

Dec 4, 2024, 10:20:50 AM12/4/24

to MEME Suite Q&A

Hello,

I struggle to understand how `site_distr` fromm STREME is used to find motif positions within the sequences. I'd like to know the most common relative position of the motif within the sequences.

Thank you for your help,

Alan

cegrant

unread,

Dec 5, 2024, 12:07:19 AM12/5/24

to MEME Suite Q&A

For each motif it identifies, STREME first makes a list of the highest scoring occurrence of that motif in each of the training sequences. Then, by default, STREME takes the sequences in training set, center aligns them, and trims them to their longest common length. STREME then walks through each position in the aligned sequences and counts how many of the sequences had a motif instance at that position, considering only at the highest scoring motif instance for each sequences. The array of numbers in the site_distr element are the counts of those motif occurrences for each position in the aligned and trimmed training sequences. In the "Advanced options" for the web application you can choose whether the sequences should be center, left, or right aligned. In command line version of STREME you can choose this via the '--align' option.

Reply all

Reply to author

Forward

0 new messages