Hi Everybody,
I have been trying to use MafFilter for describing the quality of my MAF alignment including 10 reference genomes.
1- I begin with a some crude description of the number of blocks their size and length, easy.
2- Then, I wish to concatenate the blocks up to 20kb, and split them up in windows of 2kb, and calculate the number sequences aligned in each window.
After concatenation and split, some windows can look like that:
Seq1 AT–GCCTTTGANA
Seq2 ATAGCCTTTGATA
Seq3 ATAGCCTTTGAAA
Seq4 NNNNNNNN–NN–N
If I use the stat BlockSize, MafFilter always returns 10 (which is the total number of sequence in my alignment), even if some sequence are only present as N's in a given window. Is there a way to ask MafFilter to count the size of my alignment that include sequenced that are actually defined as non-N's (non-Gaps) only? In other word, I want either to avoid count the sequences that only include N's and Gaps.
Thanks in advance
Michael