The HIV Databases at LANL prvide a quick and easy tool, called "GapStreeze" which removes columns from an alignment where one
or more sequences have a gap character. It can be set to remove columns where more than x% of sequences are represented by a
gap.
http://www.hiv.lanl.gov/content/sequence/GAPSTREEZE/gap.htmlThe question of whether stripping of masking gappy sites is good or not depends on what questions you are asking about
the phylogeny and evolution. Most programs will treat a string of six gap characters "------" as six mutations whereas the
more likely evolution is a single deletion event. Including gap sites is probably good for getting the right tree topology
but not so good for measuring accurate distances within the tree.
But anyway, tools like GAPSTREEZE make it so easy to test the question for yourself, that it seems foolish not
to test at least a couple of settings and see how much difference it makes to your phylogeny. If both leaving all
gaps in, and stringently removing all gaps results in essentially the same tree, that is a useful thing to know.
Brian Foley, PhD