In addition to what Marc mentioned, BEAGLE can also use OpenMP to
parallelize across multiple CPU cores. A single BEAGLE instance can
currently compute likelihoods for multiple rate categories (think
gamma-distributed across-site rate heterogeneity) using multiple cores. However, I can't
comment on whether current BEAST releases support the use of BEAGLE in
this way.
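
To illustrate why rate categories are a natural unit of parallelism, here is a toy sketch in Python. It is not BEAGLE's API and it does not use OpenMP; the function names, the two-taxon JC69 setup, and the hand-picked rate values are all assumptions made for the example. The point is only that each category's site likelihoods can be computed independently and combined afterwards.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def category_site_likelihoods(rate, seq_a, seq_b, branch_length):
    """Per-site likelihoods for one rate category (two-taxon tree, JC69)."""
    decay = math.exp(-4.0 / 3.0 * rate * branch_length)
    p_same = 0.25 + 0.75 * decay   # probability the base is unchanged
    p_diff = 0.25 - 0.25 * decay   # probability of one specific change
    # 0.25 is the JC69 equilibrium frequency of the base at the root.
    return [0.25 * (p_same if a == b else p_diff) for a, b in zip(seq_a, seq_b)]


def log_likelihood(seq_a, seq_b, branch_length, rates, workers=None):
    """Log-likelihood averaged over equally weighted rate categories."""
    # Each category depends only on its own rate, so the categories can be
    # handed to separate workers. This is the same independence that a
    # multi-core implementation would exploit.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        per_category = list(
            pool.map(
                lambda r: category_site_likelihoods(r, seq_a, seq_b, branch_length),
                rates,
            )
        )
    weight = 1.0 / len(rates)
    return sum(
        math.log(sum(weight * cat[i] for cat in per_category))
        for i in range(len(seq_a))
    )


if __name__ == "__main__":
    # Hypothetical rates with mean 1.0; real analyses would derive them
    # from a discretized gamma distribution.
    example_rates = [0.1, 0.5, 1.0, 2.4]
    print(log_likelihood("ACGTACGT", "ACGTACGA", 0.1, example_rates))
```

Python threads will not give a real speedup for this pure-Python arithmetic, so treat the sketch as showing the structure of the computation rather than its performance.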