Well, apparently mclust and I don't get along very well, but I made some progress in two directions
1. When an algorithm fails, build a framework. Since frameworks can't run, they can't fail. So I wrote a generic mapreduce clustering function that takes a clustering function and cluster merging function as arguments.
2. Since the above is cheating, I instantiated the framework based on the clustering function clara, which, unlike mcclust, works. That made the mapreduce version work as well. Unfortunately, one needs to specify the number of clusters in advance and the compute time seems to rise quite drastically with the number of clusters.
The file is cluster.mr.R under examples in the branch dev for now. That alone should be a reminder that this code is highly experimental. Enjoy
Antonio