Hi Nikhil,
So in terms of K-means vs affinity propagation, the main reason our program using affinity propagation over something like K-Means is because of how it intrinsically works. Whereas K-Means can yield different initial clusters depending on the number of initializations, affinity propagation always yields the same results. The suggestion for `-C 25` was initially based on your number of contigs. A general rule of thumb I have used in the past is if you divide the number of contigs you have by 10000 that should be about the number you set to -C. I am currently working on `Binsanity2` which will incorporate Binsanity-LC into the core code and also will take out some of the need to manually set this parameter.
Having said that ultimately when I test on the example dataset whether I do 25 or 100 the results are nearly identical. And in practice I have erred toward the larger number because it improves both memory and speed of which the program runs with very few losses in accuracy.
-Elaina