Hi Runhua,
It is totally reasonable for the different methods to generate different numbers of subsets. The numbers that you list fall in line with what I would expect considering how the different methods work. The runs using unlinked branch lengths require many more parameters to be estimated for each subset, therefore, fewer total subsets would be expected because the much larger number of parameters estimated per subset incur a higher penalty in the calculation of AIC(c)/BIC.
To decide which scheme to use, you should first pick your preferred information criterion (Rob suggested AICc for good reason). Then you should select the scheme with the best score associated with that criterion. If you do decided to choose AICc, then the best scheme is the one generated with unlinked branch lengths. However, keep in mind that if you choose this scheme, and you are planning on estimating a phylogenetic tree, you should use software that allows you to estimate a tree with unlinked branch lengths among subsets. You can usually figure this information out by consulting the software manual/documentation.
My best,
Paul