Hard to say without knowing more about the individual or how its evaluated. A figure or logs may show a big problem in convergence or other issues in the evo process.
Instead of selection, could the problem be an underlying issue in the fitness calc or the application of operators? This can make it hard to explore the solution space as a variety of individuals may not be producing a variety of meaningful fitness. Perhaps an individual can change via mutation but perhaps its fitness does not actually differ for example. Perhaps some individuals fitness is wrong for some cases and bad individuals are promoted despite the selection method.
NSGA-III will work up for 4+ subgroups and much higher dimensional spaces:
https://deap.readthedocs.io/en/master/examples/nsga3.html#higher-dimensional-objective-spaceDepending on how your evo is progressing you may not necessarily get the desired difference in performance from just varying selection methods and breaking the fitness up from an aggregated value of 1 fitness dimension vs more. Here I tested that by using a base + 14 penalties as an example of similar results for an easy problem:
![test_selection.png](https://groups.google.com/group/deap-users/attach/55138bf0521b/test_selection.png?part=0.1&view=1)