ICA is not a clustering algorithm, it starts with PCA (which is an orthogonal decomposition based on the strongest shared variance), and then recombines the PCA components to maximize independence. In practice, ICA becomes unstable unless you use a smaller number of components than the PCA dimensionality (PCA usually uses the full data dimensionality, so in theory PCA by itself is lossless), so the ICA has to be run as a dimensionality reduction in order to be stable.
When we need to do a dimensionality reduction for analysis purposes, we generally parcellate based on functional area boundaries, instead of using ICA (or folding/gyrus based parcels). Doing it this way means you have a decent description of what each index in the reduced dimension means neurobiologically, and it minimizes the mixing of signal from different functional areas.
Tim