sorry for the delay with this clarification..
The dimensions are 28 observations of 2132 variables for the OTUs - I am not totally sure whether CCA is still a valuable method for such data set ?!
The OTUs are presence/absence transformed since the actual reading of the machine is biologically not 100% meaningful, I am told. So that means that many OTUs show exactly the same pattern for the 28 observations, and/or show the same pattern as other OTUs ... should I somehow filter to just a meaningful data set? is there a simple command or should I rather script something? I couldn't find the notion "anti-correlated" on Gusta-me, never came across it before, but I guess I can make sense of it. How would I test that - is there some command already? Does one of the walk-throughs show a relevant procedure - did I miss that?
Environmental measurements (28 observations of 11 features) are normalized.
The "worst" plots happen though when not the full set of environmental features is used, but a selection (either based on the strict research question, i.e. treatment, or based on e.g. step-wise regression - whether the latter is a valid procedure is another discussion I guess ..). So treatment of course is a factor, in this case with two levels - versus >2000 OTUs ..
Most of the environmental features do cover enough of the gradient of interest for unimodal distributions to be expected - but since some of them are so strongly correlated, it makes sense to only use a subset - well - same as for the OTUs of course .. but how??
Or should I use an alternative method altogether, or approach in a completely different way? Sorry if this is a basic question and covered in any second bioinformatics handbook..?..