PCA and Dataset treatment


Hisham Abu Samra

Jan 18, 2012, 5:29:57 AM
to visual-s...@googlegroups.com

Hiya,

 

Thank you very much for all the efforts put into ViSta!

 

I have two questions regarding Principal Components Analysis (PCA) and dataset treatment using the latest version: ViSta 7.9.2.5.

 

Q1: The data consists of 150 observations (samples with different chemical compositions) and 130 variables (spectroscopic chemical composition-related signal intensities).

The dataset was first pre-treated by normalising the signal intensities within each sample. Importing the data into ViSta and carrying out PCA was successful.

However, PCA failed on the same data after mean-centering (that is, the mean of each variable across all samples was subtracted from that variable). I am confident there are no issues with importing the data.

 

The following error message appeared:

(Error: Floating point exception. Happened in: #<Subr-LINPACK-DSVDC: #1d5ff88>)

 

On the other hand, this problem does not seem to occur when the input data matrix is reduced to 150 observations x 110 variables. Could this be related to a limit on the input matrix size that ViSta can process?

 

Q2:  Is it possible to carry out specific data treatment prior to performing PCA using ViSta to obtain:

(i) mean-centering of the dataset.

(ii) variance scaling (each variable is divided by its standard deviation in the input dataset).
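
To make the two treatments concrete, here is a minimal sketch of (i) and (ii) done outside ViSta, in Python with NumPy. This is not ViSta code: the function names `mean_center` and `variance_scale` are hypothetical, the random matrix is only a placeholder with the same 150 x 130 shape as the data in Q1, and the use of the sample standard deviation (`ddof=1`) is an assumption.

```python
import numpy as np

def mean_center(X):
    """Subtract each variable's (column's) mean across all samples."""
    return X - X.mean(axis=0)

def variance_scale(X, ddof=1):
    """Divide each variable (column) by its standard deviation.

    ddof=1 (sample standard deviation) is an assumption, not a ViSta setting.
    """
    sd = X.std(axis=0, ddof=ddof)
    if np.any(sd == 0):
        raise ValueError("a variable is constant: its standard deviation is zero")
    return X / sd

# Placeholder data: 150 observations (rows) x 130 variables (columns).
rng = np.random.default_rng(0)
X = rng.random((150, 130))

Xc = mean_center(X)        # treatment (i)
Xs = variance_scale(Xc)    # treatment (ii) applied after (i)
```

After both steps every column of `Xs` has mean zero and unit standard deviation.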

 

Thank you very much in advance.

 

Hisham

Pedro Valero

Jan 27, 2012, 5:55:02 AM
to visual-s...@googlegroups.com
I am not sure why this fails for you, but if you use the correlation matrix as the input option for PCA (this is the default), the software standardizes the variables before the analysis (this answers Q2).
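
The link between the correlation-matrix option and standardization can be checked numerically. The sketch below uses Python with NumPy, not ViSta, and says nothing about how ViSta computes PCA internally. It assumes placeholder random data of the same 150 x 130 shape and the sample standard deviation (`ddof=1`), and compares the eigenvalues of the correlation matrix with the component variances obtained from an SVD of the standardized data.

```python
import numpy as np

# Placeholder data: 150 observations (rows) x 130 variables (columns).
rng = np.random.default_rng(0)
X = rng.random((150, 130))
n = X.shape[0]

# Standardize: mean-center each variable, then divide by its standard deviation.
Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

# Route 1: eigenvalues of the correlation matrix of the raw data.
R = np.corrcoef(X, rowvar=False)
eig_corr = np.sort(np.linalg.eigvalsh(R))[::-1]   # descending order

# Route 2: component variances from the singular values of the standardized data.
s = np.linalg.svd(Z, compute_uv=False)            # already in descending order
eig_svd = s**2 / (n - 1)

# True when the two routes give the same component variances.
print(np.allclose(eig_corr, eig_svd))
```

Both routes give the same component variances, which is why choosing the correlation matrix amounts to mean-centering and variance scaling the variables first.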