[SciPy-User] Cholesky for semi-definite matrices?

Nathaniel Smith

Jul 9, 2011, 11:19:55 AM
to SciPy Users List
Hi all,

I've run into a case where it'd be convenient to be able to compute
the Cholesky decomposition of a semi-definite matrix. (It's a
covariance matrix computed from not-enough samples, so it's positive
semi-definite but rank-deficient.) As any schoolchild knows, the
Cholesky is well defined for such cases, but I guess that shows why
you shouldn't trust schoolchildren, because I guess the standard
implementations blow up if you try:

In [155]: np.linalg.cholesky([[1, 0], [0, 0]])
LinAlgError: Matrix is not positive definite - Cholesky
decomposition cannot be computed

Is there an easy way to do this?

-- Nathaniel
_______________________________________________
SciPy-User mailing list
SciPy...@scipy.org
http://mail.scipy.org/mailman/listinfo/scipy-user

Charles R Harris

Jul 9, 2011, 1:30:57 PM
to SciPy Users List
On Sat, Jul 9, 2011 at 9:19 AM, Nathaniel Smith <n...@pobox.com> wrote:
Hi all,

I've run into a case where it'd be convenient to be able to compute
the Cholesky decomposition of a semi-definite matrix. (It's a
covariance matrix computed from not-enough samples, so it's positive
semi-definite but rank-deficient.) As any schoolchild knows, the
Cholesky is well defined for such cases, but I guess that shows why
you shouldn't trust schoolchildren, because I guess the standard
implementations blow up if you try:

In [155]: np.linalg.cholesky([[1, 0], [0, 0]])
LinAlgError: Matrix is not positive definite - Cholesky
decomposition cannot be computed

Is there an easy way to do this?


The positive definite Cholesky algorithm is basically Gaussian elimination without pivoting. The semidefinite factorization is generally not unique unless pivoting is introduced to move the zero pivots to a common spot, and if your matrix has errors the idea of semidefinite becomes pretty indefinite anyway. Depending on the use case, you will probably have more success with either eigh or svd. Note that in the svd case the singular values are always positive, but the resulting factorization will not be symmetric when one of the eigenvalues is negative. In the eigh case you can simply set small, or negative, eigenvalues to zero.
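
As a minimal sketch of the eigh route (plain NumPy; the clipping tolerance and helper name are only illustrative):

import numpy as np

def psd_factor(A, tol=0.0):
    # Eigendecomposition of the symmetric matrix A.
    w, V = np.linalg.eigh(A)
    # Clip small or negative eigenvalues to zero.
    w = np.where(w > tol, w, 0.0)
    # Scale each eigenvector column by sqrt(eigenvalue),
    # so that np.dot(B, B.T) reconstructs A.
    return V * np.sqrt(w)

A = np.array([[1.0, 0.0], [0.0, 0.0]])
B = psd_factor(A)
print(np.allclose(np.dot(B, B.T), A))   # True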

Chuck

eat

Jul 9, 2011, 1:36:01 PM
to SciPy Users List
Hi,

On Sat, Jul 9, 2011 at 6:19 PM, Nathaniel Smith <n...@pobox.com> wrote:
Hi all,

I've run into a case where it'd be convenient to be able to compute
the Cholesky decomposition of a semi-definite matrix. (It's a
covariance matrix computed from not-enough samples, so it's positive
semi-definite but rank-deficient.) As any schoolchild knows, the
Cholesky is well defined for such cases, but I guess that shows why
you shouldn't trust schoolchildren, because I guess the standard
implementations blow up if you try:

In [155]: np.linalg.cholesky([[1, 0], [0, 0]])
LinAlgError: Matrix is not positive definite - Cholesky
decomposition cannot be computed

Is there an easy way to do this?

How about a slight regularization, like:
In []: A
Out[]: 
array([[1, 0],
       [0, 0]])
In []: A+ 1e-14* eye(2)
Out[]: 
array([[  1.00000000e+00,   0.00000000e+00],
       [  0.00000000e+00,   1.00000000e-14]])

In []: L= linalg.cholesky(A+ 1e-14* eye(2))
In []: dot(L, L.T)
Out[]: 
array([[  1.00000000e+00,   0.00000000e+00],
       [  0.00000000e+00,   1.00000000e-14]])

My 2 cents,
- eat 

Gael Varoquaux

Jul 12, 2011, 10:23:17 AM
to SciPy Users List
On Sat, Jul 09, 2011 at 08:36:01PM +0300, eat wrote:
> > (It's a
> > covariance matrix computed from not-enough samples, so it's positive
> > semi-definite but rank-deficient.)

> How bout a slight regularization,

Indeed, from a statistics point of view, a covariance matrix that is not
positive definite is simply an estimation error. As a descriptive
statistic it may look good, but it does not contain much information
about the population covariance. The easiest solution is to regularize it.

The good news is that there exist good heuristics (approximate oracles)
for finding the optimal regularization parameter. If you assume that your
underlying data are Gaussian, the 'Oracle Approximating Shrinkage' (OAS)
is optimal. If you don't want that assumption, the Ledoit-Wolf shrinkage
works great. In practice they are often similar, but LW tends to
under-penalize compared to OAS if you have little data compared to the
dimension of your dataset.

More good news: you can find Python implementations of these
heuristics in the scikit-learn:
https://github.com/scikit-learn/scikit-learn/blob/master/scikits/learn/covariance/shrunk_covariance_.py
You will find references to the papers there if you are interested, and
you can simply copy the functions out of the scikit-learn if you don't
want to depend on it.
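
A minimal usage sketch (assuming a recent scikit-learn where these estimators live in sklearn.covariance; X is an (n_samples, n_features) data array):

import numpy as np
from sklearn.covariance import LedoitWolf, OAS

rng = np.random.RandomState(0)
X = rng.randn(10, 20)                  # fewer samples than features

lw = LedoitWolf().fit(X)               # Ledoit-Wolf shrinkage
oas = OAS().fit(X)                     # Oracle Approximating Shrinkage

# The shrunk estimates are well conditioned, so Cholesky now succeeds.
L = np.linalg.cholesky(lw.covariance_)
print(lw.shrinkage_, oas.shrinkage_)   # selected shrinkage amounts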

HTH,

Gael
