How to derive the gradient for a minimization function on two distributions

Lance Huo

unread,

Aug 18, 2015, 12:42:21 AM8/18/15

to Manopt

Hello Guys,

How to derive the gradient for a minimization function on the Hellinger distance for two distributions? The detailed optimization problem is described in the attached file! I do think this problem should be solved by Manopt software, however, I really get confused to derive the gradient for the function. Can anyone give some suggestions? Thank you so much!

Lance

Problem.pdf

Nicolas Boumal

unread,

Aug 18, 2015, 8:27:51 AM8/18/15

to manopt...@googlegroups.com

Hello Lance,

Thank you for your question.

That is indeed a nasty function to differentiate... I started working it out, but it would take quite a bite of time to do this completely. Also, it will depend on the expressions for p(x) and q(x), and their derivatives.

I would advise you start by working out the gradient for this function:

f(W) = ( sqrt(T(W'x)) - sqrt(1-T(W'x)) )^2

Indeed, (1) is a linear combination of such functions, so getting the full gradient from that of f is easy.

It's not too difficult to get the differential of f:

Df(W)[Wdot] = \frac{2T-1}{sqrt(T*(1-T))} * DT(W'x)[Wdot'x],

where T = T(W'x) for short, and DT(W'x) is the differential of T(x) at W'x.

Now, to get DT, you'll need to look into p, q, Dp and Dq.

Once you get that, you want to figure out the adjoint of DT (it's a linear operator). Call the adjoint DT*, Then, the gradient is:

nabla f(W) = \frac{2T-1}{sqrt(T*(1-T))} x * (DT*(W'x)[1])'

The adjoint is defined here: http://ricerca.mat.uniroma3.it/users/sorrentino/Alfonso_Sorrentinos_Home_Page/MAT204Fall06_files/Lecture6.pdf.

Once you have the expression for DT, use the definition to get DT*.

I'm sorry if this probably looks just as complicated to finish as it did at first..

Message has been deleted

Nicolas Boumal

unread,

Aug 21, 2015, 3:54:55 AM8/21/15

to Manopt

Hello Lance,

Unfortunately, I won't have more time to help you with computing this gradient.

I do encourage you to derive it though. There is nothing hard to it, but it is a bit long to do. I suppose it takes practice too.

One suggestion would be for you to try easier functions first, to get the knack of it.

Good luck,

Nicolas

Lance Huo

unread,

Aug 23, 2015, 11:12:29 PM8/23/15

to Manopt

Hello Nicolas,

Thank you so much!

Best

Lance

Reply all

Reply to author

Forward