1. I derived the closed form wirtinger gradient and hessian matrix for f(A), but it seems that Manopt does not support wirtinger gradient and hessian.
2. Another way to tackle this manifold problem is to let A_{ij}=e^{i\theta_{ij}} and then solve the unconstrained problem over \Theta=[\theta_{ij}]. I wonder if solving the manifold optimization directly will have some benefits compared with this parameterized method.
Thanks for your reply. The background of my problem is to decompose a given matrix P into two matrices A and D, where each element of A should satisfy the unit modulus constraint |A_{ij}|=1. Then the problem can be formulated as
min_{A\in M,D} ||P-AD||_{F}^2
Given a full column rank A, this problem is a least square problem and the optimal solution D_{opt}=(A^*A)^{-1}A^*P. Inserting D_{opt} into the original problem we obtain
min_{A\in M} ||P-A(A^*A)^{-1}A^*P||_{F}^2
which is the problem I'm working on. This reformulation can reduce the optimization variables from (A,D) to A, so I expect that the number of saddle points is also reduced.
Could you give me some hints why you prefer to go with the complex circle approach than the parameterization method? Does the parameterized problem have more stationary points than the un-parameterized version?
Thanks. I tried an alternating minimization method to optimize A and D alternatively. The performance of this method is not bad, but I think there still has some space to improve the algorithm because sometimes this method will converge to a saddle point. I will try the method you provided later.
Best regards
juening
Thanks for sharing your thought. Indeed, I derived the gradient and Hessian matrix using the method you posted. The main problem is that I'm trying to seek an algorithm that can guarantee to converge to a local minimum. I read some papers on saddle-free algorithms, e.g, "Identifying and attacking the saddle point
problem in high-dimensional non-convex optimization". But it seems that these methods do not work on my problem. Since both Nicolas and you prefer to deal with the complex circle directly, I will try the trust region method on manifolds and see what happens.