Hello
mediapipe face mesh detect 468 key-points ("landmarks") quite successfully.
It provides 3 coordinates, X, Y and Depth.
X and Y are actual placement IN THE IMAGE plane.
In order to understand "real world" coordinates, one have to project back using depth information.
The meaning of the 3rd coordinates in mediapipe mesh is not clear to me. is it on the same scale? where is the origin? iz it z coordinate or distance from the viewpoint to the landmark?
thanks