Hi Jintu,
Regarding the "rough" MSM construction, I would suggest using
msm_we and just set 1 cluster per bin. The readthedocs there should be quite comprehensive. Here's the procedure if you decide to do it manually without
msm_we.
I would suggest just grabbing the state labels from assign.h5 and construct the dataset by hand. That way you have a better control about the lag time value you choose.
1. Create a n x n "count matrix" and zero it out. For a chosen lag time (example with your WE Tau here), loop through each segment and look at the last frame and its parent's state label (last frame of its parent). Add the weight of
that frame to the corresponding transition element in the "count matrix' .
2. Row normalize such that probability of each state sums up to 1. This will be your transition matrix.
3. Put the transition matrix into the chosen MSM program of your choice (PyEmma etc.) and continue. You have now interjected WE data into the program and can follow through.
Hope that helps. We're still looking into the w_direct discrepancies but we think it's related to the state definitions.
Regards,
Jeremy Leung
--
Jeremy M. G. Leung
PhD Candidate, Chemistry
Graduate Student Researcher, Chemistry (Chong Lab)
University of Pittsburgh | 219 Parkman Avenue, Pittsburgh, PA 15260
jml...@pitt.edu | [He, Him, His]