Using msm_we / haMSM restart plugin with equilibrium (non-recycling) WE simulations?

5 views
Skip to first unread message

Isabel Thompson

unread,
Mar 31, 2026, 12:27:25 PM (4 days ago) Mar 31
to westpa-users

Hi WESTPA community,

                                                                                                                                                          I'm building an analysis pipeline for RNA folding WE simulations that hinges on approximating the slow modes of the dynamics and building haMSMs from simulations that are run in equilibrium mode (no recycling boundary conditions, no source/sink states) — walkers explore freely with splitting/merging to maintain coverage across bins.                                

For validation, I'd like to build a reference haMSM in progress coordinate space (traditional haMSM implementation by Copperman and Zuckerman) to compare against the haMSM I build in learned slow mode space. The idea is to check whether the learned slow mode coordinates actually capture longer timescales than the raw pcoord and to cross-validate between the two. 

I have so far touched the surface for some of the literature but I have some overarching meta-questions about the approach:

  • Does `msm_we` work with equilibrium WE data? My understanding is that history labels are assigned based on which macrostate was last visited (determined from the MSM bin assignments and walker genealogy), not from recycling per se. But I'm not sure whether the `msm_we` code assumes recycling boundary conditions internally — e.g., whether it requires explicit source/sink state definitions determined by the basis and target states defined in `west.cfg` that wouldn't apply to my setup. 

  • Can the WESTPA 2.0 haMSM restarting plugin be used in "analysis-only" mode (e.g., `n_restarts: 0`) to build an haMSM from multiple independent equilibrium WE runs without triggering the restart protocol? I'd like to leverage the multi-run aggregation and variance estimation but manage the simulations myself. I currently use WESTPA 2022.10 with the WEED driver and no re-weighting.

  • If `msm_we` doesn't support equilibrium WE, is there a recommended approach for building a pcoord-space haMSM from equilibrium WE data as a validation reference? I already have the machinery for weighted count matrices, per-iteration flux aggregation, and genealogy tracing via parent_id — so I could implement history label assignment myself, but I'd rather not reinvent the wheel if there's an existing tool.

Any guidance would be much appreciated. Even pointing to relevant literature that answers any of these questions would be amazing. Happy to share more details about the pipeline if helpful.                                                             

Thanks so much!

Isabel Thompson
PhD Candidate | The Corcelli Lab
Department of Chemistry & Biochemistry
University of Notre Dame
itho...@nd.edu

Daniel Zuckerman

unread,
Apr 3, 2026, 11:51:30 AM (14 hours ago) Apr 3
to westpa-users
Hi Isabel.  Your situation is an important one because, as I'm guessing, you wanted to ensure good exploration of configuration space via WESTPA without committing yourself to particular source and sink states.

In principle the msm_we and haMSM tools should be able to achieve what you want, but I don't have the nuts-and-bolts knowledge to answer first two questions right now.  Jeremy from Lillian's group is on vacation this week, and I hope he can address those once he settles back to work.

On the other hand, we have been extremely focused on your general issue, framed as: I have a set of trajectory data, what's the best way to estimate steady state?  And by 'steady state', I mean either equilibrium or a nonequilibrium SS based on arbitrary source-sink choices.

The newer solution we propose to this problem seems to be better than what's possible with MSM/haMSM approaches because it does not rely on specific discretizations of phase space.  The analysis also lends itself to your question of whether a certain set of coordinates provides a better featurization than another.

The new approach, called RiteWeight, uses iterative solutions to (ha)MSMs with random clusterings in order to estimate steady-state weights for trajectory segments that are consistent with any clustering.  You can see our main preprint on it:
https://arxiv.org/abs/2401.05597 (now accepted to PNAS)
The code works with (single-tau) WESTPA segments, and we will be placing some code on the github to facilitate extracting those.
A second preprint describes our in-progress work addressing noise issues with RiteWeight and which we plan to expand to include uncertainty estimation:
Finally, some initial results applying RiteWeight to WE data can be found here

Let me know if you have any further questions.  --Dan Zuckerman


From: 'Isabel Thompson' via westpa-users <westpa...@googlegroups.com>
Sent: Tuesday, March 31, 2026 7:31 AM
To: westpa-users <westpa...@googlegroups.com>
Subject: [EXTERNAL] [westpa-users] Using msm_we / haMSM restart plugin with equilibrium (non-recycling) WE simulations?
 
--
You received this message because you are subscribed to the Google Groups "westpa-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to westpa-users...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/westpa-users/7ecb6eb8-60cf-4c30-a589-db1936369f57n%40googlegroups.com.
Reply all
Reply to author
Forward
0 new messages