NHC Project Update

28 views
Skip to first unread message

Jennings, Michael E

unread,
Dec 6, 2018, 4:15:06 PM12/6/18
to n...@lbl.gov
Hi everyone!

It's been awhile, so I wanted to provide an update on where things stand with NHC.

First, the news everyone has been waiting for: Things are moving again! Thanks to the efforts of some enterprising colleagues, we've obtained an updated approval from the Feynman Center for Innovation (essentially LANL's Tech Transfer department) for publication and contributions to open source projects in the HPC space. So yay!

This also means that I'll be able to make public the numerous cleanups, fixes, and additional checks Tess wrote last year during her time as a LANL post-bacc. I'll also be publishing a number of updates and enhancements we've been working on as part of deploying NHC on our flagship Cray XC40 supercomputer, Trinity, and our new Arm-based XC50s as well. This means that, in the very near future, NHC will be in production use on all LANL HPC systems, which means great things for the future of the project and the quality of the code going forward. Double-yay! :-)

And finally, a brief roadmap update. Those who follow the project on GitHub are likely already aware, but I've been hard at work getting things cleaned up and ready for a 1.4.3 release. I was hoping to get that done for SC18, but obviously that didn't quite happen. One way or another, I'm determined to get a new release out the door by the end of CY18.

What's next? We've already got a number of new feature-focused PRs pending that will be targeted for a 1.4.4 release. I'll also be adding much of the LANL-sponsored work from last year for that release as well. SchedMD have made some recommendations for improving Slurm integration which are also planned for the 1.4.4 release. I'm shooting for having that release ready for ISC'19, though depending on how the first two quarters of CY19 go at LANL, that may slip to SC'19.

I'm also planning for a 1.5 release that will happen most likely in late CY19 or early CY20 following the final release of RHEL8. The NHC 1.5 release will ADD support for RHEL8 and DROP support for RHEL4 and RHEL5. (Hopefully no one is still using RHEL5 since it's no longer receiving public security fixes, but if you are, please let me know! I'm open to continuing some level of support for the 1.4.x branch through 2020 if required.) This opens the window for making use of BASH associative arrays and the "coproc" keyword, both of which will immensely simplify key portions of the code. Hence the bump to 1.5....

I'm also starting to contemplate what a 2.0 release might look like, so if you have any thoughts on that (or any of this!), please feel free to chime in. :-)

Happy holidays, everyone!
Michael

--
Michael E. Jennings <m...@lanl.gov>
HPC Systems Team, Los Alamos National Laboratory
Bldg. 03-2327, Rm. 2341 W: +1 (505) 606-0605

Reply all
Reply to author
Forward
0 new messages