Hello again Michele,
I think I have solved the issue with the multiple CP2K instances, with help from our HPC support.
I am using 8 beads, but the default number of slots in i-PI is 4, so I changed the slots to 8. So far the simulations are running well.
The technician at the HPC cluster even suggested using 32 slots.
However, I have another question, about performance:
I ran the simulation for 25 steps. The average time per step reported in the i-PI log file is 19.5 +- 0.4 s,
but the average times reported by the 8 CP2K instances are ~10.0 s/step for 6 of them and ~14.5 s/step for the other 2.
So in each step there is a difference of 4-5 seconds that I cannot account for.
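To make the arithmetic explicit, here is a minimal Python sketch using only the numbers quoted above. It assumes that i-PI waits for all 8 beads before moving to the next step, so the slowest CP2K instance sets the lower bound on the step time; the variable names are only for illustration.

```python
# Average wall time per step in seconds, as quoted above.
ipi_step_time = 19.5                        # from the i-PI log (+- 0.4)
cp2k_step_times = [10.0] * 6 + [14.5] * 2   # 6 fast and 2 slow CP2K instances

# Assumption: a step cannot finish before the slowest bead has returned forces.
slowest_instance = max(cp2k_step_times)

# Time per step that is not explained by the CP2K force evaluation itself.
unexplained_gap = ipi_step_time - slowest_instance

print(f"slowest CP2K instance: {slowest_instance:.1f} s/step")
print(f"unexplained gap:       {unexplained_gap:.1f} s/step")
```

Measured against the two slower instances, the unexplained part is about 5 s per step; measured against the six faster ones it would be about 9.5 s.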
Any ideas about where the time gap is coming from? Thanks very much!
All the best,
Qinghua