Hi all,
1. When I run step-3 on my laptop, which has 2 cores * 2 threads,
the "top" command shows about 370% CPU usage for "step-3", which means that all 4 hardware threads are in use.
How does this happen, given that step-3 does not contain any parallel computation settings?
2. Furthermore, when I run step-17 on my laptop with "mpirun -np N ./step-17", "top" shows N "step-17" processes, each at about 200% CPU usage (does each MPI process run on two threads?).
3. I also tested step-17 on my PC, which has 4 cores * 2 threads.
On my PC's Linux system:
With "mpirun -np 1 ./step-17", "top" shows 1 "step-17" process at about 200% CPU usage;
With "mpirun -np 2 ./step-17", "top" shows 2 "step-17" processes, each at about 200% CPU usage;
With "mpirun -np 3 ./step-17", "top" shows 3 "step-17" processes, each at about 260% CPU usage, i.e. about 780% in total, so "mpirun -np 3 ./step-17" occupies nearly all 8 of my hardware threads.
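To make the arithmetic behind these figures explicit, here is a small sketch of how I convert the per-process numbers from "top" into an equivalent number of busy hardware threads. It assumes that "top" reports 100% per fully used hardware thread (its default mode on Linux, as far as I know). The helper name busy_threads is my own and is not part of deal.II or MPI.

```python
import os


def busy_threads(n_processes, percent_per_process):
    """Convert top's per-process %CPU into an equivalent number of
    fully busy hardware threads (assumes 100% == one hardware thread)."""
    return n_processes * percent_per_process / 100.0


if __name__ == "__main__":
    # Figures observed on the PC (4 cores * 2 threads = 8 hardware threads).
    for n, pct in [(1, 200), (2, 200), (3, 260)]:
        print(f"-np {n}: about {busy_threads(n, pct):.1f} hardware threads busy")
    # Number of logical CPUs the operating system reports on this machine.
    print("logical CPUs reported by the OS:", os.cpu_count())
```

With 3 processes at about 260% each, this gives roughly 7.8 busy threads out of 8.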
However, I also tried step-17 in a Linux virtual machine running on my PC's Windows system. In that virtual machine, with "mpirun -np N ./step-17", "top" shows N "step-17" processes, each at about 100% CPU usage.
Only this last case seems understandable to me. Moreover, for step-17 the computation in the virtual machine is even faster than on my PC's native Linux system, which is very weird.
How can these cases be explained?
I am really confused, and any help is appreciated.
Best,
Liu