Regarding your other questions:
> The total time I believe includes loading of the database (or files in WT_TEST directory) and the execution time.
I believe Don already answered that, but yes, the simple output shows how long the entire benchmark ran from start to finish. When I run wtperf, I pass -o verbose=2 on the command line, which makes it print more information about what is happening as the run proceeds.
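For reference, the invocation I mean looks roughly like the sketch below. The configuration path is a placeholder (I am assuming you substitute the file you are actually running), and WT_TEST is assumed as the database directory since that is the one you mentioned.

```shell
# Placeholder path (an assumption): substitute the configuration file you are running.
CONFIG="path/to/your-config.wtperf"
# Database home directory, assumed to be the WT_TEST directory mentioned above.
HOME_DIR="WT_TEST"

# -O reads the workload configuration from a file, -h sets the database home,
# and -o overrides a single wtperf option (here, more verbose progress output).
CMD="wtperf -O $CONFIG -h $HOME_DIR -o verbose=2"
echo "$CMD"

# Run only if wtperf is on PATH and the configuration file exists.
if command -v wtperf >/dev/null 2>&1 && [ -f "$CONFIG" ]; then
    $CMD
fi
```

With verbose=2 the output should make it easier to tell whether the run is still in the populate phase or has moved on to the workload threads.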
> This test is not showing increased core uses as I increased the run threads from 20 "threads=((count=20,reads=1) to 40, 80, etc. "htop" command shows only one core busy.
I suspect you are monitoring the test during the populate phase, which is single-threaded. Note that the populate phase inserts 120 million records, so it could take a while depending on your hardware. Generally, a single-threaded populate is more efficient than a multi-threaded one, since WiredTiger has some optimizations for single-threaded bulk loading.
I ran wtperf against the most recent version of WiredTiger using the configuration file you referenced, and I see it utilizing all available CPU (my machine has 8 CPUs, and top reports 799% CPU utilized by wtperf). That is with the default 20-thread reader configuration; I didn't increase it, since the CPU is already saturated.