Welcome to Lumin's serial/parallel benchmark, init ... [OK]
I: [initialization] time cost is 0.811117 seconds.
--------------------------------------------------------------------------------
I: [dcopy in serial] time cost is 0.187758 seconds.
A 1.000000 1.000000 C 1.000000 1.000000
I: [dcopy in parallel] time cost is 0.110634 seconds.
A 1.000000 1.000000 C 1.000000 1.000000
--------------------------------------------------------------------------------
I: [dasum serial] time cost is 0.244707 seconds.
resA 67108864.000000
I: [dasum parallel] time cost is 0.126894 seconds.
resB 67108864.000000
--------------------------------------------------------------------------------
I: [ddot in serial] time cost is 0.249027 seconds.
resA 67108864.000000
I: [ddot in parallel] time cost is 0.119332 seconds.
resB 67108864.000000
--------------------------------------------------------------------------------
I: [dscal in serial] time cost is 0.244976 seconds.
A 0.500000 0.500000
I: [dscal in parallel] time cost is 0.102829 seconds.
A 0.250000 0.250000
--------------------------------------------------------------------------------
I: [daxpby in serial] time cost is 0.300344 seconds.
A 0.250000 0.250000 C 1.625000 1.625000
I: [daxpby in parallel] time cost is 0.178659 seconds.
A 0.250000 0.250000 C 2.562500 2.562500
--------------------------------------------------------------------------------
I: [dgemv in serial] time cost is 0.315916 seconds.
Y 0.250000 0.250000 DEST 2048.250000 2048.250000
I: [dgemv in parallel] time cost is 0.212380 seconds.
Y 0.250000 0.250000 DEST 2048.250000 2048.250000
I: [dgemv in parallelv2] time cost is 0.205021 seconds.
Y 0.250000 0.250000 DEST 2048.250000 2048.250000
--------------------------------------------------------------------------------
I: [dgemm in serial] time cost is 1.151119 seconds.
X 1.000000 1.000000 Y 1.000000 1.000000 DEST 512.000000 512.000000
I: [dgemm in parallel] time cost is 0.700665 seconds.
X 1.000000 1.000000 Y 1.000000 1.000000 DEST 512.000000 512.000000
--------------------------------------------------------------------------------
I: [All benchmark] time cost is 4.450948 seconds.