The "demo_MCORE_ballsSMC" with Multicore module has no acceleration effect

henry zhang

Jan 15, 2026, 6:38:31 AM
to ProjectChrono
I increased the number of balls in the "demo_MCORE_ballsSMC" demo and, as the title says, the Multicore module provides no speedup.

Attachments:
PC1 Snipaste_2026-01-15_18-22-44.png
PC1 Snipaste_2026-01-15_18-20-49.png

Radu Serban

Jan 25, 2026, 11:29:44 AM
to ProjectChrono

I’m not exactly sure how you are measuring multicore performance. I suggest you run the benchmark test provided precisely for this purpose in the Chrono distribution.

Make sure to enable ‘BUILD_BENCHMARKING’ during CMake configuration, then rebuild and run btest_MCORE_settling.

See below what I get when running this benchmark test on my two machines.

--Radu

 

 

Windows 11 (Ryzen 9 5950X 16-core, 32GB RAM):

 

2026-01-25T17:12:22+01:00

Running E:\Build\chrono\bin\Release\btest_MCORE_settling.exe

Run on (32 X 3394 MHz CPU s)

CPU Caches:

  L1 Data 32 KiB (x16)

  L1 Instruction 32 KiB (x16)

  L2 Unified 512 KiB (x16)

  L3 Unified 32768 KiB (x2)

Simulated 48930 particles using 1 threads.

-------------------------------------------------------------------------------------------------

Benchmark                                                 Time   CPU        Iterations UserCounters...

-------------------------------------------------------------------------------------------------

TEST_NAME/Settle/1/iterations:1/repeats:1/real_time   22294 ms   22141 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0286 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.85529k Step_Total=22.2913k Step_Update=18.4354k

Simulated 48930 particles using 2 threads.

TEST_NAME/Settle/2/iterations:1/repeats:1/real_time   13330 ms   13281 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0344 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.62218k Step_Total=13.3277k Step_Update=9.70482k

Simulated 48930 particles using 3 threads.

TEST_NAME/Settle/3/iterations:1/repeats:1/real_time   10563 ms   10500 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0311 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.59963k Step_Total=10.5611k Step_Update=6.96091k

Simulated 48930 particles using 4 threads.

TEST_NAME/Settle/4/iterations:1/repeats:1/real_time    9315 ms    9297 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0322 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.6188k Step_Total=9.3133k Step_Update=5.69388k

Simulated 48930 particles using 5 threads.

TEST_NAME/Settle/5/iterations:1/repeats:1/real_time    8534 ms    8516 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0367 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.60529k Step_Total=8.53153k Step_Update=4.92561k

Simulated 48930 particles using 6 threads.

TEST_NAME/Settle/6/iterations:1/repeats:1/real_time    7988 ms    7953 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0335 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.56171k Step_Total=7.98622k Step_Update=4.42387k

Simulated 48930 particles using 7 threads.

TEST_NAME/Settle/7/iterations:1/repeats:1/real_time    7581 ms    7547 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.03 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.52162k Step_Total=7.57929k Step_Update=4.05705k

Simulated 48930 particles using 8 threads.

TEST_NAME/Settle/8/iterations:1/repeats:1/real_time    7312 ms    7266 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0297 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.5103k Step_Total=7.30995k Step_Update=3.79902k

Simulated 48930 particles using 9 threads.

TEST_NAME/Settle/9/iterations:1/repeats:1/real_time    7013 ms    6984 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0335 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.74403k Step_Total=7.011k Step_Update=3.26647k

Simulated 48930 particles using 10 threads.

TEST_NAME/Settle/10/iterations:1/repeats:1/real_time   6784 ms    6750 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0414 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.93555k Step_Total=6.7829k Step_Update=2.8468k

Simulated 48930 particles using 11 threads.

TEST_NAME/Settle/11/iterations:1/repeats:1/real_time   6489 ms    6484 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0306 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.84956k Step_Total=6.4865k Step_Update=2.63633k

Simulated 48930 particles using 12 threads.

TEST_NAME/Settle/12/iterations:1/repeats:1/real_time   6347 ms    6344 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0255 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.80542k Step_Total=6.34536k Step_Update=2.53932k

Simulated 48930 particles using 13 threads.

TEST_NAME/Settle/13/iterations:1/repeats:1/real_time   6364 ms    6359 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0302 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.95381k Step_Total=6.36169k Step_Update=2.40734k

Simulated 48930 particles using 14 threads.

TEST_NAME/Settle/14/iterations:1/repeats:1/real_time   6339 ms    6312 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0297 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=3.96912k Step_Total=6.33732k Step_Update=2.36758k

Simulated 48930 particles using 15 threads.

TEST_NAME/Settle/15/iterations:1/repeats:1/real_time   7321 ms    7328 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.0282 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=4.26353k Step_Total=7.31823k Step_Update=3.05394k

Simulated 48930 particles using 16 threads.

TEST_NAME/Settle/16/iterations:1/repeats:1/real_time   7463 ms    7438 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.035 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=4.33763k Step_Total=7.46085k Step_Update=3.12245k
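
For readers who want to turn these timings into scaling figures, here is a minimal Python sketch. It assumes the real_time column (in ms) is the right quantity to compare, and it uses the usual definitions: speedup is the 1-thread time divided by the n-thread time, and efficiency is speedup divided by n. The names REAL_TIME_MS and scaling_table are illustrative only and not part of Chrono.

```python
# Wall-clock times (ms) copied from the real_time "Time" column of the
# Windows run above, in order of thread count 1..16.
REAL_TIME_MS = [22294, 13330, 10563, 9315, 8534, 7988, 7581, 7312,
                7013, 6784, 6489, 6347, 6364, 6339, 7321, 7463]


def scaling_table(times_ms):
    """Return (threads, speedup, efficiency) tuples relative to the 1-thread run."""
    t1 = times_ms[0]
    table = []
    for n, t in enumerate(times_ms, start=1):
        speedup = t1 / t                      # how many times faster than 1 thread
        table.append((n, speedup, speedup / n))  # efficiency = speedup per thread
    return table


table = scaling_table(REAL_TIME_MS)
# Pick the row with the largest speedup.
best_threads, best_speedup, best_eff = max(table, key=lambda row: row[1])

for n, s, e in table:
    print(f"{n:2d} threads: speedup {s:4.2f}, efficiency {e:4.0%}")
print(f"best: {best_threads} threads, {best_speedup:.2f}x")
```

On the numbers above, the best case is 14 threads at about 3.5x (roughly 25% efficiency), and the run time rises again at 15 and 16 threads.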

 

Ubuntu 24.04 (Ryzen 9 8945HS, 8 cores / 16 threads, 32GB RAM):

 

2026-01-25T17:11:25+01:00

Running ./btest_MCORE_settling

Run on (16 X 2216.31 MHz CPU s)

CPU Caches:

  L1 Data 32 KiB (x8)

  L1 Instruction 32 KiB (x8)

  L2 Unified 1024 KiB (x8)

  L3 Unified 16384 KiB (x1)

Load Average: 0.51, 2.75, 4.23

Simulated 49073 particles using 1 threads.

------------------------------------------------------------------------------------------------------

Benchmark                                                 Time        CPU   Iterations UserCounters...

------------------------------------------------------------------------------------------------------

TEST_NAME/Settle/1/iterations:1/repeats:1/real_time   10710 ms   10706 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.085634 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.2904k Step_Total=10.7074k Step_Update=9.41615k

Simulated 49073 particles using 2 threads.

TEST_NAME/Settle/2/iterations:1/repeats:1/real_time    6057 ms    6055 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.07524 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.06143k Step_Total=6.05489k Step_Update=4.99256k

Simulated 49073 particles using 3 threads.

TEST_NAME/Settle/3/iterations:1/repeats:1/real_time    4951 ms    4949 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.055487 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.04459k Step_Total=4.94862k Step_Update=3.90306k

Simulated 49073 particles using 4 threads.

TEST_NAME/Settle/4/iterations:1/repeats:1/real_time    4371 ms    4371 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.06615 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=906.75 Step_Total=4.36818k Step_Update=3.46046k

Simulated 49073 particles using 5 threads.

TEST_NAME/Settle/5/iterations:1/repeats:1/real_time    3995 ms    3994 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.075298 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.01361k Step_Total=3.99224k Step_Update=2.97771k

Simulated 49073 particles using 6 threads.

TEST_NAME/Settle/6/iterations:1/repeats:1/real_time    3684 ms    3684 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.066199 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=901.109 Step_Total=3.68169k Step_Update=2.77968k

Simulated 49073 particles using 7 threads.

TEST_NAME/Settle/7/iterations:1/repeats:1/real_time    3528 ms    3528 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.076243 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.02457k Step_Total=3.52518k Step_Update=2.49969k

Simulated 49073 particles using 8 threads.

TEST_NAME/Settle/8/iterations:1/repeats:1/real_time    3373 ms    3373 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.070844 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.01938k Step_Total=3.37052k Step_Update=2.35017k

Simulated 49073 particles using 9 threads.

TEST_NAME/Settle/9/iterations:1/repeats:1/real_time    3690 ms    3687 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.126123 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=855.927 Step_Total=3.68635k Step_Update=2.82932k

Simulated 49073 particles using 10 threads.

TEST_NAME/Settle/10/iterations:1/repeats:1/real_time   3562 ms    3558 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.121574 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=858.595 Step_Total=3.55823k Step_Update=2.69864k

Simulated 49073 particles using 11 threads.

TEST_NAME/Settle/11/iterations:1/repeats:1/real_time   3599 ms    3596 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.127541 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.03347k Step_Total=3.59571k Step_Update=2.56116k

Simulated 49073 particles using 12 threads.

TEST_NAME/Settle/12/iterations:1/repeats:1/real_time   3632 ms    3630 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.130271 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.0425k Step_Total=3.62872k Step_Update=2.58515k

Simulated 49073 particles using 13 threads.

TEST_NAME/Settle/13/iterations:1/repeats:1/real_time   3474 ms    3474 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.10314 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=929.634 Step_Total=3.47033k Step_Update=2.5396k

Simulated 49073 particles using 14 threads.

TEST_NAME/Settle/14/iterations:1/repeats:1/real_time   3586 ms    3586 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.127894 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.06294k Step_Total=3.58277k Step_Update=2.51871k

Simulated 49073 particles using 15 threads.

TEST_NAME/Settle/15/iterations:1/repeats:1/real_time   3314 ms    3314 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.134358 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=921.601 Step_Total=3.31108k Step_Update=2.38844k

Simulated 49073 particles using 16 threads.

TEST_NAME/Settle/16/iterations:1/repeats:1/real_time   3388 ms    3387 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.120701 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.00086k Step_Total=3.38485k Step_Update=2.38288k
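
The per-phase counters in these rows show which part of the step actually scales. The sketch below is a hedged illustration in Python: it parses three rows copied verbatim from the Ubuntu run and compares Step_Advance and Step_Update. It assumes the counters are in milliseconds and that a trailing "k" means x1000, which is inferred from Step_Total matching the Time column. The names ROW, COUNTER, SCALE and parse are hypothetical helpers, not part of Chrono or Google Benchmark.

```python
import re

# Three rows copied verbatim from the Ubuntu run above (1, 8 and 16 threads).
LOG = """\
TEST_NAME/Settle/1/iterations:1/repeats:1/real_time   10710 ms   10706 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.085634 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.2904k Step_Total=10.7074k Step_Update=9.41615k
TEST_NAME/Settle/8/iterations:1/repeats:1/real_time    3373 ms    3373 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.070844 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.01938k Step_Total=3.37052k Step_Update=2.35017k
TEST_NAME/Settle/16/iterations:1/repeats:1/real_time   3388 ms    3387 ms   1 CD_Broad=0 CD_Narrow=0 CD_Total=0.120701 LS_Jacobian=0 LS_Setup=0 LS_Solve=0 Step Setup=0 Step_Advance=1.00086k Step_Total=3.38485k Step_Update=2.38288k
"""

# Thread count from the benchmark name, then the Time and CPU columns.
ROW = re.compile(r"/Settle/(\d+)/\S*\s+(\d+) ms\s+(\d+) ms")
# name=value pairs; only the "k" and "M" suffixes are handled (assumption).
# Note: "Step Setup=0" contains a space in the log, so it is stored as "Setup".
COUNTER = re.compile(r"(\w+)=([\d.]+)([kM]?)")
SCALE = {"": 1.0, "k": 1e3, "M": 1e6}


def parse(log):
    """Return one dict per benchmark row with thread count, real time and counters."""
    rows = []
    for line in log.splitlines():
        m = ROW.search(line)
        if not m:
            continue  # skip headers, separators and "Simulated ..." lines
        row = {"threads": int(m.group(1)), "real_ms": int(m.group(2))}
        for name, value, suffix in COUNTER.findall(line):
            row[name] = float(value) * SCALE[suffix]
        rows.append(row)
    return rows


rows = parse(LOG)
base = rows[0]  # the 1-thread run is the reference
for r in rows:
    share = r["Step_Advance"] / r["Step_Total"]
    update_speedup = base["Step_Update"] / r["Step_Update"]
    print(f'{r["threads"]:2d} threads: Step_Advance is {share:.0%} of Step_Total, '
          f'Step_Update speedup {update_speedup:.2f}x')
```

For these three rows, Step_Update gets about 4x faster between 1 and 8 threads while Step_Advance stays near 1 s, so its share of Step_Total grows from about 12% to about 30%. That suggests, although these logs alone do not prove it, that the Step_Advance phase is what limits overall scaling here.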

 

 

 
