Benchmark simulation and best score

12 views
Skip to first unread message

Christoffer Fink

unread,
Nov 11, 2022, 11:50:18 PM11/11/22
to OptaPlanner development
Hi!

I am currently writing a master's thesis based on OptaPlanner. I have several planners/configs to compare and many problem instances. So the experiments take a very long time to run. It occurred to me that, if I want to use different termination criteria, I can look at the CSV files that track the best score and "simulate" a run (assuming the new criteria result in a lower max time). In other words, I can ask what would have happened if... This has saved me a huge amount of time! But there is one annoying wrinkle. I realized that the CSV files do not record the full score. The number of uninitialized variables is omitted. Anyway, I have two questions.

1. What's the reasoning behind omitting the init level? (Just curious.)

2. I am considering writing a more robust tool for simulating benchmark runs. (Right now it's a pretty simple and limited Python program.) Is this something that sounds like it would be of interest to anyone else? I'm thinking it would produce a real benchmark report just like the normal benchmarker, and possibly selectively re-run those experiments that cannot be simulated because the original running time was too short. (I suspect I'm not gonna bother if nobody else wants to use it :P) Of course, if anyone else wants to take the idea and run with it, be my guest. ;)

/Christoffer

Lukáš Petrovický

unread,
Nov 21, 2022, 2:42:07 AM11/21/22
to optapla...@googlegroups.com
Christoffer,

On Sat, Nov 12, 2022 at 5:50 AM 'Christoffer Fink' via OptaPlanner development <optapla...@googlegroups.com> wrote:
1. What's the reasoning behind omitting the init level? (Just curious.)

At the moment, I am not sure why we're omitting those. That said, I want to discourage you from using these CSVs - they were never meant to be an API surface, and we will change their format if we need to.
 

2. I am considering writing a more robust tool for simulating benchmark runs. (Right now it's a pretty simple and limited Python program.) Is this something that sounds like it would be of interest to anyone else?

I'd be interested to see where you get with that. Sounds like a cool experiment.

Regards!
 
--

Lukáš Petrovický

OptaPlanner Project Lead

lukas.pe...@redhat.com

My work week is Monday to Thursday.
No need to respond outside of your working hours.
Reply all
Reply to author
Forward
0 new messages