Hi all,
Thanks very much for each of your replies! I received feedback from most of the competitors on a private channel and will attempt to summarize my answers in one shot. Please view my attempt here as simply trying to avoid repeating conversations ... and if I still don't tackle a question you asked me feel free to ask again. I always appreciate reminders! :-)
Q: When is the planner submission deadline?
A: Yes, I intended that you would submit on 9 Mar, before the official domain list is released.
Q: How will you run our planner?
Q: (related) Should we expect a single domain model per domain, or one per problem?
A: Most of you were fine with the cloud proposal. I'll run your planner with a script similar to previous competitions. I believe the general format for the script is "plan domain.pddl problem.pddl [dckFile]" More than one of you are concerned about this aspect, and deservedly so. I'll work on this clarification once I secure some funding for the cloud platform.
Q: Where will the learning process be run? By us? By you? On our machines? On the cloud?
A: The learning computations are run by participants on their own computing platforms (I'd encourage you to look at cloud if you haven't...). You'll be expected to submit initial versions of the planner (9 Mar) and a mechanism by which I can verify you didn't change the learning algorithm during the learning phase such that it becomes a "hand tuned" planner. So I should be able to produce the same DCK files at the start and end of the learning phase for any problem in the initial set. I'll clarify the process I intend to use soon.
Q: How many competitors are there?
A: Sorry. I keep spacing to send this out. At the moment, I show 12 planners from 7 teams for the Quality track and 3 planners from 2 teams for the execution track.
Comment: uh, two weeks to run the competition... are you sure?!?
A: Well, I didn't see another way to handle the delay without encroaching on the paper deadlines and remaining fair to your respective efforts. I think it will work out because the could platform allows massively parallel execution.
Comment: No new domains...
A: I was absolutely open to new domains and I did my best to solicit promising applications for this track. Alas, none were submitted. I hesitate to add my own hand-crafted domains to sidestep injecting "my research" or "my view" into what should be a community effort on building benchmark domains. One might argue that the execution track consists of new domains I'm writing, but in fact, I'm carefully constructing it to use "domains" from existing literature on the topic.
Q: Haven't you primed everyone to the specific domains/problems by providing us a list?
A: My apologies. I should have been more clear. The final problem distribution(s) and the final list of domains is not set. I provided what I thought of as a "preliminary" list so that folks could make sure their planner supports key features (e.g., PDDL requirements, kinds of problems, scope). At the least, it was known that the competition would heavily build on previous competitions -- I recall saying this to several participants in Rome. (Sigh, Rome... that was a nice trip!) I think being upfront about this levels the field a bit for anyone not in those conversations. It was my understanding this format was followed in the last learning competition without ill effect. Maybe I misunderstood?
Comment: I already know my planner won't support DomainXYZ because of RequirementABC.
A: Ah, the classic n-armed bandit problem. You could work really hard to fix your planner for DomainXYZ (i.e., explore an arm you think might pay off). Or ... you could take a chance I won't use that domain and focus on the domains you know really well first (i.e., exploit the arm with the highest payoff) and leave a little time to extend into RequirementABC (i.e., explore new arms with a decaying probability). As with most things in life, strategic decisions are challenging. :-)
Q: Okay. I need a list of PDDL requirements you expect our planners to support.
A: A quick answer is the union of the requirements in the example set I gave you. Some problems may hit that union at different levels. I'll make sure to get you the expected list soon. (Keep in mind that you don't necessarily need to cover the union to have an effective approach. I think challenging such assumptions is where scientific thought guides us the most. )
I hope this helps clarify. Please ask more questions!
mak