Initial worker pods

Shafi Rasulov

unread,

Nov 17, 2020, 2:46:41 PM11/17/20

to MR3

Hi,

When a new cluster is created, or for whatever reason master pod restarted worker pods don't become available. So, the first query always takes few minutes because master tries to create worker pod, and if there are not any available worker nodes then new worker nodes are provisioned. That cause inconsistent performance.

Is it possible to configure the master to always have minimum number of worker pods? I know it is possible to do it in a active cluster. But, the question is about initial cluster state and the cases when master restarted.

Best,

Shafi

Sungwoo Park

unread,

Nov 17, 2020, 8:00:21 PM11/17/20

to MR3

Hi Shafi,

This is because until it receives the first query from Hive, MR3 does not know what containers to create. The first query specifies the memory size of containers, DaemonTasks in each container (such as MR3 shuffle handler and LLAP I/O daemon), whether containers should be reused across queries, and so on. Moreover those containers created for the first query are not reused for the next query if cross-DAG container reuse is disabled. So, DAGAppMaster should wait for the first query in order to decide what containers to create. In essence, as an execution engine, MR3 does not know anything about its applications. (For example, we are currently looking into running both Hive and Spark with a common MR3 DAGAppMaster).

Here is my initial thought on your use case (where I assume autoscaling is enabled). To solve your issue,

1) MR3 should be extended so that the first query would create a fixed number of containers right away, instead of creating the first few containers incrementally.

2) At the same time, your application could submit a dummy query in order to let MR3 know what containers to create.

Would this approach solve your issue?

Cheers,

--- Sungwoo

Sungwoo Park

unread,

Jan 28, 2021, 4:48:37 AM1/28/21

to MR3

In MR3 1.3, we will introduce `mr3.auto.scale.out.num.initial.containers` which specifies the number of new ContainerWorkers to create in a scale-out operation when no ContainerWorkers are running.

--- Sungwoo

Reply all

Reply to author

Forward