kubernetes spark CNI and data locality

vincent gromakowski

unread,

Jan 12, 2017, 12:02:51 PM1/12/17

to Kubernetes user discussion and Q&A

Hi all,

Does anyone have experience running Spark with CNI on Kubernetes and benefit from data locality with backend nodes like HDFS or Cassandra ? Is there any mechanism in Kubernetes to colocate containers ?

Tx

Vincent

vincent gromakowski

unread,

Jan 14, 2017, 6:08:48 AM1/14/17

to Kubernetes user discussion and Q&A

Some additional information on this issue:

- Kubernetes allows pod-to-pod affinity so it's easy to launch Spark executors on the same node as a backend (for instance Cassandra nodes) based on the Cassandra pods label

- For each tasks, Spark will try to schedule it on the executor with the best data locality as possible based on the hostname of the executors pods and the hostname of Cassandra pods

- Because Spark pods and Cassandra pods advertise different hostnames, Spark doesn't know which hostnames are physically identical. It can result in complete shuffle even executors and cassandra are colocated

Resolving this issue would mean to get the physical information (nodes, racks, DC) and use it in Spark and Cassandra advertising/scheduling.

David Oppenheimer

unread,

Jan 14, 2017, 3:56:12 PM1/14/17

to Kubernetes user discussion and Q&A

I don't know much about Spark, so just trying to understand your requirements.

If all you care about is making sure executors get scheduled to *some* node that has a Cassandra pod, then the existing pod affinity feature is sufficient. Put label "cassandra" on the Cassandra pods, and pod affinity for "cassandra" on the executor pods and you're done. Pod affinity works off of labels, not hostnames.

If you're looking for something more sophisticated than that, can you explain in more detail?

--
You received this message because you are subscribed to the Google Groups "Kubernetes user discussion and Q&A" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kubernetes-users+unsubscribe@googlegroups.com.
To post to this group, send email to kubernetes-users@googlegroups.com.
Visit this group at https://groups.google.com/group/kubernetes-users.
For more options, visit https://groups.google.com/d/optout.

David Oppenheimer

unread,

Jan 14, 2017, 7:06:31 PM1/14/17

to Kubernetes user discussion and Q&A

Looks like you also posted to this issue

https://github.com/kubernetes/kubernetes/issues/34377#issuecomment-272659800

which is probably a better place to discuss.

vincent gromakowski

unread,

Jan 16, 2017, 4:40:15 AM1/16/17

to Kubernetes user discussion and Q&A

Pod-to-pod affinity is mandatory to launch spark executors colocated with cassandra nodes but is not enough. Spark executors aren't transient and the driver reuse the same executors to chain tasks scheduling. This scheduling is done on hostname basis to try to colocate spark tasks and cassandra nodes. So even if we have pod-to-pod affinity, we may have another task distribution that may lead to no colocality

Le dimanche 15 janvier 2017 01:06:31 UTC+1, David Oppenheimer a écrit :

Looks like you also posted to this issue
https://github.com/kubernetes/kubernetes/issues/34377#issuecomment-272659800
which is probably a better place to discuss.

On Sat, Jan 14, 2017 at 12:56 PM, David Oppenheimer <davi...@google.com> wrote:

I don't know much about Spark, so just trying to understand your requirements.

If all you care about is making sure executors get scheduled to *some* node that has a Cassandra pod, then the existing pod affinity feature is sufficient. Put label "cassandra" on the Cassandra pods, and pod affinity for "cassandra" on the executor pods and you're done. Pod affinity works off of labels, not hostnames.

If you're looking for something more sophisticated than that, can you explain in more detail?

On Sat, Jan 14, 2017 at 3:08 AM, vincent gromakowski <vincent.g...@gmail.com> wrote:

Some additional information on this issue:
- Kubernetes allows pod-to-pod affinity so it's easy to launch Spark executors on the same node as a backend (for instance Cassandra nodes) based on the Cassandra pods label
- For each tasks, Spark will try to schedule it on the executor with the best data locality as possible based on the hostname of the executors pods and the hostname of Cassandra pods
- Because Spark pods and Cassandra pods advertise different hostnames, Spark doesn't know which hostnames are physically identical. It can result in complete shuffle even executors and cassandra are colocated

Resolving this issue would mean to get the physical information (nodes, racks, DC) and use it in Spark and Cassandra advertising/scheduling.

Le jeudi 12 janvier 2017 18:02:51 UTC+1, vincent gromakowski a écrit :
Hi all,
Does anyone have experience running Spark with CNI on Kubernetes and benefit from data locality with backend nodes like HDFS or Cassandra ? Is there any mechanism in Kubernetes to colocate containers ?
Tx

Vincent

--
You received this message because you are subscribed to the Google Groups "Kubernetes user discussion and Q&A" group.

To unsubscribe from this group and stop receiving emails from it, send an email to kubernetes-use...@googlegroups.com.
To post to this group, send email to kubernet...@googlegroups.com.

Reply all

Reply to author

Forward