Hi Mike,
I've been meaning to reply to your post for a short while, and am surprised nobody else has chipped in here yet, as this is a subject I'm sure many people deploying Cloud Foundry must be familiar (or having difficulty) with by now.
We have 2 vSphere datacenters, and 2-3 clusters in those datacenters, and wish to spread a single Cloud Foundry deployment across that as active/active as possible, using 2 BOSH deployments, and leveraging DEA placement strategy functionality.
So far we have a working CF deployment across these datacenters, with DEA placement strategy working, using mostly spiff generated manifests and a bit of manual manifest hackery on top. We are now looking at testing out spreading job instances across multiple vsphere clusters in a deployment as per
https://www.pivotaltracker.com/s/projects/956238/stories/62927424 which was delivered back in mid-Feb.
This deployment uses a single NATS and HM9000 instances in one of the DCs. Obviously we'd like to run this properly clustered, however it still seems there are issues doing this.
cf-acceptance-tests currently run across this deployment ok.
Unfortunately spiff does not support the concept of multiple deployments of cf-release. For example, the ability to list IPs of etcd's and nats from one or many DCs into other DC manifests. So this introduces an additional layer of YML trickery to get those placed in, which we can script easily - its just an additional layer of surgery on an existing BOSH manifest.
Another thing we're trying to iron out is how to deal with Postgres and whether to use DRBD to replicate its data to a passive DB located in other DCs. This obviously adds more complexity to the deployment though.
I'm very much a supporter of cf-release being a multi BOSH deployment, and also interested to hear what other people are doing :)
Cheers,
Ryan