Richard Watson
Sep 22, 2009, 3:22:22 AM
to Google App Engine
Hi there,
I know the GAE team is exceptionally competent and committed to the
reliability of the platform, so this message is only to understand the
"why" a bit better.
Obviously reliability will increase over time. I assume the end
result will be more reliable than most alternatives, but it'd be nice
to get more insight into the challenges you're facing. I do read the
mail and blog posts you put up - much appreciated.
Questions I can think of now, maybe others have more/better questions:
- Why are there so many system-wide failures? Are they
single-point-of-failure in nature, or do they emerge from the overall
complexity?
- Is there no way to prevent an entire datacenter from becoming
unhealthy?
- If not, does App Engine need dedicated datacenters, or
could it e.g. run on fewer machines inside datacenters shared with
other services? I would imagine the latter would be quicker to move
(fewer resources) and could perhaps be located closer to users.
- What are your reliability goals?
Regards,
Richard