From an operations perspective what is the overhead of having 100s of
database vs 100s of collections especially in a subscription based
model like SalesForce.com or even Jira.
While I understand this is a VERY general question any blog posts or
shared experiences would be very beneficial.
Off the top of my head I can see that there are some benefits of
having 100s of database
- One you can drop a database which represents a customer without
impacting other databases or having to run a compaction on the
database.
- Statistics per database. I guess this can really be a pro and con,
but it seems that statistics per customer or database would give you a
better perspecitive into that customer's use of the application but it
can also create a management nighmare for the operations team if they
wish to monitor databases as this leve.
- What impact does multiple database have on sharding vs collections?
I know companies such as
cragislist.com have turned to mongodb for
archiving of their posts, (not sure if people can release this info)
but how does
craigslist.com decide to split databases vs collections?
Is there a database per city? Per category? One big huge database with
collections split by category or city (since I assume all categories
are the same across cities?).
Any info that can be presented would be helpful.