hundreds of schema vs hundreds of databases

olivier

unread,

May 28, 2007, 2:01:22 PM5/28/07

to

Hi group,

I have an application with some hundreds users, each one having the same
data definitions, and each one storing up to 2 GB of data.
A user have just access to his own data. His data will have its own
tablespace.

Therefore, it seems to me I have a choice between "one database per
user" and "one schema per user in the same database".

What is the best practice here ? Which solution will be the easiest to
manage ?

Cheers,

Olivier

Message has been deleted

Albe Laurenz

unread,

May 29, 2007, 5:02:13 AM5/29/07

to

Advantages of many databases:
- Each database is smaller.
- No danger of one user accessing another user's data (because of
misconfigured permissions and similar).
- Guaranteed independence of each user's data.
- More scalable: If you decide that one machine or one cluster
is not enough to handle the load, you can easily transfer some
of the databases somewhere else.

Advantages of one database with many schemata:
- Fewer databases to administrate.

I'd probably go for many databases.

Yours,
Laurenz Albe

---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faq

Merlin Moncure

unread,

May 29, 2007, 9:39:04 AM5/29/07

to

On 5/29/07, Albe Laurenz <a...@adv.magwien.gv.at> wrote:
> > I have an application with some hundreds users, each one
> > having the same
> > data definitions, and each one storing up to 2 GB of data.
> > A user have just access to his own data. His data will have its own
> > tablespace.
> >
> > Therefore, it seems to me I have a choice between "one database per
> > user" and "one schema per user in the same database".
> >
> > What is the best practice here ? Which solution will be the
> > easiest to manage ?
>
> Advantages of many databases:
> - Each database is smaller.
> - No danger of one user accessing another user's data (because of
> misconfigured permissions and similar).
> - Guaranteed independence of each user's data.
> - More scalable: If you decide that one machine or one cluster
> is not enough to handle the load, you can easily transfer some
> of the databases somewhere else.
>
> Advantages of one database with many schemata:
> - Fewer databases to administrate.
>
> I'd probably go for many databases.

you missed one possible advantage of schemas...database structures can
be more easily shared. For example, you can join one of the user's
private tables with a shared central table. With multiple databases,
you have to resort to other strategies to do that, for example dblink.

Schemas are designed to the effect of giving a private data area in a
large shared database. Separate databases would be preferred if the
databases are backing difrferent applications and completely
unrelated.

merlin

---------------------------(end of broadcast)---------------------------
TIP 9: In versions below 8.0, the planner will ignore your desire to
choose an index scan if your joining column's datatypes do not
match

Ron Johnson

unread,

May 29, 2007, 10:59:30 AM5/29/07

to

On 05/29/07 04:02, Albe Laurenz wrote:
>> I have an application with some hundreds users, each one
>> having the same
>> data definitions, and each one storing up to 2 GB of data.
>> A user have just access to his own data. His data will have its own
>> tablespace.
>>
>> Therefore, it seems to me I have a choice between "one database per
>> user" and "one schema per user in the same database".
>>
>> What is the best practice here ? Which solution will be the
>> easiest to manage ?
>
> Advantages of many databases:
> - Each database is smaller.
> - No danger of one user accessing another user's data (because of
> misconfigured permissions and similar).
> - Guaranteed independence of each user's data.
> - More scalable: If you decide that one machine or one cluster

You could always dump a schema then drop it and restore it in a new
database. At 2GB, that should be quick.

> is not enough to handle the load, you can easily transfer some
> of the databases somewhere else.
>
> Advantages of one database with many schemata:
> - Fewer databases to administrate.

But since they all have to have the same schema, you'd still have
the same DDL overhead whether it's one DB or many.

Does PG set up buffers at the postmaster level or the database level?

If at the database level, then you'll be allocating memory to
databases that might not be in use at any one time, thus wasting it.
One database buffer pool would make more efficient use of RAM.

--
Ron Johnson, Jr.
Jefferson LA USA

Give a man a fish, and he eats for a day.
Hit him with a fish, and he goes away for good!

Guy Rouillier

unread,

May 29, 2007, 1:59:00 PM5/29/07

to

Albe Laurenz wrote:
>
> Advantages of many databases:
> - Each database is smaller.
> - No danger of one user accessing another user's data (because of
> misconfigured permissions and similar).
> - Guaranteed independence of each user's data.
> - More scalable: If you decide that one machine or one cluster
> is not enough to handle the load, you can easily transfer some
> of the databases somewhere else.
>
> Advantages of one database with many schemata:
> - Fewer databases to administrate.

Using different databases for each user incurs the full overhead of
creating and maintaining a database: all the system tables and all the
memory required to keep a database open. If the OP is allowing direct
SQL access to each user, then the risks you identify above must be
addressed, but tbey can fairly simply by using scripts to create each
new user. I'd opt for using schemas unless there is a compelling
evidence that different databases are required.

--
Guy Rouillier

Albe Laurenz

unread,

May 30, 2007, 2:38:35 AM5/30/07

to

Ron Johnson wrote:
> Does PG set up buffers at the postmaster level or the database level?
>
> If at the database level, then you'll be allocating memory to
> databases that might not be in use at any one time, thus wasting it.
> One database buffer pool would make more efficient use of RAM.

Shared memory is allocated at the cluster level.
See
http://www.postgresql.org/docs/current/static/runtime-config-resource.ht
ml#RUNTIME-CONFIG-RESOURCE-MEMORY

Yours,
Laurenz Albe

---------------------------(end of broadcast)---------------------------

Ron Johnson

unread,

May 30, 2007, 3:07:38 AM5/30/07

to

On 05/30/07 01:38, Albe Laurenz wrote:
> Ron Johnson wrote:
>> Does PG set up buffers at the postmaster level or the database level?
>>
>> If at the database level, then you'll be allocating memory to
>> databases that might not be in use at any one time, thus wasting it.
>> One database buffer pool would make more efficient use of RAM.
>
> Shared memory is allocated at the cluster level.
> See
> http://www.postgresql.org/docs/current/static/runtime-config-resource.ht
> ml#RUNTIME-CONFIG-RESOURCE-MEMORY

I read that page, but don't see any references to "cluster level".
Maybe I am misinterpreting "cluster"?

--
Ron Johnson, Jr.
Jefferson LA USA

Give a man a fish, and he eats for a day.
Hit him with a fish, and he goes away for good!

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster

Merlin Moncure

unread,

May 30, 2007, 1:31:00 PM5/30/07

to

On 5/30/07, Ron Johnson <ron.l....@cox.net> wrote:
> On 05/30/07 01:38, Albe Laurenz wrote:
> > Ron Johnson wrote:
> >> Does PG set up buffers at the postmaster level or the database level?
> >>
> >> If at the database level, then you'll be allocating memory to
> >> databases that might not be in use at any one time, thus wasting it.
> >> One database buffer pool would make more efficient use of RAM.
> >
> > Shared memory is allocated at the cluster level.
> > See
> > http://www.postgresql.org/docs/current/static/runtime-config-resource.ht
> > ml#RUNTIME-CONFIG-RESOURCE-MEMORY
>
> I read that page, but don't see any references to "cluster level".
> Maybe I am misinterpreting "cluster"?

Meaning the database cluster:
http://www.postgresql.org/docs/8.2/static/creating-cluster.html.

I can understand your confusion though.

merlin

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

http://archives.postgresql.org/