Asim R P
unread,Dec 1, 2018, 2:37:57 AM12/1/18Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Karen Huddleston, Pengzhou Tang, Oak Barrett, Adam Lee, Ashwin Agrawal, Heikki Linnakangas, Scott Kahler, Robert Eckhardt, gpdb...@greenplum.org
On Fri, Nov 30, 2018 at 12:02 PM Karen Huddleston
<
khudd...@pivotal.io> wrote:
>
> It sounds like the proposal is to disable the use of COPY ON SEGMENT for replicated tables and force them to use regular COPY without ON SEGMENT. I assume this would include both COPY TO ON SEGMENT and COPY FROM ON SEGMENT.
Yes, the proposed change applies to both to and from.
>
> This would definitely impact gpbackup and gprestore. COPY places the resulting data files on the master host. Currently, the architecture for backup is that the schema/metadata files go on the master and all data goes on the segments. This would break that pattern. I am not sure that customers would be ok with us placing data files on the master host, even if they are relatively small because they are replicated tables.
>
Well, the current behavior potentially leads to wrong results. That's
much worse as compared to the inconvenience of placing data files on
master.
Another alternative is to make COPY ON SEGMENT analogous to direct
dispatch when invoked on a replicated table. The command is
dispatched to all segments but QD includes additional information such
as content ID of one segment. Only that segment which is specified by
the QD actually writes data to the file. All other segments create
empty files. The dump thus gets created on segments. COPY FROM
SEGMENT may not even need any change.
Disclaimer: do not expect reason in my loud thinking, especially on a
Friday night!
Asim