Non-MD5 etag ("ba1264cfb028c7cb60d6e98e0da11c15-1") present for key <Key: gs_photos,001ef37b27e845af0/216cb6b9bd729ec.jpg>, data integrity checks are not possible.
# gsutil --version
gsutil version: 4.26
I cannot submit a support ticket because the Google support form accepts only up to 1000 characters, so I cannot explain everything in one ticket, which is weird. :/

My goal is to copy only new or modified files from an AWS S3 bucket to a GCP bucket, so that I do not pay again and again to transfer all the data from AWS to GCP.

It seems that if I repeat the same copy from the AWS S3 bucket to GCP using Google Cloud Storage Transfer Service, it copies only the new changes. Can anyone from Google please confirm that this is the right way to do it?

Sorry, but I could not find the answer to this question in the GCP documentation.

Thanks
Cloud Storage Transfer Service has options that make data transfers and synchronization between data sources and data sinks easier. For example, you can:
- Schedule one-time transfers or recurring transfers.
- Delete existing objects in the destination bucket if they don't have a corresponding object in the source.
- Delete source objects after transferring them.
- Schedule periodic synchronization from data source to data sink with advanced filters based on file creation dates, file-name filters, and the times of day you prefer to import data.
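
The options listed above correspond to fields in the body of a Storage Transfer Service `transferJobs.create` request. The following is a minimal sketch, not a definitive implementation: the project ID, bucket names, prefix, and credentials are hypothetical placeholders, and the function only builds the request body without calling the API.

```python
import datetime


def build_transfer_job(project_id, s3_bucket, gcs_bucket, start_date,
                       aws_access_key_id, aws_secret_access_key):
    """Build a request body for a scheduled S3 -> GCS transfer job."""
    return {
        "description": "Sync S3 bucket to GCS bucket",
        "status": "ENABLED",
        "projectId": project_id,
        "schedule": {
            # scheduleEndDate is left out here with the intent of a
            # recurring job; check the API reference for exact semantics.
            "scheduleStartDate": {
                "year": start_date.year,
                "month": start_date.month,
                "day": start_date.day,
            },
            "startTimeOfDay": {"hours": 2, "minutes": 0},
        },
        "transferSpec": {
            "awsS3DataSource": {
                "bucketName": s3_bucket,
                "awsAccessKey": {
                    "accessKeyId": aws_access_key_id,
                    "secretAccessKey": aws_secret_access_key,
                },
            },
            "gcsDataSink": {"bucketName": gcs_bucket},
            # File-name filter: only objects under this (assumed) prefix.
            "objectConditions": {"includePrefixes": ["photos/"]},
            "transferOptions": {
                # Keep extra objects in the sink and keep the source intact.
                "deleteObjectsUniqueInSink": False,
                "deleteObjectsFromSourceAfterTransfer": False,
                "overwriteObjectsAlreadyExistingInSink": False,
            },
        },
    }


job = build_transfer_job(
    "my-project", "my-s3-bucket", "my-gcs-bucket",
    datetime.date(2017, 7, 5), "AWS_KEY_ID", "AWS_SECRET",
)
```

The resulting dictionary could be passed as the request body to whichever client is used to call the API; both delete options are set to `False` so that nothing is removed on either side.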
On Wednesday, July 5, 2017 at 2:51:38 PM UTC+3, Liran Gabay wrote:

From what I understand, that warning means there is no way to ensure that the data downloaded from S3 is actually the same as the data stored on GCP, and S3 does not provide an MD5 or other standard hash to check against.

So there is no real way around this?

Thanks
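
The ETag in the warning ends in `-1`, which is the form S3 uses for objects uploaded with multipart upload: such an ETag is not the MD5 of the object content and carries a `-<part count>` suffix. A small sketch of how a client might tell the two forms apart follows. The function name is hypothetical and not part of gsutil, and a plain 32-character hex ETag is only likely, not guaranteed, to be an MD5.

```python
import re

# 32 lowercase hex digits is the shape of an MD5 digest.
_MD5_HEX = re.compile(r"[0-9a-f]{32}")


def etag_looks_like_md5(etag):
    """Return True if the ETag has the shape of a plain MD5 hex digest."""
    return _MD5_HEX.fullmatch(etag.strip('"').lower()) is not None


# ETag from the warning above: multipart-style suffix, so not an MD5.
print(etag_looks_like_md5('"ba1264cfb028c7cb60d6e98e0da11c15-1"'))  # False
# The same digits without the suffix would pass the shape check.
print(etag_looks_like_md5("ba1264cfb028c7cb60d6e98e0da11c15"))  # True
```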
By default, Cloud Storage Transfer Service copies a file from the data source if the file doesn't exist in the data sink or if it differs between the version in the source and the sink. The default is also to retain files in the source after the transfer.
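
The default behavior described above can be illustrated with a small sketch. It is not how the service is implemented. It assumes each bucket listing is a dictionary mapping object name to a comparable fingerprint (here a hypothetical `(size, checksum)` tuple); the exact attributes the service compares are not stated in this thread.

```python
def objects_to_copy(source, sink):
    """Return names of objects that are missing from the sink or differ."""
    return sorted(
        name for name, fingerprint in source.items()
        # sink.get() returns None for a missing object, which never
        # equals a real fingerprint, so missing objects are selected too.
        if sink.get(name) != fingerprint
    )


source = {"a.jpg": (10, "x"), "b.jpg": (20, "y"), "c.jpg": (30, "z")}
sink = {"a.jpg": (10, "x"), "b.jpg": (20, "old")}

print(objects_to_copy(source, sink))  # ['b.jpg', 'c.jpg']
```

Under this model, running the same transfer again after a successful copy selects nothing, because every source object then matches its counterpart in the sink.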