Hi Paul,
Your initial hunch was correct: it was failing because the modification times did not match. I found that the property setting you suggested was not getting merged into the configuration settings because its key was not included in the list of configs to be merged. After making the change below, it seems to be working perfectly :) Note that I have only tried batch replication so far, not incremental, so I don't know whether there's a similar case there, but at least I'll know what to look for.
Thanks again for your help and for sharing this tool!
Best regards,
Joe
diff --git a/main/src/main/java/com/airbnb/reair/batch/hive/MetastoreReplicationJob.java b/main/src/main/java/com/airbnb/reair/batch/hive/MetastoreReplicationJob.java
index 228aff9..49da5ad 100644
--- a/main/src/main/java/com/airbnb/reair/batch/hive/MetastoreReplicationJob.java
+++ b/main/src/main/java/com/airbnb/reair/batch/hive/MetastoreReplicationJob.java
@@ -323,6 +323,7 @@ public class MetastoreReplicationJob extends Configured implements Tool {
ConfigurationKeys.BATCH_JOB_INPUT_LIST,
ConfigurationKeys.BATCH_JOB_METASTORE_PARALLELISM,
ConfigurationKeys.BATCH_JOB_COPY_PARALLELISM,
+ ConfigurationKeys.SYNC_MODIFIED_TIMES_FOR_FILE_COPY,
MRJobConfig.MAP_SPECULATIVE,
MRJobConfig.REDUCE_SPECULATIVE
);
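
For anyone hitting the same symptom, here is a minimal sketch of why a setting can be silently ignored when configs are merged from an explicit list of keys. It is not the actual reair code: plain maps stand in for the Hadoop Configuration objects, and the class name, method name, and key strings are all hypothetical placeholders.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: plain maps stand in for Hadoop Configuration objects,
// and every key name below is a hypothetical placeholder.
class ConfigMergeSketch {

  // Copies only the listed keys from source into a new map.
  // A key that is missing from allowedKeys is dropped without any error.
  static Map<String, String> mergeAllowedKeys(
      Map<String, String> source, List<String> allowedKeys) {
    Map<String, String> merged = new HashMap<>();
    for (String key : allowedKeys) {
      String value = source.get(key);
      if (value != null) {
        merged.put(key, value);
      }
    }
    return merged;
  }

  public static void main(String[] args) {
    Map<String, String> userSettings = new HashMap<>();
    userSettings.put("example.copy.parallelism", "10");
    userSettings.put("example.sync.modified.times", "true");

    // Key list before the fix: the sync setting is not listed.
    List<String> before = Arrays.asList("example.copy.parallelism");
    // Key list after the fix: the sync setting is listed.
    List<String> after =
        Arrays.asList("example.copy.parallelism", "example.sync.modified.times");

    // prints "false": the user's value never reaches the merged config
    System.out.println(
        mergeAllowedKeys(userSettings, before).containsKey("example.sync.modified.times"));
    // prints "true"
    System.out.println(
        mergeAllowedKeys(userSettings, after).containsKey("example.sync.modified.times"));
  }
}
```

The point of the sketch is that nothing fails loudly when a key is left off the list, so the job simply runs with the default value. If incremental replication builds its configuration in a similar way, the same kind of omission would be the first thing to check.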