Is it true that if partitions are removed on the source side, ReAir will not remove them on the destination side? I am facing this issue. I added 2 partitions and ran the batch load, and everything was fine. Then I dropped one partition and ran it again, but I can still see 2 partitions on the destination side.
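To make the symptom concrete, here is a minimal sketch of the comparison I mean. It assumes the partition names were collected by running `SHOW PARTITIONS` against the table on each cluster; the function name `stale_partitions` and the partition values are made up for illustration and are not part of ReAir.

```python
def stale_partitions(source_partitions, destination_partitions):
    """Return partitions that exist on the destination but not on the source."""
    source = set(source_partitions)
    return sorted(p for p in set(destination_partitions) if p not in source)


# Hypothetical partition names, as printed by SHOW PARTITIONS on each cluster
# after one partition was dropped on the source and the batch job was re-run.
source_partitions = ["ds=2017-01-01"]
destination_partitions = ["ds=2017-01-01", "ds=2017-01-02"]

# Anything listed here was dropped on the source but is still on the destination.
print(stale_partitions(source_partitions, destination_partitions))
```

In my case I would expect this list to be empty after the second run, but the dropped partition still shows up.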
Is this a bug in ReAir, or is it caused by a misconfiguration of the properties?
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<value>DDH CLUSTER</value>
<comment> Name of the source cluster. It can be an arbitrary string and is used in logs, tags, etc. </comment>
</property>
<property>
<name>airbnb.reair.clusters.src.metastore.url</name>
<value>thrift://ip-addr:9083</value>
<comment>Source metastore Thrift URL.</comment>
</property>
<property>
<name>airbnb.reair.clusters.src.hdfs.root</name>
<value>hdfs://table_location</value>
<comment>Source cluster HDFS root. Note trailing slash.</comment>
</property>
<property>
<name>airbnb.reair.clusters.src.hdfs.tmp</name>
<value>dest-path_tmp</value>
<comment>Directory for temporary files for the src cluster. Note that it is kept at the destination location.</comment>
</property>
<property>
<value>GOOGLE CLOUD</value>
<comment>Name of the destination cluster. It can be an arbitrary string and is used in logs, tags, etc.</comment>
</property>
<property>
<name>airbnb.reair.clusters.dest.metastore.url</name>
<value>thrift://ip-addr:9083</value>
<comment>Destination metastore Thrift URL.</comment>
</property>
<property>
<name>airbnb.reair.clusters.dest.hdfs.root</name>
<value>dest-path</value>
<comment>Destination cluster HDFS root. Note trailing slash.</comment>
</property>
<property>
<name>airbnb.reair.clusters.dest.hdfs.tmp</name>
<value>dest-path_tmp</value>
<comment>TEMP TABLE DETAILS</comment>
</property>
<property>
<name>airbnb.reair.clusters.batch.output.dir</name>
<value>dest-path_output</value>
<comment>This configuration must be provided. It gives location to store each stage MR job output.</comment>
</property>
<property>
<name>airbnb.reair.clusters.batch.metastore.blacklist</name>
<value></value>
<comment>Comma separated regex blacklist. dbname_regex:tablename_regex</comment>
</property>
<property>
<name>airbnb.reair.batch.metastore.parallelism</name>
<value>20</value>
<comment> The parallelism to use for jobs requiring metastore calls. This translates to the number of mappers
or reducers in the relevant jobs. </comment>
</property>
<property>
<name>airbnb.reair.batch.copy.parallelism</name>
<value>40</value>
<comment>The parallelism to use for jobs that copy files. This translates to the number of reducers in the relevant jobs.</comment>
</property>
<property>
<name>airbnb.reair.batch.overwrite.newer</name>
<value>true</value>
<comment>Whether the batch job will overwrite newer tables/partitions on the destination. Default is true.</comment>
</property>
<property>
<name>mapreduce.map.speculative</name>
<value>false</value>
<comment>Speculative execution is currently not supported for batch replication.</comment>
</property>
<property>
<name>mapreduce.reduce.speculative</name>
<value>false</value>
<comment>Speculative execution is currently not supported for batch replication.</comment>
</property>
</configuration>
Any help would be appreciated.