On 09/24/2015 08:29 AM, Chris Hirstein wrote:
> I am still seeing an issue after uplifting to 1.1.0. The exception is
> now being thrown in the avro appender since its using the schema from
> the descriptor.
>
> Do you have any suggestions handling a deployment concern when the
> dataset can be updated with a new passive version of the schema while
> old processing is still writing with the previous version of the schema?
>
> Our use case is having multiple oozie coordinators running workflows on
> different frequencies that write to the same partitioned dataset. When
> deploying new versions of code we allow the old processing to complete
> before the new instances being running. This means that we can have a
> period where two versions of the schema are being written to the same
> dataset.
Chris,
You're right, it looks like although the view's schema is checked to see
if it can be used to write, we don't pass it through to the writer.
Sorry about that, it's an oversight on my part. I think we just need to
pass the view's schema through, we already validate that it can be used
for writing.
I've opened KITE-1081 [1] to track this. It's not too difficult, so feel
free to fix it before I have time and I'll check it in. Thanks, Chris!
rb
[1]:
https://issues.cloudera.org/browse/KITE-1081