--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/42f29e28-476a-40fb-8064-89914a0c8718%40googlegroups.com.
Thanks for reporting your approach Will!Using the metadata emitted from Gobblin's publisher seems like a better approach on paper. Could you explain what issues you ran into while trying to do that and why the solution was hacky?Also, if you could explain why you want to record the high watermark externally, that might help us understand the context better. e.g. Do you want to affect future runs of gobblin based on this value or merely want to record and observe this for monitoring and troubleshooting reasons.Shirshanka
On Tue, Aug 16, 2016 at 5:24 AM, Will Bertelsen <wbert...@foursquare.com> wrote:
If any future searchers are curious I ended up serializing the offsets with each record I emit and rolling them up in a later MR to find the largest for each partition/topic.
On Monday, July 18, 2016 at 1:57:41 PM UTC-7, Will Bertelsen wrote:Hi all,I have a kafka -> hdfs ingestion pipeline coming together as described by the docs (using the mapreduce method) but one thing I need to be able to do is record the high watermark kafka offset for each topic/partition processed by a job.I've managed to hook in a proof-of-concept by overriding `publishMetadata` in my publisher, but this seems a little messy. I know this data is recorded in the FsStateStore but accessing it externally seems non trivial. Any tips / best practices for this?
--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-user...@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/ba4a9de4-c799-46aa-a6ad-33309d57636e%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/26e6a531-63d7-42ed-b98c-7e0bb1ad5c74%40googlegroups.com.