Is there any way to ingest HDFS data as source into Kafka topic?
--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/1d872cba-c2bd-4aa2-a262-bc69b24da61b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/CAMZ-pYBwtEdZok4Zz4mqvSwR4cy5ZmPunRLyYH0bwbPbkgYTmw%40mail.gmail.com.
Hi Zhu,If your use case is to read Avro files from HDFS and write to Kafka: you can try the attached pull file; this should mostly work out of the box for you (just make sure to replace server urls).For plain text, etc files - it will be a bit extra work as suggested by Issac.Regards
Abhishek
On Thu, Oct 27, 2016 at 12:15 PM, 'Issac Buenrostro' via gobblin-users <gobblin-users@googlegroups.com> wrote:Hi Zhu,What format are your files in?Gobblin has a Kafka writer which can write the data into Hadoop. It also has most of the functionality for reading from a file system, and it is fully able to read records from an Avro file in HDFS. What it is unfortunately missing is a plain text reader from HDFS, but this should be very easy to implement. At that point, you could combine a file system reader and the Kafka writer to achieve what you want. If you need a non-avro fs reader, do you want to implement it and contribute it to the project? We can of course guide you on where to start.Best,IssacTo view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/CAMZ-pYBwtEdZok4Zz4mqvSwR4cy5ZmPunRLyYH0bwbPbkgYTmw%40mail.gmail.com.--On Thu, Oct 27, 2016 at 11:21 AM, Zhu Wayne <zhuw.c...@gmail.com> wrote:Is there any way to ingest HDFS data as source into Kafka topic?--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/1d872cba-c2bd-4aa2-a262-bc69b24da61b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
Hi Zhu,If your use case is to read Avro files from HDFS and write to Kafka: you can try the attached pull file; this should mostly work out of the box for you (just make sure to replace server urls).For plain text, etc files - it will be a bit extra work as suggested by Issac.Regards
Abhishek
On Thu, Oct 27, 2016 at 12:15 PM, 'Issac Buenrostro' via gobblin-users <gobblin-users@googlegroups.com> wrote:Hi Zhu,What format are your files in?Gobblin has a Kafka writer which can write the data into Hadoop. It also has most of the functionality for reading from a file system, and it is fully able to read records from an Avro file in HDFS. What it is unfortunately missing is a plain text reader from HDFS, but this should be very easy to implement. At that point, you could combine a file system reader and the Kafka writer to achieve what you want. If you need a non-avro fs reader, do you want to implement it and contribute it to the project? We can of course guide you on where to start.Best,IssacTo view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/CAMZ-pYBwtEdZok4Zz4mqvSwR4cy5ZmPunRLyYH0bwbPbkgYTmw%40mail.gmail.com.--On Thu, Oct 27, 2016 at 11:21 AM, Zhu Wayne <zhuw.c...@gmail.com> wrote:Is there any way to ingest HDFS data as source into Kafka topic?--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/1d872cba-c2bd-4aa2-a262-bc69b24da61b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
Abhishek,
I built from master branch gobblin. I ran example wiki pull and it ran fine. However, I ran into an exception on pulling hdfs. Could you take a look?
$ cat nohup.out
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/cloudera/gobblin-dist/lib/avro-tools-1.8.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/home/cloudera/gobblin-dist/lib/slf4j-log4j12-1.7.21.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Exception in thread "main" java.lang.NoSuchMethodError: com.google.common.base.Stopwatch.createUnstarted()Lcom/google/common/base/Stopwatch;
at com.google.common.util.concurrent.ServiceManager$ServiceListener.<init>(ServiceManager.java:593)
at com.google.common.util.concurrent.ServiceManager.<init>(ServiceManager.java:177)
at gobblin.runtime.app.ServiceBasedAppLauncher.start(ServiceBasedAppLauncher.java:125)
at gobblin.scheduler.SchedulerDaemon.main(SchedulerDaemon.java:65)
Hi Zhu,If your use case is to read Avro files from HDFS and write to Kafka: you can try the attached pull file; this should mostly work out of the box for you (just make sure to replace server urls).For plain text, etc files - it will be a bit extra work as suggested by Issac.Regards
Abhishek
On Thu, Oct 27, 2016 at 12:15 PM, 'Issac Buenrostro' via gobblin-users <gobbli...@googlegroups.com> wrote:
Hi Zhu,What format are your files in?Gobblin has a Kafka writer which can write the data into Hadoop. It also has most of the functionality for reading from a file system, and it is fully able to read records from an Avro file in HDFS. What it is unfortunately missing is a plain text reader from HDFS, but this should be very easy to implement. At that point, you could combine a file system reader and the Kafka writer to achieve what you want. If you need a non-avro fs reader, do you want to implement it and contribute it to the project? We can of course guide you on where to start.Best,Issac
On Thu, Oct 27, 2016 at 11:21 AM, Zhu Wayne <zhuw.c...@gmail.com> wrote:
Is there any way to ingest HDFS data as source into Kafka topic?
--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-user...@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/1d872cba-c2bd-4aa2-a262-bc69b24da61b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "gobblin-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-user...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to gobblin-users+unsubscribe@googlegroups.com.
To post to this group, send email to gobbli...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gobblin-users/82105178-a8e3-4f56-9858-b7f994c16239%40googlegroups.com.