Unable to fetch Data from Topic in Kafka


Kunal Gupta

Nov 19, 2015, 2:09:06 AM
to gobblin-users
Hi All,

I ran the Wikipedia job successfully, but I am stuck on the Kafka standalone job.

When I supplied kafka.pull as the Gobblin job configuration, Gobblin logged the following errors:

2015-11-19 11:48:49 IST INFO  [TaskExecutor-0] kafka.utils.Logging$class  68 - Reconnect due to socket error: java.net.SocketTimeoutException

2015-11-19 11:49:19 IST INFO  [JobScheduler-0] gobblin.runtime.local.LocalJobLauncher  121 - 1 out of 1 tasks of job job_GobblinKafkaQuickStart_1447913898012 are running

2015-11-19 11:49:19 IST WARN  [TaskExecutor-0] gobblin.source.extractor.extract.kafka.KafkaWrapper$KafkaOldAPI  339 - Fetch message buffer for partition test:0 has failed: java.net.SocketTimeoutException. Will refresh topic metadata and retry

2015-11-19 11:49:19 IST INFO  [TaskExecutor-0] gobblin.source.extractor.extract.kafka.KafkaWrapper$KafkaOldAPI  233 - Fetching topic metadata from broker localhost:9092

2015-11-19 11:49:19 IST INFO  [TaskExecutor-0] kafka.utils.Logging$class  68 - Reconnect due to socket error: java.nio.channels.ClosedChannelException


The Kafka server also reports a "Broken pipe" error:

[2015-11-19 11:49:49,456] ERROR Closing socket for /192.168.1.18 because of error (kafka.network.Processor)
java.io.IOException: Broken pipe
    at sun.nio.ch.FileDispatcherImpl.write0(Native Method)
    at sun.nio.ch.SocketDispatcher.write(SocketDispatcher.java:47)
    at sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:93)
    at sun.nio.ch.IOUtil.write(IOUtil.java:65)
    at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:471)
    at kafka.api.PartitionDataSend.writeTo(FetchResponse.scala:68)
    at kafka.network.MultiSend.writeTo(Transmission.scala:101)
    at kafka.api.TopicDataSend.writeTo(FetchResponse.scala:125)
    at kafka.network.MultiSend.writeTo(Transmission.scala:101)
    at kafka.api.FetchResponseSend.writeTo(FetchResponse.scala:231)
    at kafka.network.Processor.write(SocketServer.scala:472)
    at kafka.network.Processor.run(SocketServer.scala:342)
    at java.lang.Thread.run(Thread.java:745)


Please help me fix this problem.

Kunal Gupta

Nov 19, 2015, 3:11:53 AM
to gobblin-users
One more thing: if I set topic.whitelist to a different topic, I get the same error. Gobblin cannot pull data from that topic either, and it always fails with a read timeout on the socket connection.

Please resolve this soon.

Kunal Gupta

Nov 19, 2015, 7:14:43 AM
to gobblin-users
I now get a new error when retrieving data from the topic:

2015-11-19 17:42:43 IST WARN  [TaskExecutor-0] gobblin.source.extractor.extract.kafka.KafkaWrapper$KafkaOldAPI  377 - Fetch message buffer for partition test:0 has failed: java.net.SocketTimeoutException. This partition will be skipped.


Why does this error occur, and how can I resolve it?

Ziyang Liu

Nov 19, 2015, 1:02:57 PM
to gobblin-users
Hi Kunal,

I can reproduce this problem on my MacBook running OS X Yosemite. In fact, on my MacBook Gobblin couldn't connect to any Kafka broker (even localhost) and failed with the same SocketTimeoutException. On my Linux machine, however, there is no problem at all. I don't yet know why, but it doesn't seem to be caused by Gobblin, so you may want to try it in a different environment. Meanwhile I'll investigate further.
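
When comparing environments, it can help to first rule out basic network problems between the client machine and the broker. The following is only a minimal sketch, not part of Gobblin or Kafka: the class name BrokerReachability and the method isReachable are hypothetical, and the default localhost:9092 is taken from the log above. It checks only that a TCP connection can be opened within a timeout. It does not speak the Kafka protocol, so a successful check does not prove that a fetch request will succeed.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical helper for illustration; not part of Gobblin or Kafka.
class BrokerReachability {

    /** Returns true if a TCP connection to host:port opens within timeoutMs. */
    static boolean isReachable(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            // connect() throws SocketTimeoutException (an IOException) on timeout
            // and ConnectException if nothing is listening on the port.
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults assume the broker address shown in the Gobblin log.
        String host = args.length > 0 ? args[0] : "localhost";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 9092;
        boolean ok = isReachable(host, port, 5000);
        System.out.println(host + ":" + port + (ok ? " is reachable" : " is NOT reachable"));
    }
}
```

If this reports the broker as reachable but Gobblin still times out, the problem is more likely in the request/response exchange itself than in basic connectivity.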

-Ziyang

Shirshanka Das

Aug 22, 2016, 10:03:31 AM
to gobblin-users
This has now been fixed as part of https://github.com/linkedin/gobblin/pull/1218/