Is there a way to do Kafka batch ingestion in Druid staring form a specific offset or partition for a topic? All I see right now is real time ingestion from Kafka.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/0ab6b0cf-4a1d-42ca-bc1f-5ef5ac7d6b2e%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
{
"type": "kafka",
"dataSchema": {
"dataSource": "pageviews-kafka",
"parser": {
"type": "string",
"parseSpec": {
"format": "json",
"timestampSpec": {
"column": "time",
"format": "auto"
},
"dimensionsSpec": {
"dimensions": ["url", "user"]
}
}
},
"metricsSpec": [
{"name": "views", "type": "count"},
{"name": "latencyMs", "fieldName": "latencyMs", "type": "doubleSum"}
],
"granularitySpec": {
"type": "uniform",
"segmentGranularity": "PT5M",
"queryGranularity": "NONE"
}
},
"ioConfig": {
"topic": "pageviews",
"consumerProperties": {
"bootstrap.servers": "localhost:9092"
}, "useEarliestOffset" : "false", "taskCount": 1, "replicas": 1,
"taskDuration": "PT5M"
}
}The new Kafka indexing service (https://imply.io/docs/latest/tutorial-kafka-indexing-service.html) can ingest historical data. If you set "useEarliestOffset" : false then it will read from the beginning of the topic and ingest everything.
Gian
On Tue, Aug 30, 2016 at 1:50 PM, <sigo...@gmail.com> wrote:
Is there a way to do Kafka batch ingestion in Druid staring form a specific offset or partition for a topic? All I see right now is real time ingestion from Kafka.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/a2ea8174-6dcf-4076-b1ac-417294c0d277%40googlegroups.com.
druid.extensions.loadList=["druid-hdfs-storage", "mysql-metadata-storage", "druid-kafka-eight", "druid-kafka-indexing-service"]
I am still getting the same error even when I included the "druid-kafka-indexing-service" extension?
Gian