Filip Dziuba
Sep 23, 2021, 11:25:39 AM
to Druid User
Hi,
I am trying to reindex data and change the query granularity. I am reading from Druid datasource A and writing to datasource B.
Ingestion is split into 20 subtasks, with 10 running at the same time. One subtask usually fails but succeeds on retry, so the overall index_parallel task is successful.
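For context, a reindexing spec of this kind has roughly the shape below. This is only a sketch, not the exact spec: the interval, both granularities, and the rollup flag are placeholder assumptions, and only the datasource names A and B and the limit of 10 concurrent subtasks come from the setup described above.

```python
import json

# Sketch of an index_parallel reindexing spec: read from datasource "A",
# write to datasource "B" with a different query granularity.
reindex_spec = {
    "type": "index_parallel",
    "spec": {
        "dataSchema": {
            "dataSource": "B",
            # When reading from Druid, the timestamp lives in __time as millis.
            "timestampSpec": {"column": "__time", "format": "millis"},
            "dimensionsSpec": {"dimensions": []},  # placeholder: list dimensions here
            "granularitySpec": {
                "type": "uniform",
                "segmentGranularity": "DAY",   # placeholder assumption
                "queryGranularity": "HOUR",    # placeholder assumption
                "rollup": True,                # placeholder assumption
            },
        },
        "ioConfig": {
            "type": "index_parallel",
            "inputSource": {
                "type": "druid",
                "dataSource": "A",
                "interval": "2021-01-01/2021-02-01",  # placeholder assumption
            },
        },
        "tuningConfig": {
            "type": "index_parallel",
            "maxNumConcurrentSubTasks": 10,  # 10 subtasks run at the same time
        },
    },
}

# The JSON form is what would be submitted to the Overlord.
print(json.dumps(reindex_spec, indent=2))
```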
I can see that the segments and partitions are created and the Druid UI shows the number of rows, but queries return no data. There are no errors on any node.
I am quite stuck; the same spec works on a smaller data set.
The only worrying thing I can see is that when a subtask fails, the partition it was supposed to create is never created. The retry creates a new partition instead. So I can end up with a set of 20 partitions numbered 0-20 with partition 12 missing: the count is OK, but the numbers are not contiguous.
Is this a bug, or should Druid handle partitions with missing numbers?
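To make the gap concrete, here is a minimal sketch of the check I mean. The function name is made up for illustration, and the input list is hard-coded; in practice the partition numbers for one interval could be taken from the `partition_num` column of the `sys.segments` table.

```python
def find_missing_partitions(partition_nums):
    """Return the partition numbers absent from the range 0..max(partition_nums)."""
    present = set(partition_nums)
    if not present:
        return []
    return [n for n in range(max(present) + 1) if n not in present]


# Example matching the case above: partitions 0-20 with number 12 missing.
observed = [n for n in range(21) if n != 12]
missing = find_missing_partitions(observed)
print(len(observed), missing)  # prints: 20 [12]
```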