Hi,i have a design doubt for kafka connect (SourceConnector): i want to read files from many different sources - machines (file with same format).I'm exploring the possibility to use only one target topic (same data domain and format) with 1...N partitions and use 1..N source tasks so i can parallelize the work but :
- i'm a bit scared by the "problem" that in the future we can have N threads (too many threads?) running on kafka cluster (at least one for each SourceTask) :
- probably if i can run the "task thread" on the source system it's better because i'll have 1 source machine = 1 thread and not (for example) 3 kafka nodes = N task threads
- i'm a bit scared by the "problem" that in the future we can have too many partition so we might encounter a performance problems on the topic
- i read that workers rebalance the sourcetask but if i have for example 100 max task and 100 partitions i have 100 workers also ?
- finally my question is : it's better to use many connector instance so we can manage better the scalability and configuration. For example (imagine 100 source) : 5 connector instance with 20 source task each . What is the advantages of having multiple instances of connector instead of one instances and many source task?
--Thanks in advance.
You received this message because you are subscribed to the Google Groups "Confluent Platform" group.
To unsubscribe from this group and stop receiving emails from it, send an email to confluent-platform+unsub...@googlegroups.com.
To post to this group, send email to confluent-platform@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/confluent-platform/c04385c1-ce1a-496d-bddc-e8b68b7ed454%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "Confluent Platform" group.
To unsubscribe from this group and stop receiving emails from it, send an email to confluent-platf...@googlegroups.com.
To post to this group, send email to confluent...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/confluent-platform/ea200679-2f5f-43e3-949e-620a40802fc7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.