While CDAP supports Spark Streaming, I am more interested in knowing when it will support Structured Streaming.
There are two aspects to this:
1) Spark version: CDAP 4.3.4 supports Spark 2.1, whereas Structured Streaming is stable only from Spark 2.2 onwards. What is the approximate time frame for CDAP to support Spark 2.2?
2) Interface between plugins: CDAP's interface has plugins exchange data as RDD[StructuredRecord], whereas Structured Streaming works in terms of DataFrames. While a DataFrame can be created from an RDD, it's inefficient to do so in every plugin: create a DataFrame from the RDD, do the processing, convert it back to an RDD, only to repeat this in the next plugin.
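To make the round trip in point 2 concrete, here is a minimal sketch in plain batch Spark, with no CDAP classes involved. Everything in it is an assumption for illustration: Row stands in for StructuredRecord, and the plugin function, the schema, and the filter are made up. It only shows the shape of the per-plugin conversion, not how CDAP actually wires plugins together.

```scala
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.{DataFrame, Row, SparkSession}
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}

object RoundTripSketch {
  // Hypothetical stand-in for one plugin: the contract is RDD in, RDD out,
  // but the plugin itself wants to use the DataFrame API.
  def plugin(spark: SparkSession, in: RDD[Row], schema: StructType): RDD[Row] = {
    val df: DataFrame = spark.createDataFrame(in, schema) // RDD -> DataFrame
    val processed = df.filter(df("age") > 30)             // the actual processing
    processed.rdd                                         // DataFrame -> RDD
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("round-trip-sketch")
      .getOrCreate()

    // Made-up schema and data, standing in for StructuredRecord contents.
    val schema = StructType(Seq(
      StructField("name", StringType),
      StructField("age", IntegerType)))
    val input = spark.sparkContext.parallelize(Seq(Row("a", 25), Row("b", 40)))

    // Two chained plugins mean two conversions in each direction.
    val result = plugin(spark, plugin(spark, input, schema), schema)
    result.collect().foreach(println)

    spark.stop()
  }
}
```

If plugins could hand a DataFrame directly to the next plugin, the createDataFrame and .rdd calls between stages would not be needed.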
Has anyone succeeded in writing Structured Streaming components in CDAP?
Regards,
Sanjay