Dear Dataproc team,
We have tested Spark 3.1.1 with Spark Structured Streaming (which we had an issue making it work with in Dataproc built on the image 3.1.1-RC2.).
By test, I mean we set-up a 3-node spark cluster sending streaming data from on-premise to GCP BigQuery. Obviously this cannot be used in anger because of latency incurred sending data from Spark cluster on-premise.
In short to get back to using Dataproc nodes, we need an image of Spark built on Spark 3.1.1.
Spark only offers 3.1.1 plus two other releases as STABLE versions
Release Notes for Stable Releases
Please bear in mind that Spark 3.0 and 3.1 are not available for download, meaning IMO they are defective releases Also few times Databricks in Spark user group stated that "3.1.1-RC2 is not a release" whatever that means. But taht is a moot point now.
Anyway to get in position, can you please build an image on 3.1.1 and I would be more than happy to test it as I am sure other users would agree with such release.
Finally I attach a spark GUI with structured streaming for version 3.1.1
Thanks,
Mich