I am looking through the source code. I realize that there is this TimeSeriesRDD class which seems to have some data prep functions. However, in the main model classes such as ARIMA, it uses the local Vector class as input, and uses regression methods from the math3 library. Excuses my ignorance, but it appears it is not at all a distributed implementation.
Please enlighten me how is this a Spark lib? Is it only that limited TimeSeriesRDD part?