Hello Lucas!
See the
Controlling Data Splits section and select forecasting, the answer to your question is in part there.
The TRAIN set is used for training during the model search, and evaluated on VALIDATE.
Once the top model architectures are selected the top 10 models architectures are selected, trained on TRAIN + VALIDATE, ensembled, and the predictions are generated for TEST.
Finally, the same architectures are selected and trained on TRAIN + VALIDATE + TEST, ensembled, and then the model is deployed for prediction.
It is not possible to omit the TEST split completely, but you can make it very small if you are not using it to maximize data for the model search (ex. hold just the last date of data). The final model exposed for batch prediction still uses all data available including TEST to train, so hopefully there is no concern in losing valuable training data. If your data set has a very brief history, one option is to split TRAIN and VALIDATE by time series, the final model may be less sensitive to non-stationarity, but it may be worth the tradeoff if the context and horizon are long relative to the history.
@Dawei, hopefully what I've said above is true, but please correct me if I'm mistaken :)
Best,
Andrew