Hi all!
I'm trying to setup a Bazel build that involves machine learning. What is the best practice for doing that?
E.g. an app could depend on a (pre-)trained model for providing some predictions at runtime. It would make sense to train the model using a genrule, based on a dataset which might be provided as a filegroup.
I'm a bit stuck here with the following issues:
- The dataset is fairly big. I don't want to add it to version control.
- new_local_repository breaks when the dataset is moved to another location or when it happens to be somewhere else on another machine.
- For local development the whole dataset is not necessary, only a small part would be enough. It would be great if it was possible to switch to a sample dataset easily.
I'd be happy about any hint on how I could get that working.
Cheers,
Tim