Distributed PMML scoring engine?

15 views
Skip to first unread message

Reda Bona

unread,
Dec 23, 2020, 11:48:23 AM12/23/20
to Java PMML API
Hello,

I am working on a machine learning project where we use the Openscoring REST API to convert our models to inference servers. However, we find ourselves now needing to handle multiple models (thousands) to allow for multitenancy in our application which we were able to do by running multiple Openscoring REST API nodes and  grouping a certain amount of models in each node. The most complicated problem arises when the requests for a certain model increase beyond the node’s computational capacity and we want to scale the computation for that particular model.

The available choices we can think of are:

  • Launch a new node with just the wanted model. In this case, the node will be severely underused which means wasting resources.
  • Duplicate the entire node which means also duplicating the other models which do not need to be scaled which means wasting memory.

Do you happen to have some advice as to how to surpass this problem please?

Thank you!
Reply all
Reply to author
Forward
0 new messages