Machine models need to be frequently deployed to support changes such as data drift, data decay and new target audiences. A deployment strategy is a way to update an existing ML deployed model in production and support the continuous testing and management of multiple models simultaneously.
Qwak now supports a variety of deployment strategies to enhance model performance, accuracy and flexibility.
Let’s see an example of how to create an A/B deployment where users from New York & California will receive different versions of the Model.
Step 1: Configure two separate audiences using a Yaml file, in our case we will start with “New York” audiences
Step 2: Run and apply using Qwak CLI:
Step 3: Deploy a build (specific model version) to the California audience
Step 4: Create a variation
Step 5: Connect the audience to the variation and deploy
Once both models are deployed you can see them in the overview dashboard
Prediction example using Python Client
The metadata configuration specifies which model version to create an inference request for
As you can see, we now have two models of different versions creating predictions for two separate audiences, one in New York and the other in California. Naturally using the variety of deployment strategies a user can create a robust deployment and a productionized flow to support model testing and production requirements.