Multi-Model Routing

AI APIs often require flexibility in routing requests to multiple models within the same AI provider. The Multi-Model Routing feature in WSO2 API Manager enables API creators to define and enforce routing strategies for distributing requests across different AI models. This ensures efficient resource utilization, enhances reliability, and optimizes performance.

With Multi-Model Routing, API designers can configure either of the following routing strategies:

Load Balancing – Distributes requests across multiple models in a round-robin fashion within the same AI provider.
Failover – Redirects traffic to configured fallback models within the same AI provider when the primary model becomes unavailable.
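
The two strategies above can be illustrated with a minimal Python sketch. This is not WSO2 API Manager's implementation; it only shows the general idea. The function names, the placeholder model names, and the assumption that an unavailable model surfaces as a ConnectionError are all hypothetical.

```python
from itertools import cycle
from typing import Callable, Iterator, Optional, Sequence


def round_robin(models: Sequence[str]) -> Iterator[str]:
    """Yield model names in a fixed, endlessly repeating order."""
    return cycle(models)


def call_with_failover(
    primary: str,
    fallbacks: Sequence[str],
    invoke: Callable[[str], str],
) -> str:
    """Try the primary model first, then each fallback in the given order."""
    last_error: Optional[Exception] = None
    for model in [primary, *fallbacks]:
        try:
            return invoke(model)
        except ConnectionError as err:
            # Assumption: an unavailable model is reported as ConnectionError.
            last_error = err
    raise RuntimeError("all configured models are unavailable") from last_error


if __name__ == "__main__":
    picker = round_robin(["model-a", "model-b"])  # placeholder model names
    print([next(picker) for _ in range(4)])
```

In this sketch, load balancing never inspects the outcome of a request, while failover only moves to the next model after the current one fails.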

Multi-model routing is configured through AI Policies. The following policies are available:

Model Round Robin Policy
Model Weighted Round Robin Policy
Model Failover Policy
Intelligent Model Routing
Semantic Model Routing
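
To give a feel for how a weighted round-robin differs from a plain one, here is a minimal Python sketch of one simple scheme: each model appears in the rotation as many times as its weight. How the Model Weighted Round Robin Policy actually interleaves requests is not described here, so treat the function name, the integer weights, and the placeholder model names as assumptions made for illustration.

```python
from itertools import cycle
from typing import Dict, Iterator


def weighted_round_robin(weights: Dict[str, int]) -> Iterator[str]:
    """Cycle through models, repeating each one `weight` times per round."""
    # Expand the weights into a flat schedule, e.g. {"a": 3, "b": 1} -> a, a, a, b.
    schedule = [model for model, weight in weights.items() for _ in range(weight)]
    if not schedule:
        raise ValueError("at least one model needs a positive weight")
    return cycle(schedule)


if __name__ == "__main__":
    picker = weighted_round_robin({"model-a": 3, "model-b": 1})  # placeholder names
    print([next(picker) for _ in range(8)])
```

With weights of 3 and 1, three out of every four requests go to the first model. A production scheduler would usually spread the heavier model's turns more evenly instead of sending them back to back.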

Tip

You can attach only one of these AI Policies to your AI API, chosen according to your routing requirements. The exception is the Failover policy: you can attach multiple Failover policies, each configured with a different primary model.

Prerequisites

Before you enforce a routing strategy, complete the following two prerequisites:

Add the list of models supported by the AI Service Provider in the Admin Portal
Add the required endpoints, along with the API Key for accessing the AI service

The following sections describe how to complete each prerequisite:

Configuring Model List

Add the models you want to use before you move on to the AI policy configuration section.

Tip

The models listed under the corresponding AI Service Provider are shown as the available models when you configure a policy.

Log in to the Admin Portal (https://<hostname>:9443/admin).
Navigate to the AI Service Providers section from the left menu.
Select the Service Provider that you used to create your AI API.
Find the Model List section and add the models that you intend to consume through the AI API. To add a model, type its name and press Enter.
Once the model list is complete, click Update to save the changes you made to the AI Service Provider.

Configuring Endpoints

If you require multiple endpoints and multiple AI Service Provider API Keys to manage your multi-model routing requirements, make sure to add them before moving to the AI policy configuration section.

Log in to the Publisher Portal (https://<hostname>:9443/publisher).
Navigate to API Configurations, and click Endpoints.
Click Add New Endpoint.

Fill in the endpoint details.

Field	Sample Value
Endpoint Type	Production
Endpoint Name	Prod
Endpoint URL	https://api.openai.com/v1
API Key	Add the API Key obtained from the AI Service Provider

Click Create.
Repeat steps 3 to 5 to add more production or sandbox endpoints.
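
When several endpoints need to be added, it can help to check the values before entering them in the Publisher Portal. The following Python sketch is only a local checklist, not part of WSO2 API Manager: the EndpointEntry class, its field names, and the validation rules are hypothetical and simply mirror the fields in the table above.

```python
from dataclasses import dataclass
from typing import List
from urllib.parse import urlparse


@dataclass(frozen=True)
class EndpointEntry:
    """Hypothetical local record mirroring the endpoint form fields."""

    endpoint_type: str  # "Production" or "Sandbox"
    name: str
    url: str
    api_key: str

    def problems(self) -> List[str]:
        """Return a list of human-readable issues; empty means the entry looks complete."""
        issues: List[str] = []
        if self.endpoint_type not in ("Production", "Sandbox"):
            issues.append("Endpoint Type must be Production or Sandbox")
        if not self.name.strip():
            issues.append("Endpoint Name is empty")
        parsed = urlparse(self.url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            issues.append("Endpoint URL must be an absolute http(s) URL")
        if not self.api_key.strip():
            issues.append("API Key is empty")
        return issues


if __name__ == "__main__":
    # The API key here is a placeholder, not a real credential.
    entry = EndpointEntry("Production", "Prod", "https://api.openai.com/v1", "<your-api-key>")
    print(entry.problems())
```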