FAQ

Frequently asked questions about the AI Router

Basics & Overview

What is the AI Router?

The AI Router automatically selects and routes each request to the best LLM, helping you reduce costs and ensure high reliability. It optimizes for quality, cost, and latency while providing automatic fallbacks and full OpenAI API compatibility. Our customers typically save >60% on their LLM costs while maintaining or improving response quality.

What are the different usage modes?

We offer three modes:

1) Model Selection - returns the name of the best model so you can call it yourself,
2) Full Privacy - selects the best LLM from embeddings of your messages, keeping the raw content private, and
3) Model Routing - automatically calls the selected model and returns its response.

Which models are used for routing?

We continuously evaluate current LLMs and optimize the model mix. You can find the current models here: Available Models.

Who is the AI Router best suited for?

The AI Router is ideal for businesses that want to optimize their LLM usage across multiple providers. It's particularly valuable for companies that need to balance cost, performance, and reliability, handle varying workloads, or require privacy-focused solutions. Common use cases include customer support automation, content generation, data analysis, and any application where consistent LLM performance is crucial, especially RAG applications.

Features & Capabilities

How does the model routing work?

The AI Router combines multiple built-in modules, each taking a different approach, to predict the expected quality, cost, and latency of every supported model for each individual request. It then selects the best model based on your preferences.
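
As a rough mental model (not our actual algorithm), you can think of selection as weighted scoring over per-request predictions. In the sketch below, the model names, prediction values, and weights are all invented for illustration:

```python
# Illustrative sketch only: weighted scoring over per-request predictions.
# All model names, numbers, and weights are made up for the example.

def select_model(predictions, weights):
    """Pick the model with the best weighted score.

    predictions: {model: {"quality": q, "cost": c, "latency": l}} with each
                 value normalized to 0..1 so that higher is better
                 (i.e. cheaper / faster models score higher on cost / latency).
    weights:     relative importance of quality, cost, and latency.
    """
    def score(pred):
        return sum(weights[k] * pred[k] for k in ("quality", "cost", "latency"))

    return max(predictions, key=lambda model: score(predictions[model]))

# Hypothetical per-request predictions for three models.
predictions = {
    "large-model":  {"quality": 0.95, "cost": 0.30, "latency": 0.50},
    "medium-model": {"quality": 0.80, "cost": 0.85, "latency": 0.80},
    "small-model":  {"quality": 0.70, "cost": 0.95, "latency": 0.95},
}

# Cost-focused preferences favor the cheaper models.
print(select_model(predictions, {"quality": 0.2, "cost": 0.6, "latency": 0.2}))  # small-model
```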

How does the full privacy mode work?

In full privacy mode, the AI Router makes model selections based on embeddings of your messages rather than the raw content. The embeddings can be generated automatically by our SDK, or you can compute them yourself and provide them manually. Your message content therefore stays private and never leaves your infrastructure. This feature is available in the pro plan.
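
In Python, the manual flow could look like the conceptual sketch below. The embedding model, endpoint URL, and payload field names are assumptions for illustration only; see the SDK documentation for the real interface.

```python
# Conceptual sketch of full privacy mode: only an embedding is sent, never the
# raw text. Endpoint, field names, and embedding model are assumptions.
import requests
from sentence_transformers import SentenceTransformer

# Compute the embedding locally, so the message content never leaves
# your infrastructure.
embedder = SentenceTransformer("all-MiniLM-L6-v2")  # any local embedding model
message = "Summarize this contract and flag unusual clauses."
embedding = embedder.encode(message).tolist()

# Send only the embedding (hypothetical endpoint and payload shape).
response = requests.post(
    "https://api.airouter.io/v1/model-selection",  # assumed URL
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={"embedding": embedding},                 # assumed field name
)
print(response.json())  # e.g. {"model": "..."}
```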

Can I customize the routing preferences?

Yes, you can adjust the weighting between quality, cost, and latency using API parameters to match your priorities. Enterprise customers can also get custom router optimization for their specific use cases.
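
For example, with the OpenAI Python client the preferences could be passed as an extra body parameter. The `model` sentinel, the parameter name, and the weight shape here are assumptions for the sketch; check the API reference for the exact format.

```python
# Hedged sketch: adjusting routing preferences per request. The "weighting"
# field shape and the "auto" model sentinel are assumptions.
from openai import OpenAI

client = OpenAI(base_url="https://api.airouter.io", api_key="YOUR_AIROUTER_KEY")

response = client.chat.completions.create(
    model="auto",  # assumed placeholder: let the router choose
    messages=[{"role": "user", "content": "Draft a friendly reminder email."}],
    extra_body={"weighting": {"quality": 0.3, "cost": 0.6, "latency": 0.1}},
)
print(response.choices[0].message.content)
```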

What happens if an LLM provider like OpenAI is down?

Our smart fallbacks simply choose the next best model for each request, without you even noticing.

Technical Integration

How do I integrate the AI Router with my existing OpenAI implementation?

Simply change your API endpoint to use airouter.io instead of api.openai.com. No other code changes are needed as we maintain full OpenAI API compatibility. See the docs for more details.
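
With the OpenAI Python client, for example, the switch is a single constructor argument (the exact base URL and the `model` placeholder are assumptions here; see the docs):

```python
# Point the standard OpenAI client at the AI Router instead of OpenAI.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.airouter.io",  # assumed URL, instead of https://api.openai.com
    api_key="YOUR_AIROUTER_KEY",
)

response = client.chat.completions.create(
    model="auto",  # assumed placeholder: the router picks the actual model
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```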

Is there an SDK?

You don't need an SDK to use the AI Router for model routing, as it is fully compatible with any OpenAI integration, such as the openai library or langchain. For model selection and full privacy mode, we provide SDKs for Python and NodeJS to make the integration even easier. Check out our SDK documentation for more details.

Do you support streaming responses?

Yes, for model routing we fully support streaming responses just like the OpenAI API.
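
For example, with the same assumed base URL and model placeholder as above:

```python
# Streaming works exactly as with the standard OpenAI client.
from openai import OpenAI

client = OpenAI(base_url="https://api.airouter.io", api_key="YOUR_AIROUTER_KEY")

stream = client.chat.completions.create(
    model="auto",  # assumed placeholder for automatic routing
    messages=[{"role": "user", "content": "Write a haiku about routers."}],
    stream=True,
)
for chunk in stream:
    # Print tokens as they arrive.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
```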

Do you have a rate limit?

Currently, we don't enforce specific rate limits. If a model's rate limit is reached, our router automatically selects the next best available model to handle your request, ensuring continuous service. This means you can send requests at the rate you need without worrying about hard limits. We plan to introduce configurable rate limits in the future for better predictability and cost control.

Performance & Optimization

How do you ensure the quality of model recommendations?

We continuously evaluate and benchmark LLM performance across different types of tasks and contexts. Our router uses multiple sophisticated algorithms to predict model performance for each specific request, considering factors like content, length, and complexity.

I am more interested in low-latency LLMs, is this also possible?

Yes. You can boost the weight of latency to get lightning-fast model responses; check out the weighting parameter.

What is the latency overhead of using AI Router?

Our best model identification typically adds less than 100 ms of latency, though this can increase for very large requests. On average, this overhead is usually offset, or even outweighed, by automatically selecting faster models for your requests when speed is a priority.

Pricing & Plans

Do I always save money compared to OpenAI?

Our customers typically save >60% on average compared to using GPT-4 directly. However, the actual savings depend on your priorities: if you choose to prioritize quality or speed over cost, the savings might be lower. The AI Router intelligently selects the best model based on your preferences, which you can adjust using the weighting parameter to balance cost, quality, and speed.

Do you offer a free trial?

We offer a free plan for up to 10,000 requests in model selection mode. For full privacy mode and model routing, you can try our pro plan which includes 100,000 requests.

What payment methods do you accept?

We accept all major credit cards and PayPal.

What's included in the enterprise plan?

The enterprise plan can include custom model routing optimization, support for private models, dedicated support, custom SLAs, and integration assistance. Contact us to discuss your specific needs.

How do you calculate the number of requests?

Each API call counts as one request, regardless of the token count or selected model. The free plan includes 10,000 requests for model selection, while the pro plan includes 100,000 requests with additional requests billed at $0.001 each.
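
For example, 130,000 requests in one billing period on the pro plan would cost the plan price plus 30,000 × $0.001 = $30 for the additional requests.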

Do I need to pay for the underlying model costs?

Yes, for model routing you'll still need to pay for the actual usage costs of the selected models. AI Router's fees are only for the routing service itself.

Security & Privacy

How do you handle API keys and security?

We use industry-standard encryption for all API keys and sensitive data. In routing mode, we proxy your requests to the selected model providers while maintaining all security best practices.

Do you store any of our prompt data?

We only store usage statistics for your dashboards and for improving our product. In full privacy mode, we only see embeddings, not the actual content of your messages.

Can I use my own model infrastructure, e.g. my Azure OpenAI deployment or AWS Bedrock models?

Yes, you can always use the AI Router with your own model infrastructure via model selection mode: once you receive the best model, you call it yourself. On the enterprise plan, we can also call private models automatically in model routing mode.
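
A sketch of this flow with an Azure OpenAI deployment: the selection endpoint, payload, and response field below are assumptions for illustration (our SDKs wrap this step), while the Azure client usage is standard.

```python
# Hedged sketch: ask the router for the best model, then call it on your own
# Azure OpenAI deployment. Selection endpoint/fields are assumptions.
import requests
from openai import AzureOpenAI

# 1) Get the best model for this request (assumed endpoint and payload).
selection = requests.post(
    "https://api.airouter.io/v1/model-selection",  # assumed URL
    headers={"Authorization": "Bearer YOUR_AIROUTER_KEY"},
    json={"messages": [{"role": "user", "content": "Classify this ticket."}]},
).json()
best_model = selection["model"]  # assumed response field

# 2) Call that model on your own Azure OpenAI deployment.
azure = AzureOpenAI(
    azure_endpoint="https://YOUR-RESOURCE.openai.azure.com",
    api_key="YOUR_AZURE_KEY",
    api_version="2024-02-01",
)
response = azure.chat.completions.create(
    model=best_model,  # assumes your deployment name matches the model name
    messages=[{"role": "user", "content": "Classify this ticket."}],
)
print(response.choices[0].message.content)
```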

Account & Support

Can I cancel my subscription?

You can cancel your subscription at any time from your subscription settings.

Where can I find my invoices?

You can find your invoices in your subscription settings.

Can I upgrade or downgrade my plan?

Yes, you can upgrade or downgrade your plan at any time from your subscription settings.

How can I change my payment method?

You can update your payment method in your subscription settings. We accept all major credit cards and PayPal.

Can I have multiple users on my account?

Yes, you can add team members to your organization.

Where can I find documentation?

You can find the full documentation here: Documentation

Do you offer technical support?

We offer email support on all plans, and dedicated email and Slack support for enterprise customers.