Multi-Tenant AI API Gateway
| LocalAI | ● Online |
| LiteLLM | ● Offline |
| Cortex API | ● Online |
| Model ID | Type |
gpt-4o | model |
jina-reranker-v1-base-en | model |
qwen3.5-4b | model |
silero-vad | model |
gpt-4 | model |
stablediffusion | model |
text-embedding-ada-002 | model |
tts-1 | model |
whisper-1 | model |
POST /v1/chat/completions | OpenAI-compatible chat (proxied) |
POST /v1/embeddings | Text embeddings |
POST /v1/audio/transcriptions | Whisper transcription |
GET /v1/models | List available models |
POST /admin/tenants | Create tenant |
POST /admin/tenants/{id}/keys | Generate API key |
GET /admin/analytics/usage | Usage analytics |