Ship agents faster. Plano is delivery infrastructure for agentic applications - The AI-native proxy & dataplane that offloads plumbing work, so you stay focused on product logic.
-
Updated
Jan 4, 2026 - Rust
Ship agents faster. Plano is delivery infrastructure for agentic applications - The AI-native proxy & dataplane that offloads plumbing work, so you stay focused on product logic.
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.
A neural multi-armed bandit framework for routing prompts to the most suitable LLM in a multi-agent system.
NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"
Compose, train and test fast LLM routers
Unified interface server for various LLM providers with OpenAI API format
Hybrid AI routing: LOCAL Ollama + CLOUD GitHub Copilot
An applied AI system using LLM routing, hybrid retrieval, and structured positive/negative reasoning for decision support.
Enrutador inteligente de LLMs para el IndesIAhack 2025 que selecciona el mejor modelo por consulta para minimizar coste y consumo energético manteniendo la calidad.
Add a description, image, and links to the llm-routing topic page so that developers can more easily learn about it.
To associate your repository with the llm-routing topic, visit your repo's landing page and select "manage topics."