Smart Routing Plugin 'OrcaRouter' for AI Agents Launches on Dify Marketplace: Manages 200+ LLMs and Cuts Inference Costs by Up to 70%

FlashLabs and its partner Continuum AI have launched 'OrcaRouter', a plugin on the Dify Marketplace that centralizes access to over 200 LLMs. It automatically selects the optimal model for each task within a workflow, reducing AI inference costs by up to 70%.
New Product · NQ 90/100 · Source: PR Times

Published: May 23, 2026 at 04:00

## Smart Routing Plugin 'OrcaRouter' for AI Agents Launches on Dify Marketplace

FlashLabs announced that its partner Continuum AI, a developer of next-generation AI infrastructure, has released the smart routing plugin 'OrcaRouter' on the Dify Marketplace. The plugin gives Dify users access to over 200 large language models (LLMs) through a single API. It automatically selects the most suitable model based on prompt content, which lets users maintain output quality while reducing AI inference costs by up to 70%.

### Background and Purpose

With the rapid adoption of Dify for building AI applications, users have faced challenges such as complex contracts with multiple model providers, excessive costs from fixed-model operations, and the difficulty of choosing the optimal model for varying prompt complexities. OrcaRouter solves these issues through 'adaptive routing' technology.

### Overview of OrcaRouter x Dify Integration

- **API Consolidation**: Access over 200 models from 15+ providers, including OpenAI, Anthropic, Google, xAI, Meta, Mistral, DeepSeek, Alibaba, Moonshot, and ByteDance, under a single API.
- **Intelligent Routing**: Every request is analyzed to select the most cost-effective model that meets specified quality standards.
- **Continuous Optimization**: Routing policies are updated based on quality signals and user feedback, making the system smarter and more cost-effective over time.
- **Real-time Market Adaptation**: Continuously monitors provider pricing, latency, and new model releases to switch routing destinations automatically.
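
The routing and feedback ideas in the list above can be illustrated with a minimal Python sketch. It is not OrcaRouter's actual algorithm, which the announcement does not describe. The model names, prices, quality scores, and the `pick_model` and `record_feedback` helpers are all hypothetical, and the exponential moving average is just one simple way to fold quality signals into a routing policy.

```python
from dataclasses import dataclass


@dataclass
class ModelOption:
    name: str            # hypothetical label, not a real catalog entry
    cost_per_1k: float   # illustrative price per 1,000 tokens, in USD
    quality: float       # estimated quality score in [0, 1]


def pick_model(options, min_quality):
    """Return the cheapest option whose estimated quality meets the bar.

    Falls back to the highest-quality option when nothing meets the bar.
    """
    eligible = [m for m in options if m.quality >= min_quality]
    if eligible:
        return min(eligible, key=lambda m: m.cost_per_1k)
    return max(options, key=lambda m: m.quality)


def record_feedback(model, observed_quality, weight=0.2):
    """Blend one quality signal into the estimate (exponential moving average)."""
    model.quality = (1 - weight) * model.quality + weight * observed_quality


# Made-up catalog for illustration only.
catalog = [
    ModelOption("small-model", cost_per_1k=0.0005, quality=0.70),
    ModelOption("mid-model", cost_per_1k=0.0030, quality=0.85),
    ModelOption("large-model", cost_per_1k=0.0150, quality=0.95),
]

easy = pick_model(catalog, min_quality=0.65)   # a simple prompt with a low quality bar
hard = pick_model(catalog, min_quality=0.90)   # a demanding prompt with a high bar
print(easy.name, hard.name)

# Repeated poor feedback lowers the small model's estimate until it no longer qualifies.
for _ in range(5):
    record_feedback(catalog[0], observed_quality=0.30)
after = pick_model(catalog, min_quality=0.65)
print(after.name)
```

In this toy setup the same low quality bar first routes to the cheapest model and, after negative feedback, to the next cheapest one that still qualifies. A price change in the catalog would shift the choice in the same way.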

### Supported Models and Benefits

OrcaRouter supports cutting-edge models including DeepSeek V4 Pro, Anthropic Claude Opus 4.7, OpenAI GPT 5.5, and Qwen3.7 Max. Users can reduce inference spending by 47–71% depending on their workload without needing to redesign their existing Dify workflows.
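
To make the savings range concrete, here is a small worked calculation in Python. The traffic split and relative costs are invented for illustration and are not figures from the announcement. The sketch only shows how a blended cost compares with sending every request to one fixed model.

```python
def blended_cost(traffic_share, cost_per_request):
    """Average cost per request for a given split of traffic across models."""
    if abs(sum(traffic_share.values()) - 1.0) > 1e-9:
        raise ValueError("traffic shares must sum to 1")
    return sum(share * cost_per_request[name] for name, share in traffic_share.items())


def savings_vs_fixed(traffic_share, cost_per_request, fixed_model):
    """Fractional saving relative to sending every request to fixed_model."""
    routed = blended_cost(traffic_share, cost_per_request)
    return 1.0 - routed / cost_per_request[fixed_model]


# Illustrative numbers only: cost per request relative to the large model (= 1.0).
cost = {"large": 1.0, "mid": 0.3, "small": 0.05}
# Assumed split: 20% of prompts need the large model, 30% the mid one, 50% the small one.
share = {"large": 0.2, "mid": 0.3, "small": 0.5}

saving = savings_vs_fixed(share, cost, fixed_model="large")
print(f"{saving:.1%}")  # 1 - (0.2*1.0 + 0.3*0.3 + 0.5*0.05) = 1 - 0.315
```

Under these assumed numbers the saving is 68.5%. A workload with a larger share of demanding prompts would land lower, which is consistent with the saving being quoted as a range that depends on workload.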

### How to Use

After installing the plugin from the Dify Marketplace, users enter an OrcaRouter API key and configure their routing strategy within the Dify workflow. Pricing is based on provider costs, with no markup added as a platform fee.
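
The article's FAQ says an existing workflow needs only a new Base URL and API key. The Python sketch below illustrates that kind of change in the abstract. The field names, the `point_at_router` helper, the `ORCAROUTER_API_KEY` environment variable, and both URLs are placeholders chosen for this example. None of them comes from Dify's or OrcaRouter's documentation.

```python
import os
from copy import deepcopy


def point_at_router(provider_settings, base_url, api_key):
    """Return a copy of the settings with only the endpoint and key replaced."""
    updated = deepcopy(provider_settings)
    updated["base_url"] = base_url
    updated["api_key"] = api_key
    return updated


# Hypothetical existing settings; the field names are illustrative, not Dify's schema.
current = {
    "base_url": "https://provider.example/v1",
    "api_key": "old-key",
    "temperature": 0.2,
    "max_tokens": 512,
}

new = point_at_router(
    current,
    base_url="https://router.example/v1",  # placeholder, not a real endpoint
    api_key=os.environ.get("ORCAROUTER_API_KEY", "placeholder-key"),  # hypothetical variable name
)

# Everything except the endpoint and key carries over unchanged.
print({k: v for k, v in new.items() if k != "api_key"})
```

Reading the key from an environment variable keeps it out of the workflow definition, which is a common practice rather than a stated requirement of the plugin.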

### FAQ

**How much can I save using OrcaRouter?**

Depending on the workload, you can reduce inference spending by 47% to 71% compared to using a single fixed model.

**Do I need to modify my existing Dify workflows?**

No. Existing workflows continue to work without redesign; you only need to update the Base URL and API key.

**Which AI models are supported?**

OrcaRouter supports over 200 models, including the latest versions from providers such as OpenAI, Anthropic, Google, Meta, and DeepSeek.