Diagram of mcp-scheduling system architecture

mcp-scheduling

A framework-agnostic tool router for MCP (Model Context Protocol) servers. When several similar tools exist, it routes agent requests to the fastest available one, reducing response latency by ~64% on average (tool dependent) across 824 benchmark questions.

Tool similarity is determined by computing vector embeddings of each tool's description, then grouping duplicates via cosine similarity, letting the router identify functionally equivalent tools automatically.
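The grouping step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `ToolInfo` shape, the greedy grouping strategy, the `0.95` threshold, and the toy three-dimensional embeddings are all assumptions made for the example. In practice the vectors would come from an embedding model applied to each tool description.

```typescript
// Hypothetical record: `embedding` stands in for the vector an embedding
// model would produce from the tool's description.
interface ToolInfo {
  name: string;
  embedding: number[];
}

// Cosine similarity: dot(a, b) / (|a| * |b|). Returns 0 if either vector is all zeros.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Embeddings must have the same dimension");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  });
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Greedy grouping (an assumed strategy): a tool joins the first group whose
// first member is at least `threshold` similar to it, else it starts a new group.
function groupSimilarTools(tools: ToolInfo[], threshold: number): string[][] {
  const groups: ToolInfo[][] = [];
  for (const tool of tools) {
    const match = groups.find((group) => {
      const representative = group[0];
      return (
        representative !== undefined &&
        cosineSimilarity(representative.embedding, tool.embedding) >= threshold
      );
    });
    if (match) {
      match.push(tool);
    } else {
      groups.push([tool]);
    }
  }
  return groups.map((group) => group.map((t) => t.name));
}

// Toy embeddings: the two scrape tools point in nearly the same direction.
const exampleGroups = groupSimilarTools(
  [
    { name: "scrape_a", embedding: [0.9, 0.1, 0.0] },
    { name: "scrape_b", embedding: [0.88, 0.12, 0.01] },
    { name: "calculator", embedding: [0.0, 0.2, 0.95] },
  ],
  0.95,
);
console.log(exampleGroups); // [["scrape_a", "scrape_b"], ["calculator"]]
```

The threshold controls how aggressively tools are treated as equivalent: a lower value merges more tools into one group, at the risk of grouping tools that do different things.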

Tested against Google's Frames dataset using Gemini Flash with a custom agent built using LangChain.

README

mcp-scheduling

it's actually a router

The router is framework-agnostic and does not depend on ReAct.
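One way to picture this is a router that only wraps tool calls and knows nothing about the agent loop around it. The sketch below is a hedged illustration under that assumption: `LatencyRouter`, `ToolFn`, `record`, `pick`, and `call` are hypothetical names, and "most recent observed latency" is just one simple choice of speed estimate, not necessarily what mcp-scheduling uses.

```typescript
// Hypothetical tool signature: any async function from input to output.
type ToolFn = (input: string) => Promise<string>;

class LatencyRouter {
  private tools: Map<string, ToolFn>;
  private groups: string[][];
  // Most recent observed latency per tool, in milliseconds.
  private latencyMs = new Map<string, number>();

  constructor(tools: Map<string, ToolFn>, groups: string[][]) {
    this.tools = tools;
    this.groups = groups;
  }

  // Store a latency observation for a tool.
  record(name: string, ms: number): void {
    this.latencyMs.set(name, ms);
  }

  // Choose the lowest-latency tool among those equivalent to `requested`.
  // Unmeasured tools count as 0 ms, so each one gets tried at least once.
  pick(requested: string): string {
    const candidates =
      this.groups.find((g) => g.indexOf(requested) !== -1) ?? [requested];
    let best = requested;
    let bestMs = this.latencyMs.get(requested) ?? 0;
    for (const name of candidates) {
      const ms = this.latencyMs.get(name) ?? 0;
      if (ms < bestMs) {
        best = name;
        bestMs = ms;
      }
    }
    return best;
  }

  // Route the call, then record how long the chosen tool took.
  async call(requested: string, input: string): Promise<string> {
    const name = this.pick(requested);
    const tool = this.tools.get(name);
    if (!tool) throw new Error(`Unknown tool: ${name}`);
    const start = Date.now();
    try {
      return await tool(input);
    } finally {
      this.record(name, Date.now() - start);
    }
  }
}

// Example with two equivalent stand-in tools and pre-recorded latencies.
const exampleRouter = new LatencyRouter(
  new Map<string, ToolFn>([
    ["tool_a", async (url) => `a:${url}`],
    ["tool_b", async (url) => `b:${url}`],
  ]),
  [["tool_a", "tool_b"]],
);
exampleRouter.record("tool_a", 5000);
exampleRouter.record("tool_b", 12);
console.log(exampleRouter.pick("tool_a")); // "tool_b"
```

Because the router only needs a tool name and an input, the same wrapper could sit under a ReAct loop, a plan-and-execute agent, or a plain function call.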

Testing Methodology

NOTE

The names of the MCP servers and tools are generic when provided to the agent; they make no mention of "slow" or "fast".

The agent uses google/gemini-2.0-flash-001 and is provided with two MCP servers:


scrape-slow (5-second delay)

scrape-fast (no delay)
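As a rough illustration of this setup, the sketch below builds two otherwise identical stand-in tools that differ only in an artificial delay. It is not the project's server code: `sleep`, `makeScrapeTool`, and `timeCall` are hypothetical helpers, the returned string is a placeholder rather than a real scrape, and no MCP SDK calls are shown.

```typescript
// Resolve after `ms` milliseconds.
function sleep(ms: number): Promise<void> {
  return new Promise<void>((resolve) => setTimeout(resolve, ms));
}

// Build a stand-in "scrape" tool that waits `delayMs` before answering.
function makeScrapeTool(delayMs: number): (url: string) => Promise<string> {
  return async (url: string) => {
    await sleep(delayMs);
    return `contents of ${url}`; // placeholder instead of a real fetch
  };
}

// Same behavior, different latency, mirroring the two servers above.
const scrapeSlow = makeScrapeTool(5000); // 5-second delay
const scrapeFast = makeScrapeTool(0); // no delay

// Measure the wall-clock time of a single call, in milliseconds.
async function timeCall(
  tool: (url: string) => Promise<string>,
  url: string,
): Promise<number> {
  const start = Date.now();
  await tool(url);
  return Date.now() - start;
}
```

Calling `timeCall(scrapeSlow, url)` and `timeCall(scrapeFast, url)` would then show the gap a router can exploit when both tools are judged equivalent.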

The agent response verifier used openai/gpt-oss-120b.

The router was tested against all 824 questions in the Google Frames dataset.

Results

The agent using the router was faster by 7112.14 ms (63.53%) on average.
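For reference, an absolute and a relative speedup of this kind can be computed from per-question latencies as sketched below. This assumes the percentage is taken relative to the mean latency without the router, which the README does not state explicitly. The helper names and the sample numbers are made up for illustration and are not benchmark data.

```typescript
// Arithmetic mean of a non-empty list of latencies (ms).
function mean(values: number[]): number {
  if (values.length === 0) throw new Error("Need at least one measurement");
  return values.reduce((sum, v) => sum + v, 0) / values.length;
}

// Mean time saved, in ms and as a percentage of the baseline mean.
function latencyReduction(
  baselineMs: number[],
  routedMs: number[],
): { savedMs: number; percent: number } {
  const baseline = mean(baselineMs);
  const routed = mean(routedMs);
  const savedMs = baseline - routed;
  return { savedMs, percent: (savedMs / baseline) * 100 };
}

// Made-up latencies: baseline mean 11000 ms, routed mean 5500 ms.
const exampleReduction = latencyReduction([10000, 12000], [5000, 6000]);
console.log(exampleReduction); // { savedMs: 5500, percent: 50 }
```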

image

Expected, since the current implementation does not track tool result quality.