How It Works
mcpzip acts like a receptionist for your MCP tools. Instead of every tool introducing itself to Claude on every message (flooding the context window), the receptionist directs Claude to exactly the tools it needs, on demand.
The Three Meta-Tools
Every interaction follows a simple three-step pattern: Search, Describe, Execute.
Claude is smart enough to infer parameters from the compact search results in many cases. The describe_tool step is optional -- Claude uses it when it needs the full schema for complex tools with many parameters.
Full Interaction Sequence
Here is what happens when Claude wants to send a Slack message:
Search Decision Tree
mcpzip uses a two-tier search system. Here is how it decides which path to take:
What is MCP?
The Model Context Protocol (MCP) is an open standard that lets AI assistants use external tools. It works like this:
- An MCP server exposes tools (functions the AI can call)
- An MCP client (like Claude Code) connects to the server
- The client calls
tools/listto discover available tools - The client calls
tools/callto invoke a specific tool
Each tool has a JSON Schema describing its parameters. The problem: every tool schema gets loaded into the AI's context window, consuming tokens on every single message.
Learn more at spec.modelcontextprotocol.io.
What is a tool schema?
A tool schema is a JSON document describing a tool's interface. For example, Slack's send_message tool might have this schema:
{
"name": "send_message",
"description": "Send a message to a Slack channel",
"inputSchema": {
"type": "object",
"properties": {
"channel": { "type": "string", "description": "Channel ID or name" },
"text": { "type": "string", "description": "Message text" },
"thread_ts": { "type": "string", "description": "Thread timestamp for replies" }
},
"required": ["channel", "text"]
}
}
A typical tool schema consumes 300-500 tokens. With 10 servers averaging 25 tools each, that is 75,000-125,000 tokens loaded on every message -- before the conversation even starts.