Getting Started
The problem
Section titled “The problem”MCP servers expose dozens or hundreds of tools. Loading all tool definitions into context wastes tokens and slows down responses.
Before: 77 tools × ~500 tokens = 38,500 tokens per requestAfter: 1 compass tool + 3 results = ~2,000 tokens per request
Savings: 95%Option 1: Local installation
Section titled “Option 1: Local installation”# Prerequisites: Ollama with nomic-embed-textollama pull nomic-embed-text
# Clone and setupgit clone https://github.com/mcp-tool-shop-org/tool-compass.gitcd tool-compass/tool_compass
# Create virtual environmentpython -m venv venvsource venv/bin/activate # Windows: venv\Scripts\activate
# Install dependenciespip install -r requirements.txt
# Build the search indexpython gateway.py --sync
# Run the MCP serverpython gateway.pyOption 2: Docker
Section titled “Option 2: Docker”git clone https://github.com/mcp-tool-shop-org/tool-compass.gitcd tool-compass/tool_compass
# Start with Docker Compose (requires Ollama running locally)docker-compose up
# Or include Ollama in the stackdocker-compose --profile with-ollama up
# Access the UI at http://localhost:7860Gradio UI
Section titled “Gradio UI”Tool Compass includes a Gradio web interface for interactive exploration:
python ui.pyTesting
Section titled “Testing”# Run all testspytest
# Run with coveragepytest --cov=. --cov-report=html
# Skip integration tests (no Ollama required)pytest -m "not integration"