Getting Started

Synthesis is a deterministic eval framework that catches relational failure modes in AI assistant responses. No LLM judge, no probabilistic scoring — just rule-based pattern matching that produces auditable evidence.

Installation

From npm:

npm install @mcptoolshop/synthesis

Or clone and build from source:

git clone https://github.com/mcp-tool-shop-org/synthesis.git
cd synthesis
npm install
npm run build

Run your first eval

The quickest way to see Synthesis in action:

npm run build
npm run eval

This loads the bundled test cases from data/evals.jsonl, runs all three checkers, and writes a JSON report to out/report.json.

Exit code 0 means no unexpected failures.

Development mode

For faster iteration without a build step:

npm run dev

What happens during an eval

Load — Synthesis reads your JSONL test cases and validates each one against the JSON schema
Check — Each case runs through the checkers specified in its checks array
Compare — If expected labels are provided, computed results are compared against ground truth
Report — A structured JSON report is written with per-case results, evidence, and aggregate metrics

Next steps

Learn how the checkers work under the hood
Write your own test cases
Set up CI integration for automated empathy regression testing