feat(readme): add Mandoline MCP server to community servers list #2640
+1
−0
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Add Mandoline's MCP server to the community servers section.
Description
This server provides AI assistants with access to Mandoline's evaluation tools, allowing them to reflect on, critique, and continuously improve their own performance through self-evaluation.
Motivation and Context
Two-fold:
Direct, Real-world contexts: Embed eval tools directly in AI clients where users work, so they can measure model performance on their real day-to-day tasks. This helps users be more rigorous about which model works best in a given context. It also helps them do things like build up customized test sets over time.
AI self-improvement: AI assistants can check and improve their own work based on what users care about. This can happen proactively (get task → generate multiple solutions → eval each → send back the best) or reactively (get task → generate solution → get user feedback → run eval → try to do better).
How Has This Been Tested?
Tested with multiple LLM clients: Claude Code, Claude Desktop, and Cursor. All successfully connected to the
Mandoline MCP server and can access the evaluation framework tools.
Example interaction in Claude Code:
Breaking Changes
None - this is a documentation addition only.
Types of changes
Checklist