AgentDish directory

swe-bench

Accepted listings with this tag.

Listing: #432 (↓ -2) clawmark
Description: A local Rust CLI for A/B testing two CLAUDE.md files against a fixed SWE-bench Lite smoke set, with doctor, run, and report commands.
Category: Developer Tools / AI Benchmarking
Score: 82
Trend: ↓ -2
Checked: 2 days ago