AgentDish directory
testing
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#64
↓ -49
LLM-test-kit
An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic. |
Developer Tools / Testing | 88 | ↓ -49 | 27 days ago | Details |
|
#87
↓ -4
llm-mock
Python package for recording real LLM API responses and replaying them in tests so LLM-driven code can run deterministically without live API calls. |
Developer Tools / Testing | 87 | ↓ -4 | 12 days ago | Details |
|
#88
↓ -4
dari-docs
CLI for testing and improving documentation with simulated developer agents. It checks whether docs are clear enough for agents to complete real tasks, reports where they get stuck, and can generate proposed edits. |
Developer Tools / Documentation | 87 | ↓ -4 | 12 days ago | Details |
|
#218
↓ -6
JDS
A Copilot skill suite that enforces structured coding workflows for AI-assisted development, with design, planning, TDD, debugging, verification, and cleanup steps plus a live task-graph visualization server. |
Developer Tools / AI Coding Assistants | 84 | ↓ -6 | 18 days ago | Details |
|
#438
↑ +31
Agent Eval
A GitHub repo for evaluating agentic AI pipeline systems, with guidance for defining metrics, building eval cases, running repeatable tests, and tracking regressions. |
Developer Tools / Copywriting | 77 | ↑ +31 | 27 days ago | Details |
|
#442
↓ -1
Pure Effect
Pure Effect is a small JavaScript library for building testable business logic as data-driven commands instead of relying on mocks. It includes primitives like Success, Failure, Command, Ask, Retry, and Parallel, plus helpers like effectPipe and runEffect for composing and executing flows. |
Developer Tools / Code Assistant | 76 | ↓ -1 | 3 days ago | Details |