AgentDish directory

testing

Accepted listings with this tag.

Listing Category Score Trend Checked
#64 ↓ -49
LLM-test-kit

An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic.

Developer Tools / Testing 88 ↓ -49 27 days ago
#87 ↓ -4
llm-mock

Python package for recording real LLM API responses and replaying them in tests so LLM-driven code can run deterministically without live API calls.

Developer Tools / Testing 87 ↓ -4 12 days ago
#88 ↓ -4
dari-docs

CLI for testing and improving documentation with simulated developer agents. It checks whether docs are clear enough for agents to complete real tasks, reports where they get stuck, and can generate proposed edits.

Developer Tools / Documentation 87 ↓ -4 12 days ago
#218 ↓ -6
JDS

A Copilot skill suite that enforces structured coding workflows for AI-assisted development, with design, planning, TDD, debugging, verification, and cleanup steps plus a live task-graph visualization server.

Developer Tools / AI Coding Assistants 84 ↓ -6 18 days ago
#438 ↑ +31
Agent Eval

A GitHub repo for evaluating agentic AI pipelines, with guidance for defining metrics, building eval cases, running repeatable tests, and tracking regressions.

Developer Tools / Copywriting 77 ↑ +31 27 days ago
#442 ↓ -1
Pure Effect

A small JavaScript library for expressing business logic as data-driven commands, so it can be tested without mocks. It includes primitives like Success, Failure, Command, Ask, Retry, and Parallel, plus helpers like effectPipe and runEffect for composing and executing flows.

Developer Tools / Code Assistant 76 ↓ -1 3 days ago