AgentDish directory
workflow testing
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#400
↑ +6
LLM INQUISITOR
A GitHub repository that proposes a practical methodology for evaluating how AI systems behave during real work, with quick-start, practitioner, and methodology guides included. |
Developer Tools / AI Evaluation | 78 | ↑ +6 | 13 days ago | Details |