Developer Tools / Testing

LLM-test-kit

An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic.

Clear: 24/30
Useful: 27/30
Specific: 17/20
Complete: 20/20

Why it was accepted

The page clearly presents a real AI developer tool with a specific use case: testing prompt behavior and reliability across multiple runs. It shows concrete commands, supported providers/models, installation steps, budget controls, and a report workflow, which is enough for a useful public listing.
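To make "reliability across multiple runs" concrete, here is a minimal sketch of a repeated-run consistency check. It is not LLM-test-kit's code or CLI: `run_consistency_check`, `fake_model`, and the report keys are hypothetical names chosen for illustration, and the stub stands in for a real OpenAI or Anthropic call.

```python
import statistics
import time
from collections import Counter
from typing import Callable


def run_consistency_check(call_model: Callable[[str], str], prompt: str, runs: int = 5) -> dict:
    """Call the model `runs` times and summarize output agreement and latency."""
    outputs = []
    latencies = []
    for _ in range(runs):
        start = time.perf_counter()
        outputs.append(call_model(prompt).strip())
        latencies.append(time.perf_counter() - start)

    # Consistency here is the share of runs that produced the most common output.
    most_common_output, count = Counter(outputs).most_common(1)[0]
    return {
        "runs": runs,
        "distinct_outputs": len(set(outputs)),
        "consistency": count / runs,
        "most_common_output": most_common_output,
        "mean_latency_s": statistics.mean(latencies),
    }


def fake_model(prompt: str) -> str:
    # Hypothetical deterministic stand-in for a provider call (assumption, not a real API).
    return "4" if "2 + 2" in prompt else "unknown"


report = run_consistency_check(fake_model, "What is 2 + 2?", runs=5)
print(report["consistency"], report["distinct_outputs"])
```

Exact string matching is the simplest possible agreement measure; a real tool would likely need a looser comparison for free-form answers.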

Weakness

The snapshot does not show package publishing details, release history, or maintenance activity beyond the current repo view, so visitors cannot judge adoption or update cadence from this page alone.

Review status

Last evaluated 27 days ago. Current rank #63. Down 48 spots in the rankings.

Score history

90, 88

Related listings

CodeGraph
94

Developer Tools / AI for Code

CodeGraph is a local code knowledge graph for AI coding agents like Claude Code, Cursor, Codex, OpenCode, and Hermes Agent. It aims to cut token use, tool calls, and runtime by letting agents query pre-indexed code structure instead of scanning files repeatedly.

Version Sentinel

Developer Tools / AI Coding Guardrails

Claude Code plugin that blocks dependency edits until a fresh, source-cited version check is recorded, helping prevent hallucinated or stale package versions across npm, pip, Poetry/uv, Cargo, and NuGet.

OWASP Agent Memory Guard

Developer Tools / AI Security

An OWASP incubator project that protects AI agent memory from prompt injection, secret leakage, and tampering. It includes a Python library, policy-based controls, benchmarks, and integrations for agent frameworks like LangChain and AutoGen.

#7 aislop
91

Developer Tools / Code Quality

CLI for catching AI-generated code smells and regressions in code. It scans changes with 40+ rules across 7 languages, offers fixes, CI gating, hooks, and MCP tools.