Developer Tools / AI Evaluation

showhn-rank

A Python pipeline that ranks 1,000 Show HN posts with an LLM judge and TrueSkill, then compares estimated merit against Hacker News points.

Clear: 24/30
Useful: 22/30
Specific: 16/20
Complete: 12/20
Why it was accepted

The page clearly describes an AI-powered evaluation pipeline, shows how it works, and includes setup and run commands. It is useful to builders interested in LLM judging, ranking systems, and automated content evaluation, with concrete implementation details and a stated methodology.
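For readers unfamiliar with this kind of pipeline, the idea can be sketched roughly as follows. This is an illustrative sketch, not the project's code: the judge is a deterministic stub standing in for an LLM call, a plain Elo update is swapped in for TrueSkill to keep the example dependency-free, and every name here (stub_judge, rank_posts, the "quality" field, the sample posts) is hypothetical.

```python
import itertools
import random


def stub_judge(a, b):
    """Hypothetical stand-in for an LLM judge: returns the preferred post."""
    return a if a["quality"] >= b["quality"] else b


def elo_update(r_winner, r_loser, k=32.0):
    """Standard Elo update, used here as a simpler substitute for TrueSkill."""
    expected = 1.0 / (1.0 + 10 ** ((r_loser - r_winner) / 400.0))
    delta = k * (1.0 - expected)
    return r_winner + delta, r_loser - delta


def rank_posts(posts, judge, rounds=3, seed=0):
    """Rate posts from repeated pairwise judgments over all pairs."""
    rng = random.Random(seed)
    ratings = {p["id"]: 1500.0 for p in posts}
    pairs = list(itertools.combinations(posts, 2))
    for _ in range(rounds):
        rng.shuffle(pairs)
        for a, b in pairs:
            winner = judge(a, b)
            loser = b if winner is a else a
            w, l = winner["id"], loser["id"]
            ratings[w], ratings[l] = elo_update(ratings[w], ratings[l])
    return ratings


def ranks(values):
    """Rank positions, 1 = largest value; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    out = [0] * len(values)
    for pos, i in enumerate(order, start=1):
        out[i] = pos
    return out


def spearman(xs, ys):
    """Spearman rank correlation for tie-free data."""
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


# Made-up sample data: "quality" drives the stub judge, "points" mimics HN points.
posts = [
    {"id": "a", "quality": 0.9, "points": 12},
    {"id": "b", "quality": 0.7, "points": 240},
    {"id": "c", "quality": 0.4, "points": 35},
    {"id": "d", "quality": 0.2, "points": 3},
]

ratings = rank_posts(posts, stub_judge)
merit = [ratings[p["id"]] for p in posts]
points = [p["points"] for p in posts]
rho = spearman(merit, points)
print(f"Spearman correlation between estimated merit and points: {rho:.2f}")
```

A low correlation in a setup like this would suggest that points and judged merit diverge, which is the kind of comparison the listing describes. A real run would replace the stub with actual LLM calls and would likely sample pairs rather than compare all of them.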

Weakness

The snapshot does not show the report output itself, example rankings, or evidence of recent maintenance beyond a small commit history. It is also unclear how well the approach performs in practice across different kinds of posts.

Review status

Last evaluated 23 days ago. Current rank #451. Down 1 spot in the rankings.

Score history

74

Related listings

CodeGraph
94

Developer Tools / AI for Code

CodeGraph is a local code knowledge graph for AI coding agents like Claude Code, Cursor, Codex, OpenCode, and Hermes Agent. It aims to cut token use, tool calls, and runtime by letting agents query pre-indexed code structure instead of scanning files repeatedly.

Version Sentinel

Developer Tools / AI Coding Guardrails

Claude Code plugin that blocks dependency edits until a fresh, source-cited version check is recorded, helping prevent hallucinated or stale package versions across npm, pip, Poetry/uv, Cargo, and NuGet.

OWASP Agent Memory Guard

Developer Tools / AI Security

An OWASP incubator project that protects AI agent memory from prompt injection, secret leakage, and tampering. It includes a Python library, policy-based controls, benchmarks, and integrations for agent frameworks like LangChain and AutoGen.

#7 aislop
91

Developer Tools / Code Quality

CLI for catching AI-generated code smells and regressions. It scans changes with 40+ rules across 7 languages and offers fixes, CI gating, hooks, and MCP tools.