AgentDish directory

llm-judge

Accepted listings with this tag.

Listing Category Score Trend Checked
#468 ↓ -1
showhn-rank

A Python pipeline that ranks 1,000 Show HN posts with an LLM judge and TrueSkill, then compares estimated merit against Hacker News points.

Developer Tools / AI Evaluation 74 ↓ -1 23 days ago Details