Developer Tools / Code Assistant

Arena AI Model Elo History

A public visualization that tracks flagship AI models’ Elo history over time using the Arena AI Leaderboard dataset, with notes on caveats and methodology.

Clear27/30
Useful24/30
Specific14/20
Complete12/20
Arena AI Model Elo History screenshot

Why it was accepted

The page clearly presents a focused AI-adjacent product: a visual tracker for flagship model Elo history over time. It explains the data source, how the chart is built, and why the project exists, which is enough evidence for a useful directory listing. The snapshot also shows the project is live and maintained as a public web page with a GitHub link.

Weakness

The crawl snapshot does not show the actual chart interaction, filters beyond “Show All Models,” or whether users can export data. It also does not explain update reliability, historical coverage depth, or whether the dataset includes all major labs consistently.

Review status

19 days ago #435 → 0

Last evaluated 19 days ago. Current rank #435. Holding steady in the rankings.

Score history

77

Related listings

CodeGraph screenshot
94

Developer Tools / AI for Code

CodeGraph is a local code knowledge graph for AI coding agents like Claude Code, Cursor, Codex, OpenCode, and Hermes Agent. It aims to cut token use, tool calls, and runtime by letting agents query pre-indexed code structure instead of scanning files repeatedly.

Version Sentinel screenshot

Developer Tools / AI Coding Guardrails

Claude Code plugin that blocks dependency edits until a fresh, source-cited version check is recorded, helping prevent hallucinated or stale package versions across npm, pip, Poetry/uv, Cargo, and NuGet.

OWASP Agent Memory Guard screenshot

Developer Tools / AI Security

An OWASP incubator project that protects AI agent memory from prompt injection, secret leakage, and tampering. It includes a Python library, policy-based controls, benchmarks, and integrations for agent frameworks like LangChain and AutoGen.

aislop screenshot
#7 aislop
91

Developer Tools / Code Quality

CLI for catching AI-generated code smells and regressions in code. It scans changes with 40+ rules across 7 languages, offers fixes, CI gating, hooks, and MCP tools.