Under the hood

AI queue

Watch AgentDish crawl new pages and run AI reviews in public.

Processing queue

Live queue state for crawls and AI evaluations.

#205 submission_evaluation succeeded

https://agentsarena.vercel.app

complete 2026-05-06 15:32:56
#204 submission_evaluation succeeded

https://curl.md

complete 2026-05-06 15:32:51
#203 submission_evaluation succeeded

https://mczaykowski.com/articles/smallest-ax-surface

complete 2026-05-06 15:32:45
#202 submission_evaluation succeeded

https://github.com/marvkr/better-design

complete 2026-05-06 15:32:40
#201 submission_evaluation succeeded

https://github.com/sqliteai/adam

complete 2026-05-06 15:32:31
#200 submission_evaluation succeeded

https://finbarr.site/2026/05/05/treat-your-coding-agents-like-developers.html

complete 2026-05-06 15:32:25
#199 submission_evaluation succeeded

https://github.com/OxideAV/oxideav-magicyuv/issues/3

complete 2026-05-06 15:32:20
#198 submission_evaluation succeeded

https://songshift.reachnick.co

complete 2026-05-05 15:35:50
#197 submission_evaluation succeeded

https://bugs.launchpad.net/bugs/1996267

complete 2026-05-05 15:35:45
#196 submission_evaluation succeeded

https://github.com/rdmsr/sectorllm

complete 2026-05-05 15:35:39
#195 submission_evaluation succeeded

https://github.com/grimm67123/grimmbot

complete 2026-05-05 15:35:33
#194 submission_evaluation succeeded

https://medium.com/@cmitre/the-week-my-ai-assistant-tried-to-end-me-and-accidentally-helped-me-build-a-better-model-a2a6e43a5c52

complete 2026-05-05 15:35:28

Processing logs

Newest worker events first.

info / evaluating Running the AI evaluation. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / crawl_saved Saved crawl snapshot with status 200. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / crawling Crawling https://arxiv.org/abs/2605.24218. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / checking_duplicate Checking for an existing accepted listing. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / starting Started processing job. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / complete Processing job completed. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:19
info / screenshot_captured Screenshot captured and linked. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:19
info / capturing_screenshot Capturing screenshot for Building the harness around our coding agents: eight failure modes, eight pillars. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / complete Accepted and published Building the harness around our coding agents: eight failure modes, eight pillars. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / publishing Publishing accepted listing. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / evaluation_saved Saved accepted evaluation with score 74. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / evaluating Running the AI evaluation. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:12
info / crawl_saved Saved crawl snapshot with status 200. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:12
info / crawling Crawling https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / checking_duplicate Checking for an existing accepted listing. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / starting Started processing job. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / complete Processing job completed. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:10
info / screenshot_captured Screenshot captured and linked. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:10
info / capturing_screenshot Capturing screenshot for A 400-hour forensic audit of LLMs using multi-model context saturation. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:05
info / complete Accepted and published A 400-hour forensic audit of LLMs using multi-model context saturation. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:05