Under the hood

AI queue

Watch AgentDish crawl new pages and run AI reviews in public.

Processing queue

Live queue state for crawls and AI evaluations.

#181 submission_evaluation succeeded
https://developers.googleblog.com/supercharging-llm-inference-on-google-tpus-achieving-3x-speedups-with-diffusion-style-speculative-decoding/
complete 2026-05-05 15:33:42

#180 submission_evaluation succeeded
https://github.com/microsoft/vscode/issues/314311
complete 2026-05-05 15:33:34

#179 submission_evaluation succeeded
https://github.com/aragossa/pii-shield
complete 2026-05-05 15:33:27

#178 submission_evaluation succeeded
https://genosyn.com
complete 2026-05-05 15:33:14

#177 submission_evaluation succeeded
https://github.com/fsilavong/agent-eval
complete 2026-05-05 15:33:09

#176 submission_evaluation succeeded
https://github.com/boldsoftware/shelley
complete 2026-05-05 15:33:03

#175 submission_evaluation succeeded
https://github.com/rodriguezaa22ar-boop/atlas-trust-infrastructure
complete 2026-05-05 15:32:57

#174 submission_evaluation succeeded
https://github.com/kouhxp/liteflow
complete 2026-05-05 15:32:51

#173 submission_evaluation succeeded
https://github.com/SprintiQ-Incorporated/sprintiq
complete 2026-05-05 15:32:45

#172 submission_evaluation succeeded
https://agents2agents.ai/bonsai
complete 2026-05-05 15:32:37

#171 submission_evaluation succeeded
https://github.com/angelos-p/llm-from-scratch
complete 2026-05-05 15:32:31

#170 submission_evaluation succeeded
https://app.rudel.ai/wrapped
complete 2026-05-04 15:33:32

Processing logs

Newest worker events first.

info / evaluating Running the AI evaluation. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / crawl_saved Saved crawl snapshot with status 200. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / crawling Crawling https://arxiv.org/abs/2605.24218. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / checking_duplicate Checking for an existing accepted listing. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / starting Started processing job. https://arxiv.org/abs/2605.24218 / 2026-05-26 15:33:21
info / complete Processing job completed. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:19
info / screenshot_captured Screenshot captured and linked. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:19
info / capturing_screenshot Capturing screenshot for Building the harness around our coding agents: eight failure modes, eight pillars. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / complete Accepted and published Building the harness around our coding agents: eight failure modes, eight pillars. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / publishing Publishing accepted listing. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / evaluation_saved Saved accepted evaluation with score 74. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15
info / evaluating Running the AI evaluation. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:12
info / crawl_saved Saved crawl snapshot with status 200. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:12
info / crawling Crawling https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / checking_duplicate Checking for an existing accepted listing. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / starting Started processing job. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11
info / complete Processing job completed. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:10
info / screenshot_captured Screenshot captured and linked. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:10
info / capturing_screenshot Capturing screenshot for A 400-hour forensic audit of LLMs using multi-model context saturation. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:05
info / complete Accepted and published A 400-hour forensic audit of LLMs using multi-model context saturation. https://github.com/alanscalone/llm-behavior-analysis / 2026-05-26 15:33:05
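
The log entries above appear to share one layout: `level / stage message URL / timestamp`. As a minimal sketch, assuming that layout holds for every line (the page does not document it), the Python below splits an entry into fields and estimates how long one URL spent in the pipeline. The names `LogEvent`, `parse_log_line`, and `job_duration_seconds` are hypothetical and are not part of AgentDish.

```python
import re
from datetime import datetime
from typing import List, NamedTuple, Optional

# Assumed layout: "<level> / <stage> <message> <url> / <YYYY-MM-DD HH:MM:SS>".
# The message is matched greedily, so a URL quoted inside the message
# (as in the "crawling" entries) stays in the message and the last URL wins.
LOG_LINE = re.compile(
    r"^(?P<level>\w+) / (?P<stage>\w+) (?P<message>.*) "
    r"(?P<url>https?://\S+) / (?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})$"
)


class LogEvent(NamedTuple):
    level: str
    stage: str
    message: str
    url: str
    timestamp: datetime


def parse_log_line(line: str) -> Optional[LogEvent]:
    """Return the parsed event, or None if the line does not fit the assumed layout."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    return LogEvent(
        level=match["level"],
        stage=match["stage"],
        message=match["message"],
        url=match["url"],
        timestamp=datetime.strptime(match["ts"], "%Y-%m-%d %H:%M:%S"),
    )


def job_duration_seconds(events: List[LogEvent], url: str) -> Optional[float]:
    """Seconds between the earliest and latest event seen for one URL."""
    stamps = [event.timestamp for event in events if event.url == url]
    if not stamps:
        return None
    return (max(stamps) - min(stamps)).total_seconds()


# Three entries copied from the log above (newest first, as on the page).
SAMPLE = [
    "info / complete Processing job completed. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:19",
    "info / evaluation_saved Saved accepted evaluation with score 74. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:15",
    "info / starting Started processing job. https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/ / 2026-05-26 15:33:11",
]

events = [event for event in map(parse_log_line, SAMPLE) if event is not None]
url = "https://nimbalyst.com/blog/agent-harness-above-claude-code-codex/"
print(job_duration_seconds(events, url))
```

Grouping by URL is a simplification: if the same URL were submitted more than once, its events would be merged, so a job identifier would be a better key if the log exposed one.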