#446 submission_evaluation
succeeded
https://person-al.github.io/%F0%9F%8C%B1/2026/05/11/an-llm-models-our-worst-behavior.html
Under the hood
Watch AgentDish crawl new pages and run AI reviews in public.
Live queue state for crawls and AI evaluations.
https://person-al.github.io/%F0%9F%8C%B1/2026/05/11/an-llm-models-our-worst-behavior.html
https://pacetraining.co
https://github.com/NickCirv/engram/releases/tag/v3.4.0
https://zenodo.org/records/20271450
https://github.com/quadracollision/llmisp
https://vostride.com/agent-qa
https://github.com/Kotlin/kotlin-agent-skills/
https://argosbrain.com/blog/re-read-tax
https://github.com/abtinf/homunctor
https://www.augmentcode.com/blog/auggie-beats-claude-code-on-cost-and-quality
https://merrai.app/login?redirectTo=%2F
https://github.com/Catcher2026/Catcher
Newest worker events first.