AgentDish directory

behavior-analysis

Accepted listings with this tag.

Listing Category Score Trend Checked

A GitHub research project documenting a long-form, multi-model analysis of LLM behavior across Claude, Gemini, ChatGPT, and Grok. The repo includes an executive summary, screenplay, technical white paper, and archive of logs and chat records.

AI Research / LLM Evaluation & Analysis 75 → 0 7 days ago Details