AgentDish directory

bias

Accepted listings with this tag.

Listing Category Score Trend Checked

A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.

AI Research / Model Behavior Analysis 74 ↓ -1 8 days ago Details