AI Research / Model Behavior Analysis

GPT Guesses Between 1 and 100

A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.

Clear: 26/30
Useful: 20/30
Specific: 16/20
Complete: 12/20

Why it was accepted

The page clearly describes an AI-related research project with a concrete methodology, model name, sample size, and results. It is useful to AI builders and researchers interested in model sampling behavior, and the README gives enough evidence for a public listing.

Weakness

The snapshot does not show the actual charts, dataset contents, or code-level instructions for reproducing the experiment from start to finish, so a visitor cannot fully assess the outputs without opening the repository.
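For readers who want a feel for what "comparing the results to a uniform baseline" could involve, here is a minimal sketch of a chi-square goodness-of-fit statistic over the integers 1 to 100. It is not taken from the repository: the function names `chi_square_vs_uniform` and `most_common_share` are hypothetical, and the sample data is synthetic rather than the project's actual responses.

```python
from collections import Counter


def chi_square_vs_uniform(guesses, low=1, high=100):
    """Return (statistic, degrees_of_freedom) for a chi-square
    goodness-of-fit test of `guesses` against a uniform distribution
    over the integers low..high. Out-of-range values are ignored."""
    values = range(low, high + 1)
    in_range = [g for g in guesses if low <= g <= high]
    counts = Counter(in_range)
    expected = len(in_range) / len(values)  # same expected count per value
    statistic = sum(
        (counts.get(v, 0) - expected) ** 2 / expected for v in values
    )
    return statistic, len(values) - 1


def most_common_share(guesses, top=5):
    """Return the `top` most frequent values with their share of all guesses."""
    counts = Counter(guesses)
    total = len(guesses)
    return [(value, count / total) for value, count in counts.most_common(top)]


if __name__ == "__main__":
    # Synthetic stand-ins, NOT the project's data.
    perfectly_uniform = list(range(1, 101)) * 100  # 10,000 guesses
    always_42 = [42] * 10_000                      # 10,000 identical guesses

    print(chi_square_vs_uniform(perfectly_uniform))
    print(chi_square_vs_uniform(always_42))
    print(most_common_share(always_42, top=1))
```

Under a truly uniform sampler the statistic would be expected to sit near its degrees of freedom (99 here), so a value far above that suggests the model favors particular numbers. Turning the statistic into a p-value would need a chi-square distribution function, for example from a statistics library.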

Review status


Last evaluated 8 days ago. Current rank #445. Down 1 spot in the rankings.

Score history

74

Related listings

MarCognity-AI
81

AI Research / Evaluation / Verification Framework

An open-source research framework for structured LLM evaluation, claim verification, and source-grounded reflective reasoning. The repo describes modular components for retrieval, semantic scoring, skeptical claim checking, and benchmark-style epistemic assessment.

Agora-1: The Multi-Agent World Model

AI Research / World Models

Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

AI Research / LLM Reasoning

Apple Machine Learning Research paper proposing LaDiR, a reasoning framework that combines a VAE-based latent space with latent diffusion to improve LLM text reasoning and iterative refinement.