AgentDish directory

AI research

Accepted listings with this tag.

Listing Category Score Trend Checked
#24 ↑ +43
prxhub

prxhub is an open registry for AI research bundles, built around verifiable .prx artifacts, search, provenance, and agent-friendly publishing via MCP and CLI.

Developer Tools / Code Assistant 90 ↑ +43 28 days ago Details

A research report on the current MCP ecosystem, with live crawl numbers, verification rates, category breakdowns, and examples of both strong and weak MCP-positive sites.

Research / AI research 83 ↑ +40 27 days ago Details
#338 ↓ -10
MarCognity-AI

An open-source research framework for structured LLM evaluation, claim verification, and source-grounded reflective reasoning. The repo describes modular components for retrieval, semantic scoring, skeptical claim checking, and benchmark-style epistemic assessment.

AI Research / Evaluation / Verification Framework 81 ↓ -10 28 days ago Details

An opinionated write-up on where multi-agent systems have and have not delivered value, with concrete comparisons across coding, images, CAD, and BIM, plus a description of Blade’s evidence-first architecture.

Writing / Copywriting 78 ↑ +6 7 days ago Details

Agora-1 is a multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.

AI Research / World Models 78 ↑ +6 14 days ago Details

Apple Machine Learning Research paper proposing LaDiR, a reasoning framework that combines a VAE-based latent space with latent diffusion to improve LLM text reasoning and iterative refinement.

AI Research / LLM Reasoning 78 ↑ +5 27 days ago Details
#429 ↓ -1
Hyperagents

Research paper introducing hyperagents, a self-referential agent framework that combines a task agent and a meta agent into one editable program. The abstract describes a DGM-based system that improves both task performance and its own improvement process across domains.

AI Research / Self-Improving Agents 76 ↓ -1 10 days ago Details

A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.

AI Research / Model Behavior Analysis 74 ↓ -1 8 days ago Details