AgentDish directory

AI Research AI Tools

Accepted listings in this category.

Rank	Listing	Description	Category	Score	Trend	Checked
#263	Prometheus	An autonomous research system that runs on a single workstation and aggressively checks its own claims with adversarial self-verification, replication, and calibration audits.	AI Research / Autonomous Research Systems	86	↑ +2	7 days ago
#733	Socrates	Open-source multi-agent protocol for AI research agents. It pairs a tool-using Scientist with an advisor that can only ask questions and approve plans. The README includes quick-start setup and notes on reproducing results on MLE-bench/Kaggle tasks.	AI Research / Multi-agent systems	82	↓ -2	22 days ago
#818	EuroMesh	A sourced model and short report exploring whether Europe could train a sovereign frontier AI model using public compute it already owns, with reproducible code, datasets, and a PDF report.	AI Research / Analysis / Reports	81	↑ +2	32 days ago
#835	MarCognity-AI	An open-source research framework for structured LLM evaluation, claim verification, and source-grounded reflective reasoning. The repo describes modular components for retrieval, semantic scoring, skeptical claim checking, and benchmark-style epistemic assessment.	AI Research / Evaluation / Verification Framework	81	↓ -35	73 days ago
#940	MiroThinker	A science-focused AI research app that emphasizes prediction, verification, and evidence-backed answers. The page also points to a MiroMind app and suggests use cases across finance, medicine, and regulation.	AI Research / Deep Research Agent	78	↑ +6	35 days ago
#951	Learning from AVA: Early Lessons from a Curated and Trustworthy Generative AI for Policy and Development Research	arXiv paper describing AVA, a GenAI platform for policy and development research built on 4,000+ World Bank reports. The abstract highlights multilingual support, evidence-based synthesis, citation verifiability, and reasoned abstention when queries cannot be supported.	AI Research / Trustworthy Generative AI	78	↑ +6	50 days ago
#964	Agora-1: The Multi-Agent World Model	A multi-agent world model from Odyssey that simulates shared real-time environments for up to four participants, human or AI, with a focus on gaming, robotics, reinforcement learning, and foundation model research.	AI Research / World Models	78	↑ +6	59 days ago
#976	LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning	Apple Machine Learning Research paper proposing LaDiR, a reasoning framework that combines a VAE-based latent space with latent diffusion to improve LLM text reasoning and iterative refinement.	AI Research / LLM Reasoning	78	↑ +5	72 days ago
#1038	Hyperagents	Research paper introducing hyperagents, a self-referential agent framework that combines a task agent and a meta agent into one editable program. The abstract describes a DGM-based system that improves both task performance and its own improvement process across domains.	AI Research / Self-Improving Agents	76	↓ -1	55 days ago
#1054	A 400-hour forensic audit of LLMs using multi-model context saturation	A GitHub research project documenting a long-form, multi-model analysis of LLM behavior across Claude, Gemini, ChatGPT, and Grok. The repo includes an executive summary, screenplay, technical white paper, and archive of logs and chat records.	AI Research / LLM Evaluation & Analysis	75	→ 0	52 days ago
#1082	GPT Guesses Between 1 and 100	A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.	AI Research / Model Behavior Analysis	74	↓ -1	53 days ago