AgentDish directory

Recently Evaluated

The newest checks across accepted listings.

Listing Category Score Trend Checked
#41 ↓ -3
NUA

NUA is a compliance-testing agent for regtech teams. It ingests rules, runs generated checks against a product, flags gaps, and produces an audit-ready coverage report that updates on every release.

AI Product / Compliance / RegTech 88 ↓ -3 14 hours ago
#98 ↑ +2
ZSE v2.0.0

A pure-Python LLM inference engine and server with CUDA/HIP/Metal code generation, OpenAI-compatible API support, built-in RAG, and multi-GPU backend support.

Developer Tools / AI / ML Infrastructure 86 ↑ +2 14 hours ago

A blog post about verifiable RAG that benchmarks open-source NLI verifiers against Claude on RAGTruth and describes a Python library for sentence-level citation and claim verification.

AI / RAG / Verification & Hallucination Detection 78 ↑ +6 14 hours ago

PaperProfit explains an AI-assisted stock evaluation approach that combines fundamentals, technical signals, and qualitative analysis from transcripts and SEC filings into a weighted score.

Research / Knowledge Work 77 → 0 14 hours ago
#42 ↓ -3
Codex SDK

OpenAI’s SDK docs for programmatically controlling local Codex agents, with coverage of setup, core concepts, tools, orchestration, evaluation, and deployment paths.

Developer Tools / Code Assistant 88 ↓ -3 14 hours ago

A technical guide that shows how to build a small CLI-based LLM agent for microservices using tool discovery, model calls, conversation history, and a REPL loop.

Developer Tools / AI Development 81 ↑ +2 14 hours ago
#408 → 0
NoSleepAgent

A macOS utility that keeps a MacBook awake while AI agents are running, then lets normal sleep resume when they go idle. It includes install scripts, hooks for Claude Code and Codex, and a menu bar status indicator.

Developer Tools / Productivity 77 → 0 14 hours ago
#285 ↓ -2
Viveka

A Python filter layer for LLM apps that evaluates responses against a Lean-verified Scherf logic backend and can pass, flag, correct, or block output.

Developer Tools / AI Safety / LLM Guardrails 82 ↓ -2 14 hours ago

A detailed tutorial that explains what agent tools are and walks through a basic set of implementation patterns for an AI agent in Python, including bash, file, search, edit, and web fetch tools.

Writing / Copywriting 74 ↓ -1 14 hours ago

A GitHub-hosted guidelines file for AI coding assistants used in Stanford CS336. It tells assistants how to help students with debugging, explanations, and feedback while avoiding direct solutions or code generation.

Writing / Copywriting 72 ↑ +1 14 hours ago

An open-source ecommerce starter that boots a Magento or Mage-OS store in GitHub Codespaces and uses Claude to make changes directly in the live dev environment.

Developer Tools / Code Assistant 86 ↑ +2 38 hours ago
#286 ↓ -2
WhatsKept

A local Go tool that turns encrypted iOS WhatsApp backups into a searchable workspace for AI agents, with OCR, transcription, and SQLite/FTS5 indexing.

Developer Tools / AI Data Processing 82 ↓ -2 38 hours ago
#174 ↓ -6
dygit

A local AI IDE plugin that cleans up messy prompts before they reach Claude, Cursor, or OpenCode. It fixes typos, dropped words, and split tokens on-device with no network or token cost.

Developer Tools / AI IDE Plugins 84 ↓ -6 38 hours ago
#175 ↓ -6
QVAC by Tether

QVAC SDK 0.12.0 adds TurboQuant, a KV-cache quantization feature for local LLM inference that claims up to 5x more context on-device with no retraining or code changes.

Developer Tools / AI SDK 84 ↓ -6 38 hours ago

JetBrains introduces Mellum2, an open-source 12B model built for software engineering workflows, routing, Q&A, RAG, sub-agents, and private deployment.

AI Model / Code/Workflow Model 88 ↓ -3 38 hours ago

A Git-native CLI task ledger for coordinating work between humans and coding agents, with task claiming, notes, releases, handoffs, and audit history stored in files and versioned in Git.

Developer Tools / CLI / Workflow Tooling 82 ↓ -2 38 hours ago

A Docker blog post about a real AI coding agent failure and how Docker Sandboxes aim to contain destructive execution mistakes.

Developer Tools / Code Assistant 75 → 0 38 hours ago
#176 ↓ -6
OP

OP is an API that gives AI agents real phone numbers for account registration, OTP/2FA codes, SMS, and voice calls. The page shows a live product pitch, a feature comparison, a simple API example, and multiple agent communication use cases.

Developer Tools / API 84 ↓ -6 38 hours ago

A Codacy blog post explaining GitHub Copilot’s new AI Credits billing for code review and positioning Codacy AI Reviewer as an alternative with fixed per-seat pricing, static analysis, and pull-request review features.

Developer Tools / Code Assistant 74 ↓ -1 38 hours ago
#154 ↓ -3
Deliberate

Deliberate is an SDK and proxy that records the alternatives an AI agent considered, what it rejected, and why, with replay, policy gates, approval workflows, and exportable decision logs.

Developer Tools / AI Agent Tooling 85 ↓ -3 38 hours ago
#252 ↓ -3
Ouijit

Ouijit is an open-source task and terminal session manager for coding agents. It adds agent status, notifications, automatic worktree management, and VM sandboxing, with support shown for Claude Code, Codex, and Pi.

Developer Tools / Agent Orchestration 83 ↓ -3 38 hours ago
#177 ↓ -6
Odysseus

Self-hosted AI workspace with chat, agents, deep research, document editing, email triage, notes, calendar, and model comparison. The README includes setup instructions for Docker and native installs, plus a broad feature list and demo references.

AI Productivity / AI Workspace 84 ↓ -6 38 hours ago
#178 ↓ -6
screenshotter

A macOS utility that compresses screenshots locally and copies them to the clipboard for use with coding agents like Codex and Claude.

Developer Tools / AI Developer Utilities 84 ↓ -6 2 days ago
#354 ↑ +2
agentpack

agentpack is a tool for staging reproducible AI coding environments across agent harnesses like Claude Code, Codex, Cursor, and OpenCode. It manages skills, hooks, MCPs, and other agent configuration from a single manifest without polluting the project or user config.

Developer Tools / AI Coding / Agent Infrastructure 79 ↑ +2 2 days ago