AgentDish directory

openai

Accepted listings with this tag.

Listing Category Score Trend Checked
#31 → 0
OpenAI CLI

Official command-line client for the OpenAI API, with documented installation, environment variables, resource-based commands, file passing, and admin API support.

Developer Tools / CLI / API Tooling 89 → 0 25 days ago
#42 ↓ -3
Codex SDK

OpenAI’s SDK docs for programmatically controlling local Codex agents, with coverage of setup, core concepts, tools, orchestration, evaluation, and deployment paths.

Developer Tools / Code Assistant 88 ↓ -3 15 hours ago
#63 ↓ -48
LLM-test-kit

An open-source CLI for testing LLM prompts across consistency, latency, cost, and behavior, with HTML reports and support for OpenAI and Anthropic.

Developer Tools / Testing 88 ↓ -48 27 days ago
#83 ↓ -4
llm-mock

Python package for recording real LLM API responses and replaying them in tests so LLM-driven code can run deterministically without live API calls.

Developer Tools / Testing 87 ↓ -4 12 days ago
#143 ↓ -109
clawdex

A macOS companion overlay for Claude Code that shows an animated pet/sprite based on Claude Code activity. It includes a CLI, hook integration, a daemon, and a web-based atlas validator/renderer for Codex-pet-compatible pets.

Developer Tools / AI Developer Tools 86 ↓ -109 27 days ago
#209 ↓ -6
Ctx-opt

TypeScript middleware that trims or compresses LLM chat history to fit a token budget, with support for OpenAI, Anthropic, and AI SDK integrations.

Developer Tools / LLM Context Management 84 ↓ -6 19 days ago
#215 ↓ -6
Torrix

Self-hosted AI observability for tracking LLM requests, costs, latency, prompt traces, reasoning tokens, and PII masking across many model providers.

Developer Tools / AI Observability 84 ↓ -6 20 days ago

A tutorial and live demo showing how to build an AI SQL analyst agent that inspects schema, writes queries, runs them, and explains results using Node.js, OpenAI, Vercel AI SDK, and SQLite.

Data / SQL 84 ↑ +81 27 days ago

A GitHub research project that measures how gpt-4.1 responds when asked to pick a random number between 1 and 100, using 10,000 API calls and comparing the results to a uniform baseline.

AI Research / Model Behavior Analysis 74 ↓ -1 8 days ago