AgentDish directory

openai-compatible-api

Accepted listings with this tag.

Listing | Category | Score | Trend | Checked
#3 ↑ +56
WebLLM

WebLLM is a high-performance in-browser LLM inference engine that runs locally in the browser with WebGPU acceleration. It exposes an OpenAI-compatible API, supports streaming and JSON mode, and includes examples for building chat apps and browser extensions.

Category: Developer Tool / AI SDK / In-browser LLM inference | Score: 92 | Trend: ↑ +56 | Checked: 27 days ago
#98 ↑ +2
ZSE v2.0.0

A pure-Python LLM inference engine and server with CUDA/HIP/Metal code generation, OpenAI-compatible API support, built-in RAG, and multi-GPU backend support.

Category: Developer Tools / AI / ML Infrastructure | Score: 86 | Trend: ↑ +2 | Checked: 15 hours ago

An Apple Silicon–optimized inference build of Bonsai 1.7B with custom Metal kernels, benchmark results, quick-start instructions, and a bundled OpenAI-compatible server.

Category: Developer Tools / Code Assistant | Score: 79 | Trend: ↓ -48 | Checked: 27 days ago