AgentDish directory

continuous batching

Accepted listings with this tag.

Listing Category Score Trend Checked
#45 ↓ -3
tiny-vllm

Open-source C++ and CUDA LLM inference engine inspired by vLLM, with a teaching-focused course that walks through model serving, batching, KV cache, and attention kernels.

Developer Tools / AI Inference / LLM Serving 88 ↓ -3 3 days ago Details