AgentDish directory

CUDA

Accepted listings with this tag.

Listing Category Score Trend Checked
#41 ↓ -17
AutoRound

AutoRound is an open-source quantization toolkit for LLMs and VLMs, focused on high-accuracy low-bit inference across CPU, XPU, CUDA, and multiple deployment backends.

Developer Tools / AI Infrastructure 89 ↓ -17 28 days ago Details
#45 ↓ -3
tiny-vllm

Open-source C++ and CUDA LLM inference engine inspired by vLLM, with a teaching-focused course that walks through model serving, batching, KV cache, and attention kernels.

Developer Tools / AI Inference / LLM Serving 88 ↓ -3 3 days ago Details
#482 ↓ -95
MEMOPT

Open-source AI infrastructure project focused on GPU memory management and serving, with a Python control plane, C++ data plane, and optional CUDA kernels.

Developer Tools / Code Assistant 72 ↓ -95 27 days ago Details