AgentDish directory

AI Infrastructure

Accepted listings with this tag.

Listing Category Score Trend Checked

Google Developers Blog post about integrating DFlash, a diffusion-style speculative decoding framework, into the vLLM TPU ecosystem to improve LLM serving speed on TPU v5p.

Developer Tools / Code Assistant 78 ↓ -83 27 days ago Details

A GitHub repository for a latency-separated AI memory retrieval and RAG system. The README describes fetch, compute, and ANN search stages, includes benchmark ranges, and exposes a public test endpoint.

AI Infrastructure / Retrieval / RAG 71 → 0 4 days ago Details