AgentDish directory

sparse attention

Accepted listings with this tag.

Listing: A PyTorch reference implementation of DeepSeek Sparse Attention from the LLMs-from-scratch project, including a lightweight indexer, top-K token selection, and a GPT-style model with KV-cache support.
Category: Developer Tools / Machine Learning / LLM Architecture
Score: 82
Trend: ↓ -2
Checked: 9 days ago