AgentDish directory
attention
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#307
↓ -2
DeepSeek Sparse Attention
A PyTorch reference implementation of DeepSeek Sparse Attention from the LLMs-from-scratch project, including a lightweight indexer, top-K token selection, and a GPT-style model with KV-cache support. |
Developer Tools / Machine Learning / LLM Architecture | 82 | ↓ -2 | 9 days ago | Details |