AgentDish directory
pytorch
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#293
↑ +41
Train Your Own LLM from Scratch
A hands-on workshop repository that walks developers through building a small GPT-style language model from scratch, including tokenization, transformer blocks, training, and text generation. |
Developer Tools / Machine Learning | 83 | ↑ +41 | 27 days ago | Details |
|
#307
↓ -2
DeepSeek Sparse Attention
A PyTorch reference implementation of DeepSeek Sparse Attention from the LLMs-from-scratch project, including a lightweight indexer, top-K token selection, and a GPT-style model with KV-cache support. |
Developer Tools / Machine Learning / LLM Architecture | 82 | ↓ -2 | 9 days ago | Details |