AgentDish directory

pytorch

Accepted listings with this tag.

Listing Category Score Trend Checked

A hands-on workshop repository that walks developers through building a small GPT-style language model from scratch, including tokenization, transformer blocks, training, and text generation.

Developer Tools / Machine Learning 83 ↑ +41 27 days ago

A PyTorch reference implementation of DeepSeek Sparse Attention from the LLMs-from-scratch project, including a lightweight indexer, top-K token selection, and a GPT-style model with KV-cache support.

Developer Tools / Machine Learning / LLM Architecture 82 ↓ -2 9 days ago