AgentDish directory

quantization

Accepted listings with this tag.

Listing Category Score Trend Checked
#40 ↓ -16
AutoRound

AutoRound is an open-source quantization toolkit for LLMs and VLMs, focused on high-accuracy low-bit inference across CPU, XPU, CUDA, and multiple deployment backends.

Category: Developer Tools / AI Infrastructure | Score: 89 | Trend: ↓ -16 | Checked: 28 days ago
#312 ↓ -2
UltraCompress

UltraCompress is a Python-based compression tool for large language models. The repo describes lossless 5-bit transformer compression, verification via SHA-256, a CLI on PyPI, and published model packs on Hugging Face.

Category: Developer Tool / ML / Model Compression | Score: 82 | Trend: ↓ -2 | Checked: 24 days ago
#319 ↓ -2
sectorllm

An open-source Llama2 inference engine written in x86 real-mode assembly that fits in 1277 bytes and can boot directly from disk before any OS loads.

Category: Developer Tools / AI Development | Score: 82 | Trend: ↓ -2 | Checked: 27 days ago