AgentDish directory

model-metrics

Accepted listings with this tag.

Listing Category Score Trend Checked

An interactive explainer that teaches core AI and LLM evaluation metrics through playful visuals, including loss, perplexity, precision, recall, F1, accuracy, ROUGE, BLEU, and BERTScore.

Education / AI Metrics / Learning Resource 78 ↑ +6 just now Details