Developer Tools / Code Assistant

Show HN: 178K Parameter Neural Net That Wins Poke(rogue)like

A blog post describing a small reinforcement-learning agent trained with PPO to play and beat a Pokelike/Pokerogue-style game, including the input representation, model architecture, and training approach.

Clear: 27/30
Useful: 24/30
Specific: 16/20
Complete: 7/20

Why it was accepted

The page clearly describes an AI agent project with enough technical detail to be useful in a public directory listing. It explains the game environment, the state/action design, the 1386-dimensional input vector, the shared trunk with policy/value heads, and the training approach with PPO. The post also includes the outcome of the experiment and concrete modeling choices that affected performance.
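For readers unfamiliar with the "shared trunk with policy/value heads" layout mentioned above, here is a minimal pure-Python sketch. Only the 1386-dimensional input comes from the listing. The single hidden layer, its width of 128, the ReLU activation, and the 4-action output are assumptions picked for illustration (they happen to land near the 178K parameters in the title); the post's actual layer sizes and action space may differ.

```python
import math
import random

INPUT_DIM = 1386   # from the listing: 1386-dimensional input vector
HIDDEN_DIM = 128   # assumption: hidden width is not stated in the listing
NUM_ACTIONS = 4    # assumption: action count is not stated in the listing


def init_linear(n_in, n_out, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def linear(layer, x):
    """Apply a dense layer: one dot product plus bias per output unit."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def count_params(layer):
    weights, biases = layer
    return sum(len(row) for row in weights) + len(biases)


class ActorCritic:
    """Shared trunk feeding a policy head (action probabilities)
    and a value head (scalar state-value estimate)."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.trunk = init_linear(INPUT_DIM, HIDDEN_DIM, rng)
        self.policy_head = init_linear(HIDDEN_DIM, NUM_ACTIONS, rng)
        self.value_head = init_linear(HIDDEN_DIM, 1, rng)

    def forward(self, state):
        # Shared features, computed once and used by both heads.
        hidden = [max(0.0, h) for h in linear(self.trunk, state)]  # ReLU
        logits = linear(self.policy_head, hidden)
        # Numerically stable softmax over action logits.
        peak = max(logits)
        exps = [math.exp(l - peak) for l in logits]
        total = sum(exps)
        probs = [e / total for e in exps]
        value = linear(self.value_head, hidden)[0]
        return probs, value

    def num_params(self):
        layers = (self.trunk, self.policy_head, self.value_head)
        return sum(count_params(layer) for layer in layers)


model = ActorCritic()
probs, value = model.forward([0.0] * INPUT_DIM)
print(model.num_params())  # 178181 under the assumed sizes
```

Under these assumed sizes the trunk holds almost all of the parameters (1386 x 128 weights plus 128 biases), so the two heads add very little on top of the shared features. In PPO, the policy head's probabilities drive action selection while the value head's estimate is used to compute advantages.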

Weakness

This is a write-up of a project, not a product page or repo, so it offers no downloadable artifact, code link, or setup steps showing how others can run the agent. It also leaves out training data details and does not say whether the project is maintained beyond this post.

Review status

Last evaluated 7 days ago. Current rank #558. Down 1 spot in the rankings.

Score history

74

Related listings

CodeGraph
94

Developer Tools / AI for Code

CodeGraph is a local code knowledge graph for AI coding agents like Claude Code, Cursor, Codex, OpenCode, and Hermes Agent. It aims to cut token use, tool calls, and runtime by letting agents query pre-indexed code structure instead of scanning files repeatedly.

Version Sentinel

Developer Tools / AI Coding Guardrails

Claude Code plugin that blocks dependency edits until a fresh, source-cited version check is recorded, helping prevent hallucinated or stale package versions across npm, pip, Poetry/uv, Cargo, and NuGet.

#6 Omni
91

Developer Tools / Search & Retrieval

Omni is a local-first semantic search app for macOS that indexes text, code, PDFs, images, audio, and video on-device. It supports multilingual search, private offline use, and exposes a local endpoint for agents to query indexed files.

OWASP Agent Memory Guard

Developer Tools / AI Security

An OWASP incubator project that protects AI agent memory from prompt injection, secret leakage, and tampering. It includes a Python library, policy-based controls, benchmarks, and integrations for agent frameworks like LangChain and AutoGen.