Research / AI Agents

QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

arXiv paper describing QUEST, an open family of deep research agents from 2B to 35B parameters, plus a synthetic-task training recipe and released models, data, and scripts.

Clear27/30
Useful28/30
Specific16/20
Complete12/20
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks screenshot

Why it was accepted

The page clearly presents an AI research product: an open family of deep research agents with described capabilities, training method, benchmarks, and released assets. The abstract gives enough evidence to support a useful public listing for AI builders and researchers.

Weakness

The crawl shows the paper abstract but not the actual repo, model cards, benchmark tables, or download links, so visitors cannot tell how to run the models or what licenses and deployment requirements apply.

Review status

7 days ago #254 ↓ -3

Last evaluated 7 days ago. Current rank #254. Down 3 spots in the rankings.

Score history

83

Related listings

Below the Fold — A New York Times X-Ray Dashboard screenshot

Research / Data Visualization

An interactive dashboard that analyzes New York Times coverage since 2000 using the NYT Archive API, with views for reporters, beats, sections, subjects, geography, obituaries, and corrections.

CAD-Bench screenshot
#56 CAD-Bench
88

Research / Knowledge Work

An open benchmark and leaderboard for AI CAD agents, with 308 prompts across 20 categories and layered scoring for geometry, engineering, manufacturability, and cognition.

Benchmarking Inference Engines on Agentic Workloads screenshot

Research / Knowledge Work

A research article from Applied Compute on how agentic, tool-using workloads differ from traditional LLM benchmarks, with production observations, workload profiles, and an open-source harness for replaying traces.

Alignment Whack-a-Mole screenshot

Research / Copywriting

A research code repository for studying how fine-tuning can trigger verbatim recall of copyrighted books in large language models. It includes preprocessing, fine-tuning, generation, and memorization-evaluation scripts, with setup notes and example data.