bench
success score 1.00
hackernews-top
5.0s duration 3 events 2026-06-27 05:01:37
"The agent successfully retrieved the top 5 posts from Hacker News with accurate information."
Input
{ "source": "hackernews", "count": 5 }
Output
[ { "title": "Previewing GPT‑5.6 Sol: a next-generation model", "score": 909, "by": "minimaxir", "url": "https://openai.com/index/previewing-gpt-5-6-sol/" }, { "title": "Why does kinetic energy increase quadratically, not linearly, with speed? (2011)", "score": 156, "by": "ProxyTracer", "url": "https://physics.stackexchange.com/questions/535/why-does-kinetic-energy-increase" }, { "title": "WordStar: A Writer's Word Processor (1996)", "score": 14, "by": "droidjj", "url": "https://www.sfwriter.com/wordstar.htm" }, { "title": "U.S. allows Anthropic to release Mythos AI to ‘trusted’ US organizations", "score": 309, "by": "bobrenjc93", "url": "https://www.semafor.com/article/06/27/2026/us-releases-powerful-anthropic-model-" }, { "title": "Show HN: Hacker News on a train station-style flip board", "score": 37, "by": "PaybackTony", "url": "https://popflame.quickish.space/hn-flipboard/" } ]
0 / 3 events
Event stream (3)
start 05:01:37
log 05:01:42
end 05:01:42