Terminal-Wrench, a dataset of 331 realistic hackable environments

  • Hacker News

I want to share a new dataset of 331 reward-hackable environments. These are real environments used in Terminal Bench and adjacent benchmarks. I first got interested in this because, as a reviewer of Terminal Bench, I noticed a lot of our tasks were hackable. I also noticed th...

  • Published: Apr 15, 2026
  • First seen: Apr 15, 2026

Best for

Teams evaluating AI product workflows / Builders comparing emerging tools / Operators tracking early category shifts

Why it matters

Primary discovery source is Hacker News.

Key Features

  • Primary public product URL is https://github.com/few-sh/terminal-wrench.
  • Description: I want to share a new dataset of 331 reward-hackable environments. These are real environments used in Terminal Bench and adjacent benchmarks. I first got interested in this because, as a reviewer of Terminal Bench, I....
  • GitHub repository is linked as few-sh/terminal-wrench.
  • Listed on Hacker News as "Terminal-Wrench, a dataset of 331 realistic hackable environments".

Use Cases

  • Primary discovery source is Hacker News.
  • A public GitHub repo is available for direct technical review.
  • Hacker News mention is recent (2026-04-15).

Why Now

Terminal-Wrench, a dataset of 331 realistic hackable environments, is appearing on fresh discovery surfaces, so it is worth reviewing while momentum is still forming. Confidence is currently medium (49/100), so treat this as an early signal rather than a settled trend.

Community Signals

  • Trend score: 119
  • 24h momentum: Rising
  • Hacker News points: 6 (Rising)

Facts / Signals / Inference / Unknowns

Facts

  • Listed on Hacker News as "Terminal-Wrench, a dataset of 331 realistic hackable environments".
  • Source description: I want to share a new dataset of 331 reward-hackable environments. These are real environments used in Terminal Bench and adjacent benchmarks. I first got interested in this because, as a reviewer of Terminal Bench, I....
  • Source publish date is 2026-04-15.
  • GitHub repository is linked as few-sh/terminal-wrench.
  • Primary public product URL is https://github.com/few-sh/terminal-wrench.

Signals

  • Hacker News mention is recent (2026-04-15).
  • A public GitHub repo is available for direct technical review.
  • Primary discovery source is Hacker News.

Inference

  • Public code access can lower evaluation friction for developer audiences.

Unknowns

  • Documentation is not explicitly linked in the current allowed evidence set.
  • No tagline is stored on the current product record.
  • Pricing details are not explicitly linked in the current allowed evidence set.
  • Recent changelog or release history is not explicitly linked in the current allowed evidence set.
  • Release cadence cannot be confirmed unless a changelog or release link is explicitly provided.

Evidence Snapshots

  • Hacker News listing: Listed on Hacker News as "Terminal-Wrench, a dataset of 331 realistic hackable environments".
  • GitHub repository: GitHub repository is linked as few-sh/terminal-wrench.
  • Official profile: Primary public product URL is https://github.com/few-sh/terminal-wrench.
