Listed on Hacker News as "Needle: We Distilled Gemini Tool Calling into a 26M Model".
Source description: Hey HN, Henry here from Cactus. We open-sourced Needle, a 26M parameter function-calling (tool use) model. It runs at 6000 tok/s prefill and 1200 tok/s decode on consumer devices. We were always frustrated by the litt....
Source publish date is 2026-05-12.
GitHub repository is linked as cactus-compute/needle.
Primary public product URL is https://github.com/cactus-compute/needle.
Signals
Hacker News mention is recent (2026-05-12).
A public GitHub repo is available for direct technical review.
Primary discovery source is Hacker News.
Inferences
Public code access can lower evaluation friction for developer audiences.
Unknowns
Documentation is not explicitly linked in the current allowed evidence set.
No tagline is stored on the current product record.
Pricing details are not explicitly linked in the current allowed evidence set.
Recent changelog or release history is not explicitly linked in the current allowed evidence set.
Release cadence cannot be confirmed unless a changelog or release link is explicitly provided.
Evidence snapshot
Hacker News listing: "Needle: We Distilled Gemini Tool Calling into a 26M Model"
GitHub repository: cactus-compute/needle
Official profile: https://github.com/cactus-compute/needle