Early access Working with a small set of design partners

See how candidates actually work with AI.

Gonfire is an assessment platform built around the assumption that candidates will use AI — and that the way they steer the AI is the signal worth measuring. Real codebases, every interaction captured.

Request access Why this matters

same task · two candidates live capture

Candidate A jamie@example.com

Candidate B priya@example.com

The problem

Take-homes don't work
the way they used to.

A few years ago a take-home told you something. Today every candidate ships a clean PR generated mostly by an AI assistant. The PR isn't the signal anymore.

The output looks the same.

Whether the candidate carefully designed the solution or one-shot prompted the AI to write it, the PR usually looks fine. By the time you find out which it was, they're three months in.

Whiteboards still test the wrong things.

Inverting a binary tree never told you whether someone could design. Now that the actual job is "give the AI clear, well-structured context and review what comes back," whiteboards are even further from real work than they used to be.

The interesting signal is in the process.

How a candidate plans before prompting. Where they push back when the AI does the wrong thing. When they revert and try a different angle. None of this shows up in the final PR.

Before / after

What changes when you can see
the process, not just the output.

Gonfire isn't trying to replace your entire interview loop — culture and team-fit interviews still belong on the calendar. We're trying to replace the part where you stare at a take-home PR and try to guess what the candidate actually did.

Today 0 Minutes per candidate · staring at a diff

→

With Gonfire 0 Minutes per candidate · reviewing the assessment

Today's take-home review

Mostly guessing.

Send take-home prompt 5 min

Wait for submission 3–7 days

Read PR + reconstruct what happened 60–90 min

Schedule + run debrief about the take-home 60 min

Reviewer time~2 hrs

What you learnOutput looks AI-shaped. Unclear who steered.

With Gonfire

Process is observed.

Send Gonfire link 2 min

Candidate works async, configurable

Review the assessment 15–30 min

Optional debrief, anchored to specific moments 30 min

Reviewer time~45 min

What you learnHow they planned, prompted, recovered, decided.

Attribution

Behavior,
not byte-count.

We don't try to label individual lines as "human-written" vs. "AI-written." The candidate uses AI for the whole thing — that's the point. The label that matters is behavioral: which prompt led to which decision, where the candidate took the time to read the AI's output, and where they accepted it without review.

Behavioral Not "who typed what." Which prompt led to which decision, observed in order.

Decision points Where the candidate paused, reverted, or accepted AI output without review.

Per session The full interaction trail — not a single accuracy number to defend.

· [09:14] prompt: "explain the windowing logic" prompt

· [09:14] read AI output · paused 12s read

· [09:18] edited limiter.ts (+5 −2) edit

· [09:21] reverted edit, prompted again revert

· [09:24] prompt: "what edge cases am I missing?" prompt

· [09:27] accepted suggestion · no edits accept

· [09:31] ran tests · 14 passing test

Prompts

Reverts

Test runs

vs. the take-home, as it exists today

What we're actually
trying to do better.

	Today's take-home	Gonfire
What candidates work on	Whatever you set up	A repo or zip you specify
AI tools used	Whatever the candidate uses, invisible to you	Captured end-to-end
What you review	Output PR	Full assessment + final submission
What you learn about process	Inferred from commit history, if you're lucky	Observed directly
Reviewer time	2+ hours staring at a diff	~30 minutes reviewing the assessment
Setup cost for you	High — env, prompt, rubric, evaluation criteria	Low — paste a repo URL, write a brief
Setup cost for candidate	High — local environment, dependency hell	Low — minimal setup required

We don't position against HackerRank or CodeSignal — those platforms try to prevent AI use, which is the opposite of our approach.

Early signals

Working with a small set
of design partners.

Quotes here as we hear from them. If you'd like to try Gonfire on a real role and tell us what's broken, reach out.

Pricing

No pricing yet.

Gonfire is in early access. We're calibrating the product with a small set of design partners before publishing pricing. If you want to use it on a real role, get in touch — we'll set up an account and walk you through it.

FAQ

Things engineering leaders ask first.

What languages and frameworks work?+

The starter source can be any GitHub repo or zip. We've tested with TypeScript, Python, Rust, and Go.

Does the candidate know they're being recorded?+

Yes. The candidate setup page explicitly states that their AI interactions will be recorded and evaluated as part of the assessment. There's no covert capture.

Can I customize the rubric or what the analysis looks for?+

Not yet. Custom rubrics are on the roadmap.

Can I see a real assessment?+

Reach out and we'll walk you through one. We're admin-provisioning accounts for design partners.

Get in touch

See how a candidate actually thinks.

We're working with a few teams at a time. If you have a role open and a take-home you're tired of grading, we'd like to talk.

Request access

Early access · we respond within a day.