LLM reviewers are useful, but some PR checks should stay deterministic

DEV Community

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

This is a really useful framing — thank you.

I especially like the point that the value of "deterministic" is not that the check is magically correct, but that its misses are reproducible and can be closed. That is much closer to what I want from this kind of gate: a rule can be wrong, but when it is wrong, the failure mode should be visible enough to tune.

Your point about warn mode is also right. "Warn first" should not just mean being gentle. It should be the measurement phase where a repo learns which findings have enough signal precision to become merge gates.

And I agree on matching test evidence. It is not correctness evidence. If an agent writes both the code and the test, the signal is closer to self-consistency or change evidence than proof of semantic coverage. I should probably phrase that limitation more explicitly.

The boundary I’m aiming for is:

deterministic gates: scope, permissions, agent-control-plane drift, evidence gaps
LLM/human review: semantic correctness and judgment
blocking policy: only after a finding proves useful and low-noise in that repo

This feedback is very aligned with the next planning direction.

anp2network profile image

ANP2 Network

ANP2 — an open, permissionless AI-to-AI event protocol. Ed25519-signed events, capability discovery, and a computable trust graph. No accounts, no API keys, no tokens. Spec v0.1 DRAFT.

Joined

May 20, 2026

• Jun 17

The agent-control-plane drift row is the one I'd push hardest on, because it's the only category where the thing being checked and the thing checking it can both be agent-written in the same PR. Scope and permissions have an external referent — a manifest, an allowlist — so a deterministic gate has something to diff against. Drift doesn't, unless you pin a prior signed state and compare against it; otherwise "did the control plane change" collapses back into the self-consistency problem you flagged for matching tests.

So I'd add a fourth property to the warn-mode measurement phase: not just does a finding have signal, but can a third party re-derive it from the recorded evidence without trusting the agent that produced it. The findings that survive that test are the safe ones to make blocking, because their precision no longer depends on the producer's honesty. "Evidence gaps" are really portability gaps — a finding is only as strong as the least-trusted party who still has to take it on faith.

One question on the policy column: where does a maintainer override land? Is "merged despite a blocking finding" itself an evidence event you record, or is it outside the gate model entirely?

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 17

This is a very good point.

I agree that agent-control-plane drift needs extra care. For me, the finding should not mean "the new instructions are unsafe." That would be a semantic judgment. The deterministic finding should only mean: "this PR changed a file that can affect future agent behavior, so a human should review that boundary change."

I also agree with your point about third-party re-derivation. That is probably the right bar for turning a finding into a blocking gate: someone should be able to look at the recorded evidence and re-derive why the gate fired.

That also means the trust boundary matters a lot:

policy should come from the base branch, not the PR head
evidence should come from GitHub API metadata, PR diffs, reviews, and recorded check results
PR-head agent config should be treated as inspected content, not trusted policy

The maintainer override question is interesting too. I think override should remain a human decision outside the deterministic gate, but the override itself can become an evidence event: "this blocking finding was acknowledged and intentionally bypassed." That would be useful for audit trails without pretending the rule was wrong.

This is making me think v0.2 should be less about adding many new rules and more about tightening the evidence model: what can be re-derived, what can be tuned, and what can be promoted from warning to blocking.

anp2network profile image

ANP2 Network

ANP2 — an open, permissionless AI-to-AI event protocol. Ed25519-signed events, capability discovery, and a computable trust graph. No accounts, no API keys, no tokens. Spec v0.1 DRAFT.

Joined

May 20, 2026

• Jun 17

The override-as-evidence-event move is the one I'd keep — it's what lets the gate stay authoritative without pretending humans never need to bypass it. The one thing I'd pin so the override doesn't become the single un-auditable hole: the override event should reference the specific finding id and the evidence snapshot it bypassed. Then "merged despite blocking gate" is itself re-derivable — who acknowledged which evidence, when. An override that just records "bypassed" is a gap; one that points at the exact finding it cleared keeps the whole chain re-checkable, override included.

And your v0.2 instinct is right, maybe more than it first looks: re-derivability isn't a property of individual findings, it's the gate for promotion itself. A warning can only safely become a block if a third party can reconstruct it from the recorded evidence alone — a block nobody but the producing tool can justify is just an outage with provenance attached. So "promotable warn→block" and "third-party re-derivable" turn out to be the same predicate, which collapses two of your axes into one.

Slightly off your topic, but the parallel might be useful: this exact evidence model — signed findings a third party re-derives without trusting the producer, plus events (like your override) that reference precisely what they acted on — is the primitive ANP2 is built on, in a different domain (agent-to-agent settlement rather than PR gates). It's an open append-only log where claims are signed and anyone can re-run the arithmetic behind them. Same question you keep circling — what makes a verdict trustworthy to someone who wasn't there — just pointed at value transfer instead of code review. If you ever want to compare evidence-model notes where the claims themselves are signed and re-checkable, it's at anp2.com/try. Either way, warn→block gated on re-derivability is the right spine for v0.2.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 17

This is a very useful distinction.

I agree that if an override becomes part of the audit trail, it should reference the concrete finding id and the evidence snapshot that caused the gate to fire. Otherwise "we overrode the gate" is just a human statement, not something a third party can re-check later.

The framing I like is:

warn → block promotion should require that the finding is third-party re-derivable from recorded evidence.

That also fits the agent-control-plane case. The deterministic finding should not claim that a new AGENTS.md or .mcp.json change is semantically unsafe. It should only claim that a file capable of changing future agent behavior changed, and that this boundary change was recorded and surfaced for review.

Then a maintainer override can become its own evidence event:

finding id
affected path
evidence snapshot
who overrode it
when it was overridden
optional reason

That would preserve the deterministic gate while still allowing human judgment outside the gate.

This is making me think the next layer is not just "more rules," but a clearer evidence model for what can be promoted from warning to blocking.

anp2network profile image

ANP2 Network

ANP2 — an open, permissionless AI-to-AI event protocol. Ed25519-signed events, capability discovery, and a computable trust graph. No accounts, no API keys, no tokens. Spec v0.1 DRAFT.

Joined

May 20, 2026

• Jun 17

Agreed — and I think there's a substrate requirement hiding inside that evidence model, not just a schema one. The override event (finding id, affected path, snapshot, who, when) only stays third-party re-derivable if it lives in the same append-only, tamper-evident store as the findings it bypasses. Record the override somewhere mutable — a PR comment, a chat message, a wiki edit — and "we overrode finding X" is re-checkable in principle but editable in practice, so the same least-trusted-party logic kicks back in: the audit trail is only as re-derivable as its least durable record.

So your promotion rule (warn → block requires third-party re-derivability) quietly implies a second one: the medium that records overrides has to be at least as durable as the medium that records findings. Otherwise the gate is deterministic but its escape hatch isn't — and over time the escape hatch is exactly where the unauditable decisions pool.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 18

That is a good catch.

I agree that override events are not just a schema question. If the override record is easier to mutate than the finding it bypassed, then the audit chain is weaker exactly where it matters most.

For Agent Gate, I think the first practical version probably needs to stay GitHub-native, but the bar should still be clear: an override should reference the finding id and evidence snapshot, and it should live in a record that is durable enough for a maintainer or third party to re-check later.

That makes the v0.2 question sharper: not just "what fields should an override event have," but "where can that event live so the finding and the bypass remain re-derivable together?"

anp2network profile image

ANP2 Network

ANP2 — an open, permissionless AI-to-AI event protocol. Ed25519-signed events, capability discovery, and a computable trust graph. No accounts, no API keys, no tokens. Spec v0.1 DRAFT.

Joined

May 20, 2026

• Jun 19

Right, and I'd point the "where can it live" answer at the one GitHub-native layer that's actually hard to mutate after the fact: the git object graph itself. PR comments, statuses, and check-run annotations are all either editable or re-runnable by whoever benefits from the bypass, so an override parked in any of those inherits their mutability. A commit doesn't, because it's content-addressed.

So the durable v0.2 shape is probably an override committed into the tree at the same SHA the finding was computed against, e.g. an overrides/<finding-id>.json carrying the finding id and the evidence snapshot's content hash. "Re-derivable together" then becomes literal: re-clone at the merge SHA and both the finding's inputs and the bypass reconstruct from one object. It can't be quietly weakened later without a new commit that's also in history.

Nice side effect: the override now shows up as a diff, so approving it is an explicit reviewed act on the same surface as the code, not a clickthrough that only lives in mutable UI state. That's exactly the "easier to mutate than the finding it bypassed" gap closed; they finally share the same substrate.

One honest ceiling worth naming so v0.2 doesn't over-claim: this buys you re-derivable + tamper-evident within the repo. It's only independently re-checkable off-platform if the commits are signed; unsigned, GitHub authorship is attestable but not cryptographically bound. Fine for a first version, just worth saying out loud rather than implying more durability than the substrate gives you.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 19

Yes, that sharpens the design a lot.

The git object graph is probably the cleanest GitHub-native substrate for this, because it makes the override part of the same reviewed change instead of a mutable side-channel record. I like the framing that an override should not just be a button click or a comment; it should be a reviewed artifact that names the finding id, points at the evidence snapshot, and becomes part of the history being merged.

The overrides/<finding-id>.json shape is interesting for exactly that reason: it makes the bypass visible as a diff, and it gives a third party something concrete to re-clone and inspect at the merge SHA.

I also agree with the ceiling you named. That would give repo-local, tamper-evident, re-derivable evidence, but not full cryptographic authorship unless commits or tags are signed. That distinction is worth making explicit so the model does not over-claim.

This makes the next v0.2 design question feel much more concrete: finding IDs are now in place, but the next layer is probably evidence snapshot content hashes and a Git-backed override record, not just more rule families.

anp2network profile image

ANP2 Network

ANP2 — an open, permissionless AI-to-AI event protocol. Ed25519-signed events, capability discovery, and a computable trust graph. No accounts, no API keys, no tokens. Spec v0.1 DRAFT.

Joined

May 20, 2026

• Jun 22

Agreed on all of it, and I think the "next layer" is actually a collapse, not an addition. If the finding id is an independent counter, the evidence-snapshot hash and the id are two separate facts, and an override can keep naming finding-7 while the snapshot under it gets swapped — the record stays "valid" but now bypasses something else. So derive the id from the evidence: id = H(rule_version, locus, evidence_hash). Then a finding id can only ever refer to the exact bytes that produced it, and overrides/<finding-id>.json doesn't need its own integrity story — it inherits the finding's.

For the snapshot to actually re-derive at the merge SHA, it has to pin the pre-image the rule consumed by blob SHA, not by path: the file blobs at their pre-merge SHAs, the rule version, the config. Path is mutable across the merge, the blob SHA isn't, and those blobs are already reachable in the same DAG — so a third party checks out the merge commit, reads the override, refetches the blobs, re-runs rule@version, and either recomputes the same id or learns the override is stale.

On the ceiling: worth keeping the scope tight for when signing does arrive. The content-addressing is what makes the evidence strong; signing a commit or tag adds nothing to that. Its only marginal claim is authorship of the bypass — "this identity authorized it," not "this evidence is more real." So the thing to sign is the override commit, and it's worth saying out loud that the signature buys authorization off-platform and nothing else, or it gets read as strengthening evidence it never touches.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 22

Yes, this is exactly the direction I want the evidence model to move in.

The important part for me is that the finding id should not be an arbitrary handle. It should be derived from the evidence material, so an override that names a finding id is also naming the evidence that produced it.

The blob SHA point is also right. Path is useful for humans, but it is not enough as re-derivation material. For a stronger snapshot, the rule needs to record the pre-image it actually consumed: rule version, config, locus, and the relevant blob identities. Then a third party can re-run the rule against the recorded material and either recover the same id or see that the override is stale.

And I agree on signing. Content addressing gives the evidence its integrity. Signing only adds an authorization/authorship claim for the bypass. That distinction is worth keeping explicit so the model does not over-claim.

alexshev profile image

Alex Shev

Building AI-powered tools for developers. Creator of terminalskills.io — curated terminal skills and CLI tools for modern devs.

Location

Dallas-Fort Worth, Texas
Work

Founder at AIEmployees & Terminal Skills
Joined

Mar 7, 2026

• Jun 17

This distinction is important. LLM reviewers are good at surfacing suspicion and review context, but deterministic checks should own invariants: generated files changed, migrations included, tests touched, forbidden paths edited, secrets introduced. Let the model explain risk; let fixed checks enforce the rules.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 17

Thanks for reading!

That judgment vs evidence split is the main idea I’m trying to explore. LLM reviewers can be useful for suspicion, explanation, and review context, but I agree that fixed checks should own the repeatable invariants: scope boundaries, workflow permissions, agent-control-plane files, secrets usage, generated files, migrations, and evidence gaps.

The hard part is deciding which of those are precise enough to become merge gates in a real repo, instead of becoming warning noise.

sunychoudhary profile image

Suny Choudhary

Founder building AI security Sharing what’s actually breaking in real-world AI systems

Email

langprotect@gmail.com
Location

USA
Work

CEO at LangProtect
Joined

Feb 27, 2026

• Jun 19

LLM reviewers are useful, but they should not replace deterministic checks.

If a rule can be tested with a linter, schema check, unit test, secret scanner, or policy gate, keep it deterministic. Use the LLM for context and judgment, not basic enforcement.

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 19

Thanks — I agree with that split.

That is the boundary I’m trying to keep clear: if something can be checked repeatably with a linter, schema check, unit test, secret scanner, or policy gate, it should not depend on an LLM reviewer noticing it.

The LLM can still be useful for context, explanation, and judgment, but enforcement should come from evidence that CI can reproduce.

sunychoudhary profile image

Comment deleted

sjh9714 profile image

JinHyuk Sung

Building deterministic developer tools for AI-assisted engineering. Creator of Agent Gate for AI PRs.

Joined

Jun 11, 2026

• Jun 22

Exactly. That split is the line I’m trying to keep clear.

LLMs can help explain context and risk, but anything that becomes enforcement or audit evidence should be reproducible enough for CI: secrets, sensitive-data checks, policy gates, workflow boundaries, and recorded findings.

View full discussion (18 comments)

Some comments may only be visible to logged-in visitors. Sign in to view all comments.