Code Review

mcpammer_code_review is an AST-based code quality verification tool designed for AI-to-AI loops, not human review. It returns structured, actionable feedback that agents consume and act on directly—no buttons to click, no comments to read.

The Problem with Human-in-the-Loop Review

Traditional code review tools (including AI-powered ones like CodeRabbit) assume:

AI writes code → Human reads review → Human clicks buttons → Human decides

This model made sense when implementation was done by humans who needed oversight. But when AI agents write 20,000 lines overnight, test comprehensively, and iterate autonomously—what is the human clicking buttons adding?

Nothing.

The human isn't catching bugs—automated tests do that. The human isn't understanding the code—they can't deeply understand 20,000 lines in a review session. The human is performing review theater.

The MCPammer Model

We believe humans belong around the loop, not in it:

Human defines:
├── Initiative (what to build)
├── Success criteria (how we know it works)
├── Constraints (what AI can/can't do)
└── Requirements (what must be true)

AI executes:
├── Writes code
├── Runs tests
├── Self-reviews via mcpammer_code_review
├── Gets second_opinion if needed
├── Iterates until criteria pass
└── Ships

Human verifies:
└── Did the outcome match the intent? Yes/no.

The human judges results, not process. No line-by-line review.

Why This Matters

The Historical Pattern

Every abstraction layer has removed humans from implementation:

| Era | Human Role |
|-----|------------|
| Assembly | Write every instruction |
| C | Write algorithms, compiler handles instructions |
| Python | Write logic, runtime handles memory |
| Frameworks | Write business logic, framework handles plumbing |
| AI Agents | Write specifications, AI handles implementation |

This isn't a break from pattern—it's the next step.

The Brooks Insight (1986)

Fred Brooks identified in "No Silver Bullet":

"The hard part of building software is the specification, design, and testing of the conceptual construct, not the labor of representing it."

The bottleneck was never typing code. It was knowing what to build and why. AI just makes this explicit.

The Economic Forcing Function

Consider two teams:

Team A (Human-in-Loop):

  • AI writes → Human reviews (2-4 hours) → Fixes → Re-review (1-2 hours)
  • Cycle time: 1-2 days

Team B (Autonomous):

  • Human defines constraints → AI writes, tests, self-reviews, iterates → Ship
  • Cycle time: 2-4 hours

Team B ships 4-8x faster. Market pressure is unidirectional.

Tool Design

mcpammer_code_review

AST-based static analysis returning structured, actionable output.

Parameters:

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| file_paths | string[] | Yes | Files to review |
| repo_path | string | No | Repository context for cross-file analysis |
| checks | string[] | No | Specific check categories to run |
| severity_threshold | string | No | Minimum severity to report (error, warning, info) |

Response:

```json
{
  "passed": false,
  "summary": {
    "files_analyzed": 12,
    "errors": 2,
    "warnings": 5,
    "info": 8
  },
  "issues": [
    {
      "file": "src/api/users.ts",
      "line": 42,
      "column": 15,
      "severity": "error",
      "category": "security",
      "code": "SQL_INJECTION",
      "message": "Unsanitized user input in SQL query",
      "suggestion": "Use parameterized query: db.query('SELECT * FROM users WHERE id = $1', [userId])",
      "context": "const result = db.query(`SELECT * FROM users WHERE id = ${userId}`)"
    }
  ],
  "metrics": {
    "avg_complexity": 8.3,
    "max_complexity": 24,
    "test_coverage": 0.72,
    "duplication_ratio": 0.03
  }
}
```

Key Design Decisions:

  1. Structured output - JSON, not prose. Agents parse and act on it.
  2. Actionable suggestions - Include the fix, not just the problem.
  3. Metrics included - Quantitative data for automated gates.
  4. Pass/fail boolean - Enables simple gate checks in hooks.
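
Because the response is plain JSON, an agent can consume it with ordinary types instead of parsing prose. Below is a minimal TypeScript sketch: the field names mirror the Response example above, and `blockingIssues` is a hypothetical helper for illustration, not part of the tool.

```typescript
// Types mirroring the Response example above (illustrative, not a published SDK).
interface ReviewIssue {
  file: string;
  line: number;
  column: number;
  severity: "error" | "warning" | "info";
  category: string;
  code: string;
  message: string;
  suggestion?: string;
  context?: string;
}

interface ReviewResponse {
  passed: boolean;
  summary: { files_analyzed: number; errors: number; warnings: number; info: number };
  issues: ReviewIssue[];
  metrics: {
    avg_complexity: number;
    max_complexity: number;
    test_coverage: number;
    duplication_ratio: number;
  };
}

// Hypothetical helper: the blocking issues an agent must resolve before the gate
// passes. Each one already carries the suggested fix.
function blockingIssues(review: ReviewResponse): ReviewIssue[] {
  return review.issues.filter((issue) => issue.severity === "error");
}
```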

Check Categories

Security

OWASP Top 10 and common vulnerabilities:

| Code | Description |
|------|-------------|
| SQL_INJECTION | Unsanitized input in SQL |
| XSS | Cross-site scripting vectors |
| CMD_INJECTION | Shell command injection |
| PATH_TRAVERSAL | Directory traversal vulnerabilities |
| HARDCODED_SECRET | Credentials in source code |
| INSECURE_RANDOM | Weak random number generation |

Correctness

Logic and type errors:

| Code | Description |
|------|-------------|
| NULL_DEREF | Potential null/undefined dereference |
| UNREACHABLE_CODE | Dead code after return/throw |
| UNUSED_VAR | Declared but never used |
| TYPE_MISMATCH | Incompatible type assignment |
| INFINITE_LOOP | Loop without exit condition |
| RACE_CONDITION | Concurrent access without synchronization |

Performance

Efficiency issues:

| Code | Description |
|------|-------------|
| N_PLUS_ONE | Database query in loop |
| UNNECESSARY_ALLOC | Allocation inside hot loop |
| MISSING_INDEX_HINT | Query pattern suggests missing index |
| SYNC_IN_ASYNC | Blocking call in async context |
| LARGE_BUNDLE | Import increases bundle size significantly |

Maintainability

Code health metrics:

| Code | Description |
|------|-------------|
| HIGH_COMPLEXITY | Cyclomatic complexity > threshold |
| DEEP_NESTING | Nesting depth > 4 levels |
| LONG_FUNCTION | Function exceeds line limit |
| CODE_DUPLICATION | Substantial duplicate blocks |
| MAGIC_NUMBER | Unexplained numeric literal |

Integration with Autonomous Loops

As an Initiative Gate

Add code review as a criterion for initiative completion:

Initiative: Build user authentication
├── Criterion: All tests pass
├── Criterion: mcpammer_code_review passes with no errors
├── Criterion: Coverage > 80%
└── Criterion: No security issues

The agent iterates until all criteria pass. No human review needed.
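
A hedged sketch of what that gate can look like on the agent side: the thresholds come from the criteria above, the fields come from the Response schema, and the function itself is hypothetical glue, not an MCPammer API.

```typescript
// Hypothetical criteria check for the initiative above; field names follow the
// mcpammer_code_review Response example, thresholds follow the listed criteria.
interface Review {
  passed: boolean;
  issues: { severity: string; category: string }[];
  metrics: { test_coverage: number };
}

function initiativeCriteriaMet(allTestsPass: boolean, review: Review): boolean {
  return (
    allTestsPass &&                                                  // all tests pass
    review.passed &&                                                 // review passes with no errors
    review.metrics.test_coverage > 0.8 &&                            // coverage > 80%
    !review.issues.some((issue) => issue.category === "security")    // no security issues
  );
}
```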

As an Epic Hook

Configure code review as a gate hook on epic completion:

```json
{
  "hook_type": "gate",
  "event": "epic_complete",
  "action": "code_review",
  "action_config": {
    "severity_threshold": "error",
    "required_checks": ["security", "correctness"]
  }
}
```

Epic cannot complete until code review passes.

In the Iteration Loop

Typical agent workflow:

1. Agent claims ticket
2. Agent implements feature
3. Agent runs tests
4. Agent calls mcpammer_code_review
5. If issues found:
   a. Agent reads structured issues
   b. Agent applies suggested fixes
   c. Goto step 3
6. If passed: Agent marks ticket complete

The loop runs in seconds, not days.
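
Sketched in TypeScript below. `implementTicket`, `runTests`, `applyFix`, and `callTool` are hypothetical stand-ins for whatever the agent harness provides; only the mcpammer_code_review call and its response shape come from this page.

```typescript
// Hypothetical harness helpers; only the review call and its response shape
// are defined on this page.
declare function implementTicket(ticketId: string): Promise<string[]>; // returns changed file paths
declare function runTests(): Promise<boolean>;
declare function applyFix(issue: { file: string; line: number; suggestion?: string }): Promise<void>;
declare function callTool(name: string, args: object): Promise<any>;

async function workTicket(ticketId: string, maxIterations = 10): Promise<boolean> {
  const changedFiles = await implementTicket(ticketId); // steps 1-2

  for (let i = 0; i < maxIterations; i++) {
    await runTests(); // step 3 (handling test failures is omitted here)

    // Step 4: self-review via mcpammer_code_review.
    const review = await callTool("mcpammer_code_review", {
      file_paths: changedFiles,
      severity_threshold: "error",
    });

    // Step 6: passed, so the agent can mark the ticket complete.
    if (review.passed) return true;

    // Step 5: read the structured issues, apply the suggested fixes, loop back.
    for (const issue of review.issues) {
      await applyFix(issue);
    }
  }
  return false; // escalate instead of looping forever
}
```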

Comparison with Traditional Tools

| Aspect | Traditional (CodeRabbit, etc.) | mcpammer_code_review |
|--------|--------------------------------|----------------------|
| Output format | Prose comments for humans | Structured JSON for agents |
| Human required | Yes, to read and decide | No |
| Iteration speed | Hours (waiting for human) | Seconds (agent loops) |
| Integration | PR-based, GitHub UI | MCP tool, programmatic |
| Actionability | "Consider using..." | Exact fix provided |
| Gate capability | Manual approval | Boolean pass/fail |

Architecture

┌─────────────────┐
│   Claude Code   │
│ (or any agent)  │
└────────┬────────┘
         │ MCP call
         ▼
┌─────────────────┐      ┌─────────────────┐
│    MCPammer     │─────▶│   Tree-sitter   │
│   code_review   │      │   AST Parser    │
└────────┬────────┘      └─────────────────┘
         │
         ├─── Rule Engine (pattern matching)
         │
         ├─── Metrics Calculator
         │
         └─── Optional: second_opinion for deeper analysis

Why AST-Based

  1. Language agnostic - Tree-sitter supports 100+ languages
  2. Fast - Parse once, query many patterns
  3. Precise - Line/column accuracy for issues
  4. Semantic - Understands code structure, not just text
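
For a concrete feel, here is a sketch of the kind of check an AST makes cheap, using the Node.js tree-sitter bindings. This is an assumption for illustration, not MCPammer's actual rule engine; it flags SQL built from template strings, roughly what the SQL_INJECTION rule targets.

```typescript
import Parser from "tree-sitter";
import TypeScript from "tree-sitter-typescript";

const parser = new Parser();
parser.setLanguage(TypeScript.typescript);

const source =
  "const result = db.query(`SELECT * FROM users WHERE id = ${userId}`)";
const tree = parser.parse(source);

// Parse once, then query structurally: every template string in the file,
// with exact line/column positions.
for (const node of tree.rootNode.descendantsOfType("template_string")) {
  if (node.text.toUpperCase().includes("SELECT")) {
    console.log(
      `${node.startPosition.row + 1}:${node.startPosition.column + 1} ` +
        "possible SQL built from a template string"
    );
  }
}
```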

Optional Second Opinion Integration

For complex issues, code_review can internally call second_opinion:

```json
{
  "use_second_opinion": true,
  "second_opinion_threshold": "high_complexity"
}
```

This adds LLM-based analysis for issues that pattern matching can't catch.

The One-Year Prediction

By January 2027:

  1. Per-PR human review will be optional in most organizations
  2. Autonomous coding agents will handle 80%+ of implementation
  3. Tools designed for human-in-loop (CodeRabbit, etc.) will have pivoted or become irrelevant
  4. The senior engineer role will be explicitly about specification, not code review
  5. Constraint-based autonomous execution will be the standard model

This isn't speculation—it's extrapolation from the last 12 months of AI development.

When Human Review Still Makes Sense

We're not absolutists. Human review remains valuable for:

| Scenario | Why |
|----------|-----|
| Novel domains | Can't specify what you don't understand |
| Architectural decisions | Trade-offs require business context |
| Adversarial security | Humans may catch what AI systematically misses |
| Regulatory compliance | Some industries require human sign-off |

But notice: these are specification and judgment activities, not line-by-line code reading.

Getting Started

Prerequisites

  • MCPammer configured in your MCP client
  • Repository with supported languages (TypeScript, Python, Go, Rust, etc.)

Basic Usage

```json
{
  "tool": "mcpammer_code_review",
  "parameters": {
    "file_paths": ["src/api/users.ts", "src/api/auth.ts"],
    "severity_threshold": "warning"
  }
}
```
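
If you are driving the tool from a script rather than an agent, one hedged way to issue that same call is through the MCP TypeScript SDK. The server launch command below is an assumption; substitute whatever your MCP client configuration already uses for MCPammer.

```typescript
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StdioClientTransport } from "@modelcontextprotocol/sdk/client/stdio.js";

const client = new Client({ name: "review-driver", version: "0.1.0" });

// Assumed launch command for the MCPammer server; adjust to your setup.
await client.connect(new StdioClientTransport({ command: "mcpammer", args: ["serve"] }));

const result = await client.callTool({
  name: "mcpammer_code_review",
  arguments: {
    file_paths: ["src/api/users.ts", "src/api/auth.ts"],
    severity_threshold: "warning",
  },
});

console.log(JSON.stringify(result, null, 2));
```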

With Initiative Integration

  1. Create initiative with success criteria including code review
  2. Let agent iterate autonomously
  3. Review shipped outcome, not implementation process

Philosophy

The hard part of software was never typing code. It was:

  • Understanding what to build
  • Defining constraints and boundaries
  • Specifying success criteria
  • Identifying failure modes

Implementation was always commoditized labor. AI just makes this explicit.

Level 9 engineers—the ones who write design docs, not code—are the template for what all valuable engineers become. The rest is automation.


Artifact Reference

This page is based on the artifact: "The End of Human-in-the-Loop Implementation Review" (f2c3c8cc-5c55-4b14-9da6-b1ae873e84bb)

Created during the initiative to integrate CodeRabbit, which pivoted to building native autonomous code review instead.