Article 03 · March 2026

The Agentic Development Cycle - How AI Agents Actually Build Software

March 19, 2026 · by Satish K C · 11 min read
Tags: Agents · Automation · LLMs · Engineering

The Big Idea

For three decades, writing software meant a human at a keyboard translating intent into syntax - one function, one file, one pull request at a time. The entire toolchain - IDEs, linters, debuggers, CI pipelines - was built around augmenting that human loop, not replacing it.

That model is being structurally renegotiated. Not because LLMs write good code (they often do), but because of something more fundamental: AI agents can now run the development loop itself. They read files, execute commands, observe outputs, form hypotheses, revise code, and re-test - iterating without a human intervening at each step.

This is the agentic development cycle. It is not faster autocomplete. It is a different unit of work. The shift is from "AI helps me write code" to "AI runs the implementation loop while I direct the outcome."

The key distinction: Traditional coding assistants (early Copilot, ChatGPT) operate in a single-turn pattern - you ask, it answers, you copy. Agentic systems operate in a multi-step loop - observe environment, plan action, execute tool, read result, adjust. The loop does not stop until the goal is met or the agent is blocked.

Before vs After - Where the Loop Lives

The clearest way to see the shift is to map where cognitive work sits in the traditional cycle versus the agentic one. The work does not disappear - it relocates.

Development Cycle - Traditional vs Agentic
[Figure: Development Cycle - Traditional vs Agentic. Left, the traditional cycle: human writes spec and design, codes line by line, tests and reads errors, debugs, reviews the PR, deploys - every step is human cognitive work. Right, the agentic cycle: human sets the goal and constraints; the agent loop (plans, codes, tests, reflects) runs autonomously, repeating until the goal is met or the agent is blocked; the human reviews the output and ships. The agent handles plan, code, test, debug, and iterate - all tool-mediated.]

Traditional Development

  • Human writes every line of implementation
  • Debugging requires reading and reasoning manually
  • Each iteration needs human re-engagement
  • Context lives in the developer's head
  • Bottleneck: how fast the human can type and think
  • Cost scales with developer hours

Agentic Development

  • Agent writes implementation from goal + constraints
  • Agent reads errors, forms hypothesis, fixes autonomously
  • Loop runs without human re-engagement per step
  • Context lives in tool call history + scratch files
  • Bottleneck: quality of goal specification
  • Cost scales with token usage, not developer hours

How It Works - The Observe-Plan-Act-Reflect Loop

At the core of every agentic system - Claude Code, Aider, Devin, Cursor Agent, GitHub Copilot Workspace - is the same underlying control loop. The names differ but the structure is consistent.

1. Observe - Read the codebase, file tree, error output, or test results. Build a model of current state.
2. Plan - Decompose the goal into discrete sub-tasks. Identify which files to read, which to edit, what commands to run.
3. Act - Execute tool calls: read file, write file, run bash, search codebase, fetch URL. One concrete action per step.
4. Reflect - Read tool output. Did the action succeed? If not, revise the plan. If yes, move to the next sub-task.

What makes this mechanically different from single-turn assistance is the tool call layer. The agent does not just generate text - it calls structured functions with typed parameters: read_file(path), write_file(path, content), bash(command), grep(pattern, directory). The output of each tool call becomes the next observation that feeds back into the loop.
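A tool router that turns such structured calls into real effects can be sketched in a few lines. The tool names mirror the ones in the text; the JSON wire shape (`name` plus `params`) is an assumption for illustration, not any vendor's actual format:

```python
import json
import subprocess
from pathlib import Path

def read_file(path):
    """Ground truth: return the file's actual contents."""
    return Path(path).read_text()

def write_file(path, content):
    """Apply an edit and report what was written."""
    Path(path).write_text(content)
    return f"wrote {len(content)} bytes to {path}"

def bash(command):
    """Run a shell command; stdout+stderr become the next observation."""
    proc = subprocess.run(command, shell=True, capture_output=True, text=True)
    return proc.stdout + proc.stderr

TOOLS = {"read_file": read_file, "write_file": write_file, "bash": bash}

def dispatch(tool_call_json):
    """Parse a structured tool call and route it to the matching function."""
    call = json.loads(tool_call_json)  # e.g. {"name": "read_file", "params": {"path": "app.py"}}
    return TOOLS[call["name"]](**call["params"])
```

Because every tool returns a string, its output can be appended directly to the context window, which is exactly how the result of one step becomes the observation that opens the next.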

Why this matters architecturally: Tool calls give the agent access to ground truth. Instead of hallucinating what a file contains, it reads the file. Instead of guessing whether a test passes, it runs the test. The loop anchors the agent in real state rather than generated state - which is what separates agentic systems from glorified autocomplete.

The Tool Call Architecture

The scaffolding layer is where the agentic cycle is actually implemented. Here is how a modern coding agent like Claude Code is structured internally:

Agentic Coding System - Internal Architecture
[Figure: Agentic Coding System - Internal Architecture. A human goal ("Add rate limiting to the /submit endpoint") feeds an LLM core (Claude / GPT-4o / Gemini) that reasons over the context window and emits tool calls as structured JSON. A tool router parses the tool name and params and dispatches to: read/write (file system access - read, write, glob), bash (run tests, install deps, execute any command), search (grep the codebase, find symbols and refs), and web/API (fetch docs and specs, call external services). All tool results are appended to the context window - the agent's memory - and re-read by the LLM on the next turn. A permission gate requires human approval of risky tool calls before execution.]

The permission gate is worth emphasizing. Production agentic systems - Claude Code being a direct example - do not execute high-risk actions (bash commands that modify the filesystem, git pushes, destructive operations) without surfacing them to the human for approval. This is not a UX nicety. It is a structural safety mechanism that keeps the human in the loop at the right granularity - not for every file read, but for every irreversible action.
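That tiering - auto-approve reads, gate anything irreversible - can be expressed as a thin wrapper around tool dispatch. The policy sets and function names below are illustrative, not Claude Code's actual implementation:

```python
# Illustrative permission tiers: read-only tools pass through,
# state-mutating tools require an explicit human yes.
AUTO_APPROVE = {"read_file", "grep", "glob"}          # safe, reversible
NEEDS_APPROVAL = {"write_file", "bash", "git_push"}   # can mutate shared state

def gated_dispatch(name, params, execute, ask_human=input):
    """Run a tool call through the permission gate before executing it."""
    if name in AUTO_APPROVE:
        return execute(name, params)  # no prompt: observation-only action
    if name in NEEDS_APPROVAL:
        answer = ask_human(f"Approve {name}({params})? [y/N] ")
        if answer.strip().lower() == "y":
            return execute(name, params)
        return "DENIED: human rejected the tool call"
    return f"ERROR: unknown tool {name}"
```

The design choice worth copying is the granularity: the human is interrupted only at irreversibility boundaries, so oversight stays cheap enough that nobody is tempted to disable it.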

Key Findings

Themes: tool-grounded reasoning · multi-step loops · context accumulation · permission layering · specification quality

  • 4-6 - average tool calls per agentic task completion in Claude Code sessions
  • ~70% - share of developer time that is non-implementation work agents can now absorb
  • 1 - new bottleneck: specification quality, not implementation speed

Why This Matters for AI and Automation Practitioners

The agentic development cycle is not a feature you enable - it is a fundamentally different working model that changes what skills matter, what failure modes to anticipate, and what "doing the work" means.

For automation practitioners specifically, this matters in two ways:

The emerging pattern: A practitioner who can write precise goal specifications, understand the permission model of their agent, and compose agentic coding sessions into larger automation pipelines is operating at a qualitatively different leverage point than one who is still typing every function manually. The skills compound: specification + direction + integration.
The risk: Agentic systems produce code that looks correct but lacks the architectural reasoning that produces maintainable systems. Ghost functions, redundant abstractions, and non-idiomatic patterns accumulate silently. The agent ships working code. Whether it is good code requires a human who can read it critically - not just run the tests.

My Take

The framing of "AI replacing developers" is the wrong frame. The more accurate observation is that the agentic development cycle raises the floor on what a single practitioner can build alone while shifting the ceiling toward people who can think clearly about systems - not people who can type faster.

The bottleneck has genuinely moved. I have watched myself go from spending most of a session on implementation to spending it on specification, review, and course correction. That is a material change in how the work feels. It is also a harder skill to develop than people assume - writing a goal that is precise enough for an agent to execute without accumulating errors across ten tool calls is a real discipline.

What I find most interesting is not the raw capability but the permission model. Claude Code's approach - auto-approve reads and searches, require approval for writes, require explicit approval for bash - maps closely to how you would want any autonomous system operating in a shared environment. It is a design pattern worth borrowing when you build your own agentic automation pipelines.

The agentic cycle is not the end state. Context windows will grow, memory architectures will mature, and agents will become capable of longer-horizon tasks without degradation. But the current moment - where scoped agentic loops work reliably and open-ended ones still need tight human steering - is a genuinely productive place to be building.

Discussion question: As agentic development tools continue to mature, does deep implementation knowledge become more or less valuable - and does the answer differ depending on whether you are an individual practitioner, a team lead, or an engineering organization making hiring decisions?
