OpenAI Harness Engineering In An Agent-First World

中文 | English

Summary

This source argues that once agents write nearly all code, the engineering bottleneck shifts from manual implementation to harness design, repository legibility, and feedback loops that let agents operate reliably with minimal human attention.

Source

Raw file: raw/openai/Harness engineering leveraging Codex in an agent-first world.md
Translated raw file: raw/openai/Harness engineering leveraging Codex in an agent-first world.zh.md
Original URL: https://openai.com/index/harness-engineering/
Author: Ryan Lopopolo
Published: 2026-03-11
Ingest date: 2026-04-08

Key Contributions

Reframes software engineering as environment design: humans steer, specify intent, and encode feedback loops while agents execute.
Treats repository-local documentation and plans as the system of record, with AGENTS.md serving as a map rather than an encyclopedia.
Emphasizes legibility to agents across UI behavior, logs, metrics, traces, architecture boundaries, and review workflows.
Argues that architecture constraints and custom lints are multipliers because they convert human taste and invariants into mechanically enforced guidance.
Connects high agent throughput to a different merge philosophy: cheap correction loops, short-lived PRs, and more review shifted agent-to-agent.

Strongest Claims

A fully agent-generated codebase can remain productive only if the repository itself exposes enough structure, tooling, and documentation for agents to reason directly from local context.
Human effort compounds best when invested in scaffolding and control systems rather than direct code writing.
Monolithic instruction files fail; progressive disclosure through a short map plus indexed deeper docs scales better.
Agent autonomy introduces entropy, so recurring cleanup passes and enforceable golden principles are required to prevent slop from compounding.

Practical Implications For This Vault

Keep AGENTS.md short enough to orient the model and push domain detail into linked canonical pages.
Treat topic pages and source pages as agent-facing infrastructure, not just human notes.
Prefer repository-local durable artifacts over relying on chat history or external documents.
Lint and ingest work should watch for places where structural guidance belongs in canonical pages rather than in transient conversation.

claude-code

everything-claude-code

claude-mythos-preview

multica-ai

piebald-ai

voltagent

awesome-design-md

design-md

codex

skills

llm-wiki

anthropic

claude-code

everything-claude-code

claude-mythos-preview

cocoon-ai

dbreunig

everyinc

garrytan

github

multica-ai

piebald-ai

voltagent

karpathy

openai

codex

configuration

skills

ralph

shanraisshan

OpenAI Harness Engineering In An Agent-First World

Summary

Source

Key Contributions

Strongest Claims

Practical Implications For This Vault

everything-claude-code

awesome-design-md

design-md

skills

llm-wiki

claude-code

everything-claude-code

claude-mythos-preview

multica-ai

piebald-ai

voltagent

codex

configuration

skills

OpenAI Harness Engineering In An Agent-First World ​

Summary ​

Source ​

Key Contributions ​

Strongest Claims ​

Practical Implications For This Vault ​

Related Pages ​

OpenAI Harness Engineering In An Agent-First World

Summary

Source

Key Contributions

Strongest Claims

Practical Implications For This Vault

Related Pages