Skip to content

Orcho in 5 minutes

Plain-language start

Your AI agent says “done” — then you find it quietly broke something that worked, or skipped the test. Orcho sits around the agent like a supervisor: it makes the agent plan first, checks the result, and refuses to call unfinished work “ready”. This page follows one task through Orcho, start to finish, in plain words.

plain words firstcommands nextreference later

It starts with one task — the thing you want done.

Add validation to the login endpoint

In a normal agent session, that task becomes one long chat. You have to remember what was planned, what changed, whether the tests ran, and whether the final “done” is true. Orcho keeps the task inside a run — a single tracked piece of work with a beginning, checks, and an end.

Before anything happens, Orcho needs to know how careful to be. That is all a profile is: not a theme, not a setting screen — just the level of strictness for this task.

Tiny fix → lightFixing a typo or a label does not need a plan and a review. A small-task profile just does it.New feature → fullAdding real functionality deserves the whole journey: plan, code, review, repair, final check.Not sure → let Orcho pickDescribe the task and Orcho suggests the right profile for you.

For this story the task is a real feature, so it uses the feature profile — the careful one.

Orcho Run 20260629_1427 profile=feature

That single line means: this work gets planned, written, reviewed, repaired if needed, and checked before it counts as ready. Want the deeper model later? Read Profile semantics.

With feature, Orcho runs the task through a fixed set of steps, in order:

plan → write code → review → fix if needed → final check → done

Each step has one job. Planning happens before code. Reviewing is separate from writing, so the work gets a real second look. The final check is its own gate, so “the code compiles” and “this is ready to ship” are not the same thing.

The shorthand you’ll see in the terminal

Orcho shows those same steps compressed into one line. It looks dense at first — here is how to read it:

⟳² (▶ plan [Claude] → · validate_plan [Codex]) → · implement [Claude]
→ ⟳² (· review_changes [Codex] → · repair_changes [Claude])
→ · final_acceptance [Codex]
⟳² this part can repeat (up to a limit) the step running now· a step waiting its turn[Claude] the agent doing the work[Codex] a different agent doing the review

Two different agents on purpose: one writes, another reviews — so the work does not get to grade its own homework. See Watch the run for the full live view.

The “write code” step is where the AI agent actually edits your project. Orcho does not replace the agent — it still does the real work.

The difference is that the agent is now boxed inside steps:

  • the task already has a profile, so the agent knows how careful to be;
  • the plan came first, so it is not improvising;
  • the review step that comes next knows what to check;
  • the proof at the end points to real files, not vibes.
[IMPLEMENT] edited api/auth.py and tests

This is the moment that matters. Not when the agent says “done” — when a review step decides whether the work is actually ready.

[REVIEW_CHANGES] verdict=REJECTED
blocker missing negative-path test
fix add a test for invalid input
[REPAIR_CHANGES] added the missing test

A normal chat would have ended at a confident “done”. Orcho stopped, named the exact problem, sent it back to be fixed, and kept that decision in plain view. That is the whole pitch: it catches the “looks done but isn’t” case that costs you hours.

Want the deeper version of this idea? Read False-ready delivery.

The last step asks one question: is this really ready to ship?

[FINAL_ACCEPTANCE] verdict=APPROVED
ship_ready yes

If it says no, you do not get a vague failure. You get the blocker and the next move — fix it, follow up, or pause for your decision. Either way, nothing broken slips through wearing a “done” label.

When the run ends, you are not stuck trusting a chat scrollback. Orcho leaves files behind:

output.log the full run, as it happened
events.jsonl the timeline of what happened, in order
plan.md the plan it agreed to follow
review.json what the review found
diff.patch exactly what changed in your code

You — or a teammate, or a tool — can open these later and see what happened, why it was accepted, and what was checked. No need to reconstruct it from memory.

Read Evidence bundle for what the proof contains, or Read the result for the first thing to open after a run.

Run quickstartInstall Orcho, run one task, and watch it happen for real.Let your agent drive itThe lighter way in: let the AI client you already use run Orcho over MCP.Watch a live runLearn to read the terminal stream as it works.Understand profilesThe one idea worth learning early: how strict each run should be.Open the docs mapPick your next layer when you want more depth.