autoresearch

Category: General · Risk: High · Rating: 4.5/5 (431) · Repository: mxyhi/ok-skills · License: Apache-2.0

Rating is derived from the repo's GitHub stars and shown for reference.

shell_execution, filesystem_access

name: autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
version: 2.1.4

Autoresearch — Autonomous Goal-directed Iteration

Safety Invariants (all subcommands)

  • Never push, publish, or deploy without explicit user approval.
  • Bounded by default. Override with Iterations: unlimited.
  • All results are logged to the autoresearch/{subcommand}-{YYMMDD}-{HHMM}/ directory.
  • Chained subcommands hand off via handoff.json. Evals reads *-results.tsv.
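
The logging convention above can be illustrated with a minimal Python sketch. The helper names `run_dir` and `results_files` are hypothetical and not part of the skill. The sketch also assumes that the `*-results.tsv` files live directly inside the run directory, which the text does not state.

```python
from datetime import datetime
from pathlib import Path


def run_dir(subcommand: str, now: datetime, root: Path = Path("autoresearch")) -> Path:
    """Build autoresearch/{subcommand}-{YYMMDD}-{HHMM} for a given start time.

    Hypothetical helper: only the naming pattern comes from the document.
    """
    return root / f"{subcommand}-{now:%y%m%d}-{now:%H%M}"


def results_files(directory: Path) -> list[Path]:
    """List files matching *-results.tsv (assumed to sit directly in the run directory)."""
    return sorted(directory.glob("*-results.tsv"))


if __name__ == "__main__":
    started = datetime(2025, 3, 9, 14, 5)
    print(run_dir("debug", started).as_posix())  # autoresearch/debug-250309-1405
```

Passing the timestamp in explicitly, rather than calling `datetime.now()` inside the helper, keeps the path reproducible in tests.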

Subcommands

| Command | Does | Default Iterations |
|---|---|---|
| (none) | Iterate against a metric: modify → verify → keep/discard | 25 |
| plan | Convert a goal into validated Scope, Metric, Verify config | N/A |
| debug | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
| fix | Crush errors one-by-one until zero remain | 20 |
| security | STRIDE + OWASP audit with red-team personas | 15 |
| ship | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
| scenario | Generate edge cases across 12 dimensions | 20 |
| predict | 5 expert personas debate before implementation | N/A |
| learn | Scout codebase → generate docs or wiki → validate → fix loop | 10 |
| reason | Adversarial debate with blind judges until convergence | 8 |
| probe | 8 personas interrogate requirements until saturation | 15 |
| improve | Research ICP challenges, discover improvements, generate PRDs | 15 |
| evals | Analyze iteration results: trends, plateaus, regressions | N/A |
| regression | Regression stability gate: baseline vs candidate, verdict STABLE/UNSTABLE | N/A |
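
The modify → verify → keep/discard cycle in the first row can be sketched as a small bounded loop. This is an illustration under assumptions, not the skill's implementation: the function `iterate` and its parameters `propose`, `measure`, and `higher_is_better` are hypothetical names, and the real skill works on files and a verify command rather than on an in-memory value. Only the default of 25 iterations comes from the table.

```python
from typing import Callable, List, Tuple, TypeVar

S = TypeVar("S")


def iterate(
    state: S,
    propose: Callable[[S], S],        # "modify": derive a candidate from the current state
    measure: Callable[[S], float],    # "verify": score a candidate against the metric
    iterations: int = 25,             # bounded by default, as in the table
    higher_is_better: bool = True,
) -> Tuple[S, float, List[Tuple[int, float, str]]]:
    """Keep a candidate only if it strictly improves the metric; otherwise discard it."""
    best = measure(state)  # baseline measurement before any change
    log: List[Tuple[int, float, str]] = []
    for i in range(1, iterations + 1):
        candidate = propose(state)
        score = measure(candidate)
        improved = score > best if higher_is_better else score < best
        if improved:
            state, best = candidate, score
        log.append((i, score, "keep" if improved else "discard"))
    return state, best, log


if __name__ == "__main__":
    # Toy metric peaking at x == 3; each proposal adds 1 to the current state.
    final, best, log = iterate(0, lambda x: x + 1, lambda x: -(x - 3) ** 2, iterations=5)
    print(final, best)                  # 3 0
    print([d for _, _, d in log])       # ['keep', 'keep', 'keep', 'discard', 'discard']
```

Requiring a strict improvement means a candidate that merely ties the current best is discarded, so the state only changes when the metric moves.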

Universal Flags

| Flag | Applies To | Purpose |
|---|---|---|
| Iterations: N | All looping | Set iteration count |
| Iterations: unlimited | All looping | Opt-in unbounded |
| --evals | All looping | Mid-loop checkpoints + final summary |
| --evals-interval N | All looping | Override checkpoint frequency |
| --chain <targets> | All | Sequential handoff after completion |
| --<subcommand> | All | Shorthand for --chain <subcommand> |
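
The shorthand rule in the last row can be sketched as an argument rewrite. The function `expand_shorthand` is a hypothetical helper, not the skill's parser. It makes one labeled assumption: because `--evals` is documented as its own flag, it is left untouched rather than expanded to `--chain evals`. The source does not say how that overlap is resolved.

```python
from typing import List

# Subcommand names taken from the Subcommands table.
# "evals" is deliberately omitted (assumption): --evals is documented as a separate flag.
CHAINABLE = {
    "plan", "debug", "fix", "security", "ship", "scenario", "predict",
    "learn", "reason", "probe", "improve", "regression",
}


def expand_shorthand(args: List[str]) -> List[str]:
    """Rewrite --<subcommand> as --chain <subcommand>; pass everything else through."""
    expanded: List[str] = []
    for arg in args:
        name = arg[2:] if arg.startswith("--") else ""
        if name in CHAINABLE:
            expanded += ["--chain", name]
        else:
            expanded.append(arg)
    return expanded


if __name__ == "__main__":
    print(expand_shorthand(["--evals", "--fix", "--chain", "ship"]))
    # ['--evals', '--chain', 'fix', '--chain', 'ship']
```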