Agentic Foundation

What it measures

Agentic Foundation answers a practical question: if someone (human or agent) opens this repo cold, how much friction will they hit?

Each connected GitHub repo gets a 0–100 score built from nine dimensions: docs, tests, CI, structure, agent context files, and similar signals. Lower scores aren’t moral failures. They’re a prioritized to-do list.

The headline number is not “how good your code is.” It measures readiness for AI-assisted development: can an agent find instructions, run tests, understand layout, and work within reasonable context limits?

What’s in this phase

Four screens work together. You can jump between them anytime after onboarding.

Readiness scores

Overall and per-dimension scores, confidence, and scan history for each repo.

The 9 dimensions

Agentic Readiness, tests, delivery, token efficiency, and the rest of the rubric.

AI Resolution Hub

Cross-repo fix queue sorted by impact. Start here when you want scores to move.

GitHub setup

Required input: which repos appear, what we read during a scan, and how access works.

Prerequisites

Agentic Foundation needs a connected GitHub App installation. Only repositories you granted at install time appear in the list. If a repo is missing, check GitHub integration for org, access scope, and repository selection.

After onboarding, the platform queues an initial scan for every visible repo. You don’t have to click anything for the first pass. Use Analyze when you want a fresh run after pushing fixes.

How a scan works

1

Find repos
Anything your GitHub App can access shows up in the Agentic Foundation list. Repos added or removed in GitHub App settings usually sync within a few minutes.

Example: Acme installs the app on acme-corp with access to payments-api, web-checkout, and handbook. All three appear on the dashboard after sync.
2

Queue a scan
Select repos and click Analyze. The scan runs in the background; you can leave the page. If a scan is already running, the dashboard reuses that run instead of starting a duplicate.

Example: The first scan for all three repos queues automatically after onboarding. Later, an engineer merges an AGENTS.md into payments-api and clicks Re-scan on that row only.
3

Read the breakdown
Overall score plus per-dimension detail on each repo. Open a repo to see individual criteria marked Met, Partial, Gap, N/A, or Blocked.

Example: payments-api scores 58 (Fair). Agentic Readiness is low (no AGENTS.md), Test Coverage is moderate (Jest present but no CI gate), and Delivery Readiness shows a Dockerfile plus GitHub Actions workflow.
4

AI Resolution Hub
Cross-repo list of the highest-impact fixes. Tackle these when you want scores to move.

Example: The hub surfaces “Add agent instructions” for payments-api and “CI does not run tests” for web-checkout ahead of minor gaps in handbook, because severity and dimension weight rank higher-impact items first.

Connected repos re-scan on roughly a 24-hour cadence. Manual re-scan anytime from the dashboard.

Example: first week with three repos

This walkthrough shows how the four steps play out for a typical team. Names are fictional; the flow matches what you see in the app.

Repo	What it is	Day 1 (auto scan)	After fixes + re-scan
`payments-api`	Node/TypeScript API	54 Fair. Gaps: no `AGENTS.md`, thin README, tests not wired in CI.	71 Good after adding `AGENTS.md`, expanding README build commands, and a test job in `.github/workflows/ci.yml`.
`web-checkout`	React SPA	68 Good. Partial: `.cursorignore` missing; CI runs build but not tests.	76 Good after ignore file and a `npm test` CI step.
`handbook`	Internal docs (MkDocs)	82 Excellent. Many delivery rows show N/A (no Dockerfile expected).	Unchanged. Docs repos often score high on hygiene with legit N/A on runtime criteria.

Day 1: GitHub connects during onboarding. Initial scans queue for all three repos. The engineering lead opens AI Resolution Hub and picks the top two items affecting payments-api.

Day 3: A PR merges with AGENTS.md and CI test wiring. They click Re-scan on payments-api. About ten minutes later the score updates and those hub items drop off.

Day 7: Scheduled re-scan runs overnight. No manual action needed unless they pushed more changes.

One fix, several queue items

Adding a solid AGENTS.md with copy-pasteable build and test commands can clear gaps in Agentic Readiness, Codebase Hygiene, and Token Efficiency at once. Re-scan to see the ripple effect.

What a scan inspects

Scans work from a read-only snapshot of the repository file tree at scan time. They do not run your app in production or read live secrets.

Signal category	What we look for	Concrete examples
Agent & contributor docs	Instructions a human or agent can follow on day one	`README.md`, `AGENTS.md`, `CLAUDE.md`, `CONTRIBUTING.md`
AI tool configuration	Rules, skills, and ignore scope for Cursor or Claude	`.cursor/rules/`, `.cursor/skills/`, `.cursorignore`, `.claude/`
Build & test entrypoints	Non-interactive commands in docs or CI	`npm test`, `mvn verify`, `bundle exec fastlane test` in README or workflow YAML
Quality & delivery	Lint, tests, containers, deploy config	`eslint.config.js`, `vitest.config.ts`, `Dockerfile`, `.github/workflows/`
Contracts & governance	APIs and change control	`openapi.yaml`, `CODEOWNERS`, branch-protection evidence via GitHub
Context efficiency	Files and docs sized for agent context windows	`.cursorignore` excluding `node_modules/`, layered docs under `docs/`

We do not keep a long-term copy of source code. The snapshot is discarded when scoring finishes. See GitHub integration for the full data-handling summary.

Common repo situations

The rubric adapts to repo type. A mobile app and a backend API are scored against different expectations.

Situation	Example repo	What to expect
Backend / API	`payments-api`, `user-service`	Full bars for CI, contracts, observability hooks where they belong in-repo.
Web SPA	`web-checkout`, `admin-portal`	Frontend-appropriate tests and delivery; no backend-only tooling required in-repo.
Mobile	`acme-ios`, `acme-android`	Xcode or Gradle signals instead of Docker; ignore files should exclude `build/` and `DerivedData/`.
Monorepo	`platform` with `apps/` and `packages/`	Scored on dominant layout at the root; secondary stacks noted but not double-counted.
Docs-only	`handbook`, `architecture-notes`	Delivery and runtime criteria often N/A; they do not count against you.
Infra elsewhere	`checkout-service` with Terraform in `acme-infra`	Delivery rows may show Blocked until infra evidence is reachable or documented.
Unfamiliar stack	Legacy COBOL mirror, niche DSL	Scan still runs; confidence may stay low when fewer recognizable signals are found.

N/A is not a penalty. Criteria marked N/A are excluded from that dimension’s score. A docs repo without a Dockerfile is expected, not a gap.

Timing, cadence & limits

Topic	Typical behavior
First scan	Queued automatically after GitHub connect and onboarding.
Scheduled re-scan	About every 24 hours per connected repo.
Manual re-scan	Anytime via Analyze or Re-scan on a repo row.
Scan duration	Usually 5–15 minutes for typical repos; large monorepos can take longer. Progress shows on the dashboard.
Concurrent runs	One active scan per repo at a time; duplicate requests attach to the in-flight run.
Score freshness	Scores reflect the last completed scan. Pushed fixes won’t move numbers until the next successful run.

If a scan fails (GitHub unreachable, tarball error, timeout), the previous score stays visible and the repo shows a failed status. Retry with Analyze once the underlying issue is resolved.

Criterion outcomes (quick reference)

Status	Meaning	Example
Met	Signal present and satisfies the bar.	`AGENTS.md` with concrete `npm test` and `npm run build` lines.
Partial	Something there but incomplete.	README mentions tests but no exact command to copy.
Gap	Expected signal missing. Feeds AI Resolution Hub.	No CI workflow file in a service repo.
N/A	Doesn’t apply to this repo. Excluded from scoring.	No Dockerfile expected for `handbook`.
Blocked	Couldn’t verify at scan time. Counts as not met until evidence is available.	K8s manifests live in another repo the scanner couldn’t reach.

Confidence on a dimension reflects how much evidence we found, not statistical test coverage. A repo with zero test files produces low confidence on Test Coverage before you even interpret the score.

For how overall and dimension scores combine, see Readiness scores.

Score bands

Band	Range	Rough read
Excellent	85–100	Agents and new hires should settle in quickly.
Good	65–84	Solid; a few targeted fixes left.
Fair	40–64	Meaningful gaps; start with the Resolution Hub.
Needs attention	0–39	Structural issues; don’t ignore the top items.

Bands are guides, not gates. A repo at 72 with a critical Gap in Agentic Readiness may still be painful for agents until that one item is fixed.

What scores don’t measure

Agentic Foundation is deliberately bounded. It does not:

Judge code correctness, security vulnerabilities, or performance in production
Replace human code review or compliance audits
Measure team productivity (that’s under Insights)
Guarantee an AI agent succeeds on every task. It measures repo affordances, not model quality.
Penalize you for choosing a stack we don’t have deep heuristics for. Those repos may show lower confidence instead.

Use scores to prioritize documentation, structure, and automation improvements that make both humans and agents more effective.

AI Driven SDLC Documentation

Agentic Foundation

What it measures

What’s in this phase

Readiness scores

The 9 dimensions

AI Resolution Hub

GitHub setup

Prerequisites

How a scan works

Example: first week with three repos

What a scan inspects

Common repo situations

Timing, cadence & limits

Criterion outcomes (quick reference)

Score bands

What scores don’t measure

Next