[Pelis Agent Factory Advisor] Agentic Workflow Maturity Report — March 2026 #1224
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-03-18T03:23:27.521Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Executive Summary
The
gh-aw-firewallrepository has an exceptionally mature agentic workflow infrastructure with 21 agentic workflows already in place — covering security, CI diagnosis, documentation, testing, release, and issue management. Compared to the Pelis Agent Factory's 100+ workflows, this repo scores 4/5 on the maturity scale and has several clear opportunities to close the remaining gap, particularly in meta-observability, issue triage, and continuous code quality improvement.🎓 Patterns Learned from Pelis Agent Factory
From the Documentation Site
The Pelis Agent Factory emphasizes these key patterns:
From the Agentics Reference Repository
The
githubnext/agenticsrepo contains these additional patterns not yet in this repo:grumpy-reviewer— Adversarial code reviewer that finds non-obvious issuesdaily-test-improver— Incremental test coverage (similar totest-coverage-improverbut daily)sub-issue-closer— Auto-closes completed sub-issuescontribution-check— Validates PRs against contribution guidelinesweekly-repo-chronicle— Weekly changelog-style summary of what changedissue-arborist— Organizes issues into parent/sub-issue hierarchiespr-nitpick-reviewer— Light-touch quality review on every PRHow This Repo Compares
What this repo does exceptionally well:
Gaps compared to Pelis Factory best practices:
📋 Current Agentic Workflow Inventory
build-testci-cd-gaps-assessmentci-doctorcli-flag-consistency-checkerdependency-security-monitordoc-maintainerissue-duplication-detectorissue-monsterpelis-agent-factory-advisorplan/planslash commandsecret-digger-claude/codex/copilotsecurity-guardsecurity-reviewsmoke-chrootsmoke-claude/codex/copilottest-coverage-improverupdate-release-notes🚀 Actionable Recommendations
P0 — Implement Immediately
🏷️ Issue Triage Agent
What: Auto-label incoming issues with appropriate labels (
bug,enhancement,documentation,question,security,performance) and post a welcoming comment summarizing the issue category.Why: The
issue-monsterworkflow assigns issues to Copilot for automated resolution, but currently has no labeling stage upstream. Adding labels would help issue-monster prioritize work and make the issue tracker useful for humans browsing by category. Currently there are 15+ open issues with inconsistent labeling.How: Add a new workflow triggered on
issues: [opened, reopened]using theissue-triagepattern from Pelis Factory. Usesafe-outputs: add-labelsandadd-comment.Effort: Low — well-established pattern, ~20 lines of workflow markdown
Example:
P1 — Plan for Near-Term
🔍 Workflow Health Manager / Audit
What: A meta-workflow that audits all agentic workflow runs (costs, error rates, turn counts, recurring failures) and creates issues when workflows are underperforming or failing repeatedly.
Why: Looking at the open issues, there are 5+
[agentics]failure issues open simultaneously (secret-diggers, smoke-codex, issue-monster, CI doctor all failing). A health manager would detect these patterns proactively and create consolidated diagnostic issues rather than having CI Doctor open individual issues. The Pelis Factory's Audit Workflows workflow created 93 discussions and contributed to 9 issues from which downstream agents fixed things.How: Use
agentic-workflowstool to fetch recent run data, analyze failure patterns, calculate cost trends. Create issues for repeated failures. Run daily.Effort: Medium — requires interpreting workflow run data
What: A workflow that detects potentially breaking changes in PRs — CLI flag removals/renames, type changes in public APIs, container image interface changes, domain whitelist format changes.
Why: AWF is a tool used by other teams and integrated into larger pipelines. A breaking change to
--allow-domainssyntax or container entrypoint could silently break downstream users. Currently there is no automated detection. Related issue:#1001(LD_PRELOAD breaking Deno scoped permissions) is an example of a compatibility problem that wasn't caught before shipping.How: Trigger on PR, diff
src/cli.tsoptions,src/types.ts,containers/agent/entrypoint.sh, andaction.yml. Compare against main branch to flag removals or signature changes.Effort: Medium
🤔 PR Code Quality Reviewer (Grumpy Reviewer)
What: An adversarial code reviewer triggered on every PR that looks for non-obvious issues: off-by-one errors, missing error handling, inconsistent naming, logic that seems correct but has edge cases. Different from
security-guard(which is security-focused).Why: The current
security-guardis excellent at security boundaries, but there's no general code quality reviewer. Thegrumpy-reviewerpattern in agentics deliberately takes a skeptical perspective to surface subtle bugs. Several recent PRs (test coverage PRs #1161, #1162, #1163) could benefit from code review.How: Use
githubnext/agentics/grumpy-revieweras a template. Trigger on PR, usesafe-outputs: add-comment(hidden older comments to avoid spam), 10-minute timeout.Effort: Low — template available in agentics repo
P2 — Consider for Roadmap
🧹 Continuous Simplicity / Code Cleanup PRs
What: A weekly or daily workflow that identifies overly complex code, deeply nested conditionals, functions that are too long, and proposes simplification PRs.
Why: As the codebase grows (now 24 source files in
src/), complexity creeps in. The Pelis Factory's "Continuous Simplicity" workflow had 22 merged PRs out of 28 proposed (78% merge rate), showing high signal quality. For a security tool, simpler code = fewer bugs.Effort: Medium — needs focused scope to avoid spurious PRs
🔀 Mergefest — Auto-Sync PR Branches with Main
What: A workflow that automatically merges the
mainbranch into open PRs that are behind, eliminating the "please merge main" ceremony.Why: Several current open PRs (#1079, #1150, #1163) are long-lived and will experience merge conflicts. With active development on main, keeping PRs current is manual overhead.
Effort: Low —
mergefestpattern available directly from Pelis Factory:gh aw add-wizard https://github.com/github/gh-aw/blob/v0.45.5/.github/workflows/mergefest.md🦠 Daily Malicious Code Scan
What: A workflow that reviews recent code commits (last 24h) for suspicious patterns — unexpected network calls, data exfiltration vectors, encoded payloads, unusual file operations.
Why: AWF is itself a security tool, making it a high-value target for supply chain attacks. A daily scan of new commits aligns with the repository's security-first mission. Pelis Factory's equivalent found real issues in production. The existing
secret-diggerworkflows test runtime secrets but not source code injection.Effort: Low — can be adapted from Pelis Factory:
gh aw add-wizard https://github.com/github/gh-aw/blob/v0.45.5/.github/workflows/daily-malicious-code-scan.md📰 Weekly Repo Chronicle
What: A weekly summary discussion of what changed in the repository — merged PRs, opened/closed issues, workflow highlights, notable commits — presented in a readable narrative format.
Why: With 21 agentic workflows generating activity and 10+ open PRs, it's hard to keep a holistic view of what's happening. A weekly digest makes the project more transparent to contributors and stakeholders. Reference:
githubnext/agentics/weekly-repo-chronicle.Effort: Low
P3 — Future Ideas
📊 Portfolio Analyst for Workflow Cost Optimization
What: Analyzes token usage, turn counts, and costs across all agentic workflow runs. Identifies wasteful workflows (too chatty, too many turns for simple tasks) and suggests optimizations.
Why: With 21 workflows running daily/hourly (especially 3 secret-digger variants running every hour each), costs could accumulate. The Pelis Factory's Portfolio Analyst identified workflows that were "way too chatty" and created optimization opportunities.
Effort: Medium — requires access to billing/usage data
🌳 Issue Arborist — Issue Organization
What: Groups related issues as parent/sub-issues, creating hierarchy in the issue tracker.
Why: Open issues like
#1039(integration test gaps) and#1103(shutdown performance) could each have multiple sub-issues tracking specific work items. The Pelis Factory's Arborist created 77 discussion reports and 18 parent issues.Effort: Medium
🔄 Sub-Issue Closer
What: Automatically closes sub-issues when their parent issue is resolved.
Why: Pairs with Issue Arborist. Available from agentics:
gh aw add-wizard githubnext/agentics/sub-issue-closer.Effort: Low (but depends on Issue Arborist first)
📈 Maturity Assessment
Current Level: 4/5 — "Advanced Practitioner"
Target Level: 4.5/5 — "Factory-Grade"
Gap Analysis: The 3 highest-leverage additions are:
🔄 Comparison with Best Practices
What This Repo Does Well
roles: allwhere appropriate, skip-if-match anti-spam, scoped tool permissionsWhat Could Improve
Unique Opportunities Given the Domain (Firewall/Security)
squid.confsyntax and semantics using Squid's built-in tools in CIRun date: 2026-03-11 | Workflows analyzed: 21 agentic, 11 standard CI | Open issues: 15 | Open PRs: 10
Cache updated:
/tmp/gh-aw/cache-memory/advisor-notes.jsonBeta Was this translation helpful? Give feedback.
All reactions