Features
Every feature exists to answer one question: how do we ship faster without breaking things?
Orchestration Engine
Manager-Worker DAG Architecture
Mahalaxmi organizes AI agents into a two-tier hierarchy. Manager agents analyze requirements and produce execution plans. Worker agents execute tasks in dependency order. Tasks within a phase run in parallel; tasks with dependencies wait for prerequisites.
Configurable number of managers (1–8)
Automatic DAG validation — rejects circular dependencies before execution starts
Phase-based scheduling — all tasks in a phase run concurrently
Per-task file scope, provider assignment, and complexity estimate
Consensus Engine — Four Strategies
When multiple manager agents propose execution plans, the consensus engine merges them into a single coherent plan.
Union (default) — combines all unique tasks; best for maximizing coverage
Intersection — only tasks all managers agreed on; best for conservative or safety-critical work
WeightedVoting — tasks weighted by manager provider historical performance
ComplexityWeighted — tasks weighted by complexity score; best for complex projects
Semantic Deduplication + LLM Arbitration
CamelCase-aware tokenization, stop-word filtering, multi-field Jaccard similarity (name + description + file scope), three-zone classification, and LLM arbitration for ambiguous pairs. No duplicate worker tasks, no missing coverage.
Adversarial Manager Deliberation
Structured 3-turn deliberation per domain: Proposer presents a task plan, Challenger critiques and identifies gaps, Synthesizer produces a refined plan. Mirrors how good engineering teams do design review.
Worker Execution
Git Worktree Isolation
Every worker task runs in a dedicated Git worktree — a separate working directory on the same repository. Workers cannot interfere with each other's file changes. Each worker's output is a clean branch ready for PR.
Branch names encode cycle label and a unique 8-character cycle ID
Re-running the same requirements file never produces colliding branch names
Intelligent Context Routing
Workers receive a curated file selection scored by three signals: Lexical Jaccard (keyword overlap), Import-graph proximity (BFS distance in dependency graph), Historical co-occurrence (files modified together in similar past cycles).
Configurable token budget per worker
Workers get what they need — nothing they don't
PTY-Native Terminal Control
Mahalaxmi controls AI coding tools by owning their terminal (PTY), not by scraping screens. Works with any terminal-based AI CLI tool, reads raw bytes with no rendering artifacts, detects interactive prompts via pattern matching and auto-responds.
Reliable regardless of font, color scheme, or terminal emulator
Captures raw terminal bytes for faithful replay in the UI
Self-Verification Pipeline
Before a worker pushes its branch, it runs your project's own verification tools. Workers that fail verification receive the failure output as additional context and retry. Workers that fail twice are flagged for human review.
Test runner: cargo test, pytest, jest, go test
Linter: clippy, eslint, pylint, golangci-lint
Build gate: verifies the project compiles after changes
Security gate: runs the full security pipeline on the diff
Auto-Chain Cycle Continuation
For large requirements files organized as wave groups, Mahalaxmi detects when a cycle completes a phase and automatically starts the next cycle for the next incomplete wave group.
Intelligence
Codebase Indexing
When a project is opened, Mahalaxmi indexes it using Tree-sitter parsers: function/class/struct definitions with line ranges, import/export dependency mapping, cross-file call graph edges, and file relevance scores.
Supported: Rust, TypeScript/JavaScript, Python, Go, Java, C/C++, and more via Tree-sitter grammar ecosystem
GraphRAG Knowledge Graph
A queryable code knowledge graph answering questions like "What calls authenticate()?" or "What is the blast radius of changing auth_middleware.rs?" The impact scorer computes a 0–10 risk score based on downstream dependency count and centrality.
Cross-Agent Shared Memory
Workers read from and write to a shared memory store. Discoveries propagate across the cycle. Memory entries have scope (Session / Project / Global), configurable decay, and — for Enterprise — team sync.
Codebase Q&A and Wiki Generation
Ask questions about your codebase in natural language. The same capability powers automatic wiki generation — structured technical documentation generated from codebase analysis.
Human-in-the-Loop
Interactive Plan Review
Before workers start, the full execution plan is displayed. Approve immediately, or add/remove tasks, adjust file scope, and add free-text instructions for individual workers. Every modification is stored in the plan audit log.
Budget Gate
Configure a cost ceiling per cycle. When estimated spend approaches the limit, execution pauses for confirmation. Shows tokens consumed, estimated remaining, cost at current provider rates, and which workers are running.
Post-Cycle Validation Dashboard
After workers complete, a validation run checks that acceptance criteria are met. Shows per-criterion pass/fail, which files changed for each criterion, per-file accept/reject controls, and gap task generation for unmet criteria.
PR Review Response Loop
After a worker's PR receives human code review comments, a fix worker is automatically dispatched — pre-loaded with the original task context, the diff, and the reviewer's comments.
Security & Compliance
Security Pipeline
Every worker diff is scanned by four parallel scanners before the branch is pushed.
Secrets detection: 40+ patterns — API keys, tokens, private keys, connection strings
Dependency audit: known CVEs via OSV.dev for npm, cargo, pip, go.sum
SAST: cargo-audit, semgrep common vulnerability patterns
License compliance: SPDX identification, blocks problematic licenses in commercial contexts
HIPAA and FedRAMP Compliance Profiles
Pre-configured compliance overlays enforcing relevant controls.
HIPAA: disables plaintext logging of PHI patterns, enforces secrets scanner, requires audit log retention, enforces TLS 1.2+
FedRAMP: restricts provider list to US-hosted services, enforces FIPS-compliant cryptographic operations, requires continuous vulnerability scanning
Enterprise
Team Collaboration & Seat Management
Configure a roster of named AI developer agents, each with assigned AI provider, priority weight, and maximum concurrent manager assignments. Tracks active sessions, enforces license limits, and generates per-developer cost reports.
Cost Analytics & Reporting
Pre-cycle cost estimates, actual token spend per provider per cycle, per-project cost history with bar chart visualization, CSV and JSON export for finance integration.
IDE Extensions
Native integrations for VS Code, JetBrains, and Neovim.
VS Code: live sidebar, plan approval UI, per-file accept/reject, worker terminal panels, cost status bar
JetBrains: tool window, plan approval dialog, file status decorations, cycle status bar widget
Neovim: Lua plugin with floating panel, Telescope picker, statusline integration, full command palette
Headless Service Mode
mahalaxmi-service is a standalone REST+SSE API server for CI/CD integration.
POST /v1/cycles — start a cycle from CI/CD
GET /v1/cycles/:id — poll cycle status
GET /v1/events — SSE stream of cycle events
Intake adapters: Jira, Slack, GitHub Issues, custom REST
Output adapters: Jira comments, Slack threads, GitHub issue auto-close, HMAC-signed webhooks
Platform Support
| Platform | Status |
|---|---|
| macOS (Apple Silicon + Intel) | Supported |
| Windows 10/11 | Supported |
| Linux (x86_64) | Supported |
| Linux (ARM64) | Beta |
Distribution: GitHub Releases, WinGet, Chocolatey, Scoop, Homebrew, Flathub, AUR, MSIX