Skip to content

HOMEBASE — Feature Backlog

Tracked features, extensions, and research ideas for the HOMEBASE multi-agent POC. Items are organized by priority tier and mapped to their enterprise analog where applicable.


In Progress

Nothing currently in active development.


Next

Version Feature Description Enterprise Analog
v1.20.0 Registry Bulk Operations Agent NL bulk updates across multiple registry items — "close all plumbing items", "mark all stale as in-progress"; extends current single-item update agent Batch ticket processing, bulk status normalization

Backlog

Version Feature Description Enterprise Analog
v1.20.0 Memory / Context Agent Orchestrator references past run decisions from run_history — "this item has been HU/HI for 3 consecutive runs", "last time we saw this pattern we deferred it"; episodic memory layered on existing run_history table Institutional memory for recurring risk patterns, time-to-resolution tracking
v1.20.0 Export / Reporting Agent Structured output beyond current PDF — markdown, JSON summary, email-ready HTML; trend analysis over run history (HU/HI frequency, deferral rates, stale item trajectory) Executive reporting, compliance documentation, RMA trend dashboards
v1.20.0 Scheduled / Triggered Runs Background scheduler auto-runs orchestrator on configurable cadence (daily briefing, weekly review) without manual UI input; notification panel surfaces results; demonstrates agentic autonomy beyond reactive chat Automated risk/compliance monitoring with scheduled escalation
future OpenAI o3 Integration Wire o3 / o3-mini as provider for schema metric discovery and scoring rubric agents; chain-of-thought reasoning for structured inference tasks Provider-agnostic deployment; structured reasoning over schemas and scoring rubrics

Architecture Notes

Multi-Provider Strategy

HOMEBASE intentionally demonstrates a multi-provider, multi-model agentic architecture:

Provider Model Role Justification
Groq Llama 3.3 70B Subagents (HVAC, Plumbing, Electrical, Appliance, General), orchestration, classification, RCA, registry commands Low latency, high throughput for parallel batch recommendation calls
Anthropic Claude Sonnet Synthesizer node — final action plan narrative (activated by ANTHROPIC_API_KEY) Superior instruction adherence and narrative quality for the high-visibility synthesis step; graceful fallback to Groq when key is absent
Gemini 2.5 Flash-Lite Document intake, spreadsheet analytics, schema discovery, multimodal understanding Native PDF/image support; strong extraction and data analysis performance
OpenAI o3 / o3-mini Structured reasoning over schemas, math-heavy scoring (future) Chain-of-thought reasoning with explicit steps; natural fit for schema metric discovery and scoring rubric agents

Enterprise Analog Map

HOMEBASE Feature Enterprise Equivalent
Registry item Risk register item / service ticket / compliance finding
Urgency × Impact quadrant Likelihood × Impact scoring framework
HU/HI → HITL escalation High-priority item requiring human approval
Specialist subagents Domain SME agents (security, ops, finance, legal)
Stale item detection SLA breach / aging ticket detection
Cross-item RCA Systemic root cause analysis across work item categories
5 Whys agent Structured RCA interview workflow
Predictive quadrant preview Ticket severity/routing prediction before submission
Completeness scorer Classifier-informed work item creation assistant
Document intake agent Attachment scraping and structured data extraction
Schema metric discovery Schema-aware agent for metric potential and gap analysis
TF-IDF duplicate detection Deduplication pipeline for intake queues (RMA, SNOW, Jira)
Guided intake flow (Submit New Issue) Structured ticket submission workflow with AI triage and HITL approval gate
Confidence scoring Model uncertainty quantification for stakeholder trust
LangSmith tracing Audit trail of model reasoning for compliance validation
Multi-provider architecture Provider-agnostic deployment for constrained environments (FedRAMP, ATO)

Completed

Version Feature
v1.0.0 Groq/Llama 3.3 70B integration, orchestrator, 5 subagents, HITL
v1.1.0 Plotly charts, trigger filtering, confidence scoring
v1.2.0 Run history tab
v1.3.0 PDF export
v1.4.0 SQLite backend
v1.5.0 LangSmith tracing
v1.6.0 Item detail drawer, NL item updates, API key in graph state
v1.7.0 Stale items alert panel
v1.8.0 Unified NL command field, hybrid intent router
v1.9.0 AI chart generation agent (two-tier), unified command field chart routing
v1.10.0 Cross-item RCA agent, category-scoped RCA, updated_at timestamp schema, registry seed expanded to 30 items, run history seed script
v1.11.0 5 Whys causal chain agent (category-based), RCA synthesis mode, safety keyword category resolution, stacked whys UI panels, sample documents PDF
v1.12.0 Predictive Quadrant Preview (inline badge, confidence bar, Groq/Llama)
v1.13.0 Completeness Scorer + Prompt Agent (per-category rubrics, follow-up questions, integrated into Predictive Quadrant Preview expander)
v1.14.0 Document Intake Agent (Gemini 2.0 Flash multimodal, HITL registry updates, PDF + image support, Google API key integration)
v1.15.0 Spreadsheet Analytics Agent (Gemini 2.5 Flash-Lite, pandas profiling, HITL registry correlation), chart generation from uploaded data (Option A + B), complex chart token fix, Streamlit deprecation fix, post-v1.14.0 bug fixes
v1.16.0 Schema-Aware Metric Discovery Agent (Gemini 2.5 Flash-Lite, CSV + Mermaid ERD input, computable metrics, derived fields, quality observations, schema gaps), HOMEBASE ERD, POC disclaimer, dependency fixes
v1.16.1 MkDocs documentation site (Material theme, GitHub Pages, auto-deploy workflow), README cleanup, Node.js 24 Actions upgrade
v1.17.0 Multi-provider LLM architecture (Claude Sonnet synthesizer, Groq subagents, runtime provider selection via ANTHROPIC_API_KEY, sidebar provider status, 29 new tests)
v1.18.0 TF-IDF Duplicate Detection (tools/duplicate_detector.py, dual-channel TF-IDF, threshold 0.55, HITL warning UI, execute_add integration, 36 new tests)
v1.19.0 Guided Intake Flow (📋 Submit New Issue expander) — 5-step HITL intake mirroring RMA checklist: Describe → Duplicate Check → Triage → Review & Approve → Done