Problem
Many business workflows still begin with unstructured documents: invoices, receipts, statements, forms, and messages. Extracting values is only part of the problem. The harder question is whether a person can review where each value came from and understand what was not found.
Pipeline
The Phase 4 demo focuses on the path from document text to structured, reviewable fields.
What the examples use
The invoice visual below is synthetic and uses the same public-safe values as the extracted fields shown in the demo. The current backend does not yet perform OCR on scanned images, so this page shows a realistic static visual plus the matching Phase 4 transcript-derived output.
Demo 1: Vendor invoice extraction
The invoice demo shows how synthetic vendor invoice text becomes structured fields such as invoice number, vendor name, invoice date, due date, totals, and payment terms.
The point is not that the fixture is complex. The point is that the output is structured enough for a downstream workflow to inspect.
INVOICE Invoice No: INV-DEMO-2026-0042 Invoice Date: 15 Jan 2026 Due Date: 14 Feb 2026 From: Northstar Office Supplies LLC Bill To: Blue Harbor Consulting LLC Currency: USD Subtotal: $1250.00 Tax Amount: $100.00 Total Amount: $1350.00
Demo 2: Expense receipt extraction
The receipt demo applies the same Phase 4 idea to a different business document shape. It extracts merchant, date, category, subtotal, tax, tip, total, currency, and payment method from a synthetic receipt.
This shows that the workflow is not tied to one document template.
RECEIPT Merchant: Harbor Cafe Supplies Receipt No: RCP-DEMO-2026-019 Transaction Date: 22 Jan 2026 Currency: USD Subtotal: $42.50 Tax Amount: $3.40 Total Amount: $45.90 Payment Method: card
Demo 3: Evidence and missing fields
The evidence viewer shows why structured extraction needs review context. Fields are displayed with values, confidence levels, and supporting snippets. Missing fields are shown explicitly instead of being silently ignored.
That makes the output easier to inspect, explain, and route into the next workflow step.
Evidence: Total Amount: $1350.00
- vendorAddress
- buyerAddress
- purchaseOrderNumber
- lineItems
The display model still reserves these buckets so future workflows do not hide uncertainty.
What this proves
Phase 4 proves that AI Workflow Lab can produce reviewable structured data from everyday business documents.
- Document-type-aware field extraction.
- Public-safe fixture outputs.
- Confidence and evidence display.
- Explicit missing-field reporting.
- A clear boundary between extraction and workflow decisions.
What comes next in Phase 5
Phase 5 should focus on what happens after extraction: human review, validation, exception handling, routing by confidence or missing fields, and audit-friendly workflow state.
Phase 4 does not decide whether a document should be approved, rejected, reimbursed, paid, or posted to accounting. It prepares data that can support those later workflow decisions.
- Human review screens
- Validation and exception handling
- Business-rule routing
- Workflow state and audit trail
Limitations
- The current demos use synthetic public-safe fixtures.
- The public invoice image is a visual reference; extracted fields shown here come from matching synthetic transcripts.
- The current backend does not perform OCR on scanned image input.
- The examples are static and do not call production APIs.
- The extraction outputs are demo artifacts, not a production review queue.
- The demos do not approve payments, reimbursements, or accounting entries.
- The demos do not make LLM-based decisions.
Source artifacts
The source demo package is maintained in the platform repository so the website copy, fixture references, QA notes, and claims review remain traceable to the backend platform work.