The Product Assessment Framework
27 dimensions across 6 attributes and 5 stages. The model for measuring whether your product is genuinely AI-native or whether AI is bolted onto a product that would work without it.
AI bolted onto an existing product. No architectural integration. Features work without AI.
AI features present and useful. UI adapted for AI interactions. Product enhanced but not dependent.
AI embedded in core workflows. Architecture designed around AI. Removing AI would break the product.
AI is the product. Every interaction involves inference. Learning loops active and compounding.
AI architecture creates a defensible moat. Network effects, institutional learning, self-improving systems.
Product Architecture
Dimensions 1–4How deeply AI is embedded in the product. Integration depth, model strategy, context architecture, and agentic capability determine whether AI is bolted on or load-bearing.
How deeply is AI integrated into the product's core architecture?
How sophisticated is the product's AI model selection and orchestration strategy?
How well does the product manage context for AI interactions?
Can the product's AI take autonomous multi-step actions on behalf of users?
Adaptive Experience
Dimensions 5–9How the user experience adapts to AI capability. Interaction model, progressive disclosure, adaptive interface, and confidence transparency separate intelligent products from form-and-table products.
How does the user interact with AI capabilities in the product?
How does the product reveal AI complexity to different user skill levels?
Does the interface adapt based on AI understanding of user context?
How transparent is the product about AI confidence and uncertainty?
How well does the product facilitate human-AI collaboration rather than replacement?
Learning Systems
Dimensions 10–13How the product learns from usage. Learning flywheel, personalization depth, knowledge architecture, and data quality determine whether the product gets smarter every day.
Does the product measurably improve from user interactions over time?
How deeply does the product personalize AI behavior to individual users?
How well does the product structure and leverage organizational knowledge?
How does the product maintain data quality and freshness for AI operations?
Product Economics
Dimensions 14–17How efficiently the product delivers AI value relative to inference cost. Cost per outcome, inference economics, pricing-cost alignment, and value attribution decide whether AI features are profitable or margin destroyers.
Does the product track and optimize the cost of AI-delivered outcomes?
How efficiently does the product manage AI inference costs?
Does the product's pricing model reflect its AI cost structure?
Can the product attribute business value to specific AI capabilities?
Trust & Reliability
Dimensions 18–22How the product manages AI risks. Hallucination management, security posture, privacy governance, ethical guardrails, and graceful degradation determine whether users keep trusting the AI when it inevitably gets something wrong.
How does the product detect and manage AI hallucinations?
How robust is the product's AI-specific security posture?
How does the product handle data privacy in AI operations?
Does the product have ethical guidelines governing AI behavior?
How does the product handle AI service failures?
Compound Mechanics
Dimensions 23–27How the product builds defensibility. Network intelligence, switching cost depth, expansion surface, platform leverage, and benchmark community separate temporary advantage from compounding moats.
Does the product get smarter as more users or organizations adopt it?
How deep are the AI-driven switching costs beyond basic data lock-in?
Does AI create new expansion opportunities (use cases, user types, revenue)?
Does the product leverage AI platform capabilities (APIs, SDKs, marketplace)?
Does the product create AI-native benchmark or community value?
Wrapper
27 – 49AI bolted onto existing product. No architectural integration. Features work without AI.
Single AI feature bolted on via API call. Architecture untouched. The product was built before AI and still runs without it.
Single model, single provider, default settings. No prompt engineering. No fallback when the API is down.
Stateless. Every AI interaction starts from zero context. The product remembers nothing between calls.
No agents. AI generates text or suggestions only. All actions require explicit human steps.
Forms, tables, and dashboards. Click-driven. AI lives in a sidebar or modal off to the side.
Every option visible at once. Users wade through settings and screens. Complexity is the user's problem.
Identical UI for every user. No personalization beyond manual settings the user toggles themselves.
AI output presented without uncertainty signals. Users cannot tell what the product is sure of and what it guessed.
Tool. Users operate the product. The product waits for input and responds. No initiative, no perspective.
No learning loop. Every user gets the same product. Usage data is collected but does not feed back into model behavior.
Generic experience. Names and avatars are the only signal that the product knows who is using it.
Knowledge sits in scattered docs, wikis, and tool settings. Nothing is structured for retrieval. AI cannot reach most of what the team knows.
Stale and unverified. The product trains or retrieves on whatever data was loaded once. Drift is invisible.
Unmeasured. Inference costs are invisible to product and engineering. Margins erode quietly as usage grows.
Naive: every request hits the most capable model regardless of difficulty. Costs scale linearly with usage with no controls.
Flat seat pricing on top of variable AI cost. Heavy users subsidize themselves out of margin. The pricing model leaks money.
No attribution. The product cannot answer 'what did this customer get out of us this quarter.'
Unacknowledged. The product confidently presents fabricated answers as if they were facts. Users discover errors after acting on them.
Generic web app security. AI-specific threats like prompt injection and data leakage are unaddressed.
Default privacy. Standard SaaS controls; AI features may share data with model providers without clear isolation guarantees.
No guardrails beyond what the model provider includes. The product inherits whatever the foundation model decided.
Cascading failure. When the AI provider is down, the product is down. Users see errors or blank screens.
Single-tenant intelligence. Each customer's product is shaped only by their own data. No cross-customer benefit.
Low switching cost. Customer data is in standard formats and could be exported and re-imported elsewhere in days.
Single-feature product. Customers buy it for one job and use it for that one job.
Closed product. APIs are minimal or absent. The product is a destination, not a platform.
No community. The product is sold and used in isolation. Customers do not share notes with other customers.
Augmented
50 – 71AI features present and useful. UI adapted for AI interactions. Product enhanced but not dependent.
Two or three AI features wired into adjacent surfaces. Core workflows still function with AI disabled.
One primary provider with light prompt engineering. Some experimentation with alternatives, but production runs one model.
Per-session memory inside a conversation. State evaporates when the session ends. No long-term user knowledge.
Single-step agents in scoped surfaces. The product can complete a defined task on request, but everything else is manual.
Chat or completion in primary surfaces, but the underlying structure is still form-driven. The intelligent surface is one tab among many.
Tabs and sections hide complexity, but disclosure is static. The same controls appear regardless of who is looking.
Light personalization: saved layouts, favorite items, recent history. The product remembers preferences but does not anticipate them.
Confidence shown as a generic badge or score. Useful as a visual cue, but not actionable in context.
Helper. The product makes suggestions and completes routine tasks, but the user is still doing the thinking.
Per-user learning. The product gets better for the individual who uses it more, but improvements do not transfer to other users.
Preference-based personalization: saved settings, default views, theme. Surface-level only.
Indexed knowledge base with embeddings. Retrieval works for direct queries but misses lateral connections.
Scheduled refresh on a slow cadence. Most data is days or weeks old. Quality is not measured systematically.
Aggregate cost tracking. The team knows total AI spend but cannot attribute it to features, users, or outcomes.
Budget-aware. The team has cost dashboards and per-month ceilings, but routing is uniform. Heavy users blow through margin.
Tiered pricing with vague AI quotas. Pricing acknowledges AI is variable but does not measure it carefully. Margin is fragile per tier.
Activity attribution: features used, requests served, hours spent. Useful for engagement, not for ROI conversations.
Generic disclaimers. The product warns that output may be inaccurate but does nothing structural to reduce or detect hallucinations.
Basic input validation and output filtering. Some awareness of AI threats, but no systematic testing or red-teaming.
Per-tenant data boundaries. Customer data is isolated, but AI training and improvement are still opaque to customers.
Policy-based guardrails. The team has written rules, but enforcement is inconsistent and only catches obvious cases.
Generic error states. The product fails clearly but does nothing useful when AI is unavailable.
Anonymous aggregate insights. The product surfaces benchmarks like 'teams like yours typically' but learning does not flow back into core behavior.
Workflow lock-in. Switching means retraining the team and rebuilding integrations, but the underlying data is portable.
Adjacent feature expansion. The product adds features the team can sell into the existing customer, but each is a separate motion.
Public API for read access. Other tools can pull data out, but the intelligence layer is not extensible from outside.
Customer comparison via case studies and decks. The product is benchmarked against, but the benchmarking happens off-platform.
Integrated
72 – 93AI embedded in core workflows. Architecture designed around AI. Removing AI would break the product.
AI sits inside multiple core workflows. Disabling it would degrade the product enough that users would notice immediately.
Multi-model strategy with task-aware routing. Model performance is evaluated systematically against business metrics, not vibes.
Persistent per-user context across sessions. The product remembers preferences, prior questions, and recent interactions.
Multi-step agents with clear boundaries. The product can chain actions across one workflow without human-in-the-loop on every step.
Conversational and structured surfaces work together. Users move between chat, primitives, and direct manipulation without switching products.
Progressive disclosure based on user role and recent actions. The product hides what the user is unlikely to need.
Behavior-aware personalization. The product reorders, surfaces, or pre-fills based on what this user has done before.
Confidence tied to source. Users can see why the product believes what it does and click through to underlying evidence.
Pair. The product surfaces relevant work, asks clarifying questions, and pushes back when the user is missing context.
Per-tenant learning. The product gets better for the team or account as collective usage grows. Cross-tenant learning is privacy-bounded.
Behavior-deep personalization. The product knows what this user works on, who they collaborate with, and what they tend to ignore.
Structured knowledge graph linking entities, decisions, and outcomes. The product can traverse relationships, not just match keywords.
Continuous refresh on critical paths. Quality metrics tracked per source. Stale or low-quality sources are flagged automatically.
Per-feature cost tracking. The team can answer 'how much does this feature cost to run for one user per month.'
Multi-tier routing. Cheap models handle classification and formatting; frontier models handle reasoning. Caching layers reduce duplicate calls.
Usage-based pricing reflecting infrastructure costs. Heavy users pay more, light users pay less, margins hold across tiers.
Outcome attribution. The product traces user actions to business outcomes and reports back: this many tasks shipped, this much time saved.
Citation requirements. The product refuses to answer without grounding sources and shows them inline. Hallucinations surface as missing citations.
AI-specific security testing as part of the deployment pipeline. Prompt injection, data leakage, and model abuse are tested per release.
Certified data governance. SOC 2, GDPR, and regional compliance with clear documentation of where AI sees customer data and why.
Embedded guardrails in code. Sensitive use cases route through additional checks. Misuse is flagged and queued for review.
Fallback paths on critical features. When AI fails, the product offers the last cached answer, a rule-based alternative, or a clear path to retry.
Cross-customer pattern learning with privacy isolation. The substrate gets smarter for everyone as more customers use it.
Learned-context lock-in. The product knows things about the customer that a competitor would have to relearn from scratch over months.
Multi-area expansion. Customers naturally pull the product into adjacent workflows because the underlying intelligence is reusable.
API-first. Other products build on the product's intelligence via documented APIs. A small ecosystem starts to form.
Active customer community sharing patterns and benchmarks. The product hosts conversations about what good looks like in this category.
Native
94 – 116AI is the product. Every interaction involves inference. Learning loops active and compounding.
AI is a foundational architectural layer. Most features depend on inference. The product was designed assuming AI is always available.
Sophisticated orchestration across providers. Cost-performance tradeoffs are tuned per task. Automatic fallbacks handle outages without user impact.
Layered context architecture: user, team, account, and domain knowledge fused per request. Retrieval pipelines tuned for relevance.
Multi-agent orchestration across product surfaces. Agents coordinate, hand off, and check each other's work. Humans approve outcomes, not steps.
Intelligent multimodal interaction is the default. Voice, chat, structured output, and visual primitives compose dynamically per task.
AI-driven disclosure that adapts per user, per task, per moment. The product reveals what is relevant and tucks away what is not.
Predictive interface. The product anticipates the next action and pre-stages it. Defaults change per user, per context.
Inline reasoning available on demand. The product narrates its own logic when asked, with citations and counterfactuals.
Collaborator. The product participates in decisions. It brings opinions, evidence, and a coherent point of view shaped by everything it has read.
Cross-customer learning with privacy isolation. Patterns learned from one customer improve the substrate without exposing data to other customers.
Role and outcome-aware personalization. The product distinguishes between 'I want a quick answer' and 'I want to understand this deeply' without being asked.
Living knowledge architecture that updates from product usage. New patterns become first-class citizens of the graph automatically.
Real-time data freshness with provenance per record. The product knows when each fact was last verified and surfaces uncertainty when it is stale.
Per-outcome cost tracking. The team knows the marginal cost of each successful task. Pricing decisions reference this number.
Dynamic optimization. The product chooses model, cache, and context size per request to minimize cost while preserving quality.
Value-aligned pricing tied to outcomes the product produces. Customers pay for what works, not for raw inference.
Multi-step attribution across decisions and time. The product narrates how today's outcome compounds from earlier work the customer did.
Verified outputs at runtime. The product checks claims against authoritative data before surfacing them. Unverifiable claims are flagged.
Continuous adversarial testing. Internal red teams probe production. New attack vectors get patched before exploitation.
Customer-controlled data governance. Customers can opt in or out of training contributions, audit data usage, and delete on demand.
Continuously monitored guardrails with human-in-the-loop on edge cases. Drift is detected and corrected per release.
Self-healing infrastructure. Failures auto-route to healthy providers, downgrade to cheaper models, or fall back to deterministic logic without user impact.
Network effects in product quality. New customers benefit from patterns learned from prior customers. Switching costs grow with cohort size.
Deep substrate lock-in. The product's accumulated knowledge of the customer's domain, preferences, and history is itself a strategic asset.
Platform expansion. Customers build their own workflows on top of the product's intelligence layer. Expansion happens without sales motion.
Platform with developer ecosystem. Third-party builders extend the product's intelligence into vertical and adjacent use cases.
Industry benchmarks anchored on the product. Outsiders reference the product's data and definitions as the standard of measurement.
Compounding
117 – 135AI architecture creates defensible moat. Network effects, institutional learning, self-improving systems.
Every interaction routes through AI by design. The product would not exist as a separate concept without it. AI is the substrate, not a layer.
Custom models, training pipelines, and ensembles. Model strategy itself is a competitive advantage that competitors cannot replicate by API access.
Context is itself a product. Compounding knowledge graphs, structured memory, and proprietary embeddings turn every interaction into accumulated value.
Autonomous agents operate continuously in the background. The product takes initiative, surfaces work it has done, and asks forgiveness rather than permission.
The interface generates itself per intent. Users describe outcomes; the product renders the appropriate surface, then dissolves it when the task ends.
Disclosure compounds with usage. The product learns which surfaces matter to which users and reshapes itself silently. Nothing surfaced is wasted.
The interface is a learned artifact unique to each user. Two users on the same plan see meaningfully different products. The personalization compounds.
Full epistemic transparency. Every claim carries its provenance, its confidence, and what would change it. Trust is engineered, not assumed.
Member of the team. The product owns workstreams end to end. Humans review outcomes, not steps. Reviewers describe it as 'they' instead of 'it.'
Network learning. Every customer's usage compounds product quality for everyone. The flywheel is the moat. Day 1000 customers benefit from day 1 customers.
Personalization is an emergent property of every interaction. The product's behavior toward this user is unique enough that switching feels like losing a colleague.
Knowledge compounds without curation. The product extracts, organizes, and distributes institutional intelligence as a byproduct of being used.
Self-healing data quality. The product detects drift, retires bad sources, and improves freshness over time without manual oversight.
Cost per outcome trends down with scale. The flywheel reduces marginal cost. Cost efficiency is itself a competitive advantage.
Self-optimizing inference economics. The product continuously discovers cheaper paths to the same outcome and routes traffic accordingly.
Pricing is itself a competitive advantage. Margins improve as the flywheel reduces cost. Customers pay for results that no competitor can match.
Attribution is itself a product surface customers reference in board decks. The product proves its value better than any account team could.
Grounded by architecture. The product cannot produce a claim without a verifiable source. Hallucination is structurally impossible on critical paths.
Self-hardening security posture. The product detects emerging threats from its own telemetry and patches itself within hours. Security is a learned competence.
Verifiable governance. Customers can cryptographically prove how their data was used or not used. Privacy is a structural commitment, not a policy.
Self-improving ethics layer. The product learns from edge cases, hardens guardrails automatically, and surfaces emerging risks to humans before they surface in production.
Predictive reliability. The product anticipates degradation, pre-warms fallbacks, and adjusts inference choices before users notice anything is wrong.
Network intelligence is the moat. The product is meaningfully better than any competitor purely because of who else uses it. Late entrants cannot catch up.
Substrate is structurally non-replicable. Even with full data export, the network effects and learned context cannot be reconstructed by any competitor.
Category-defining expansion. The product becomes the default substrate for an entire category of work. Expansion is structural, not motion-driven.
Platform leverage is structural. The product's substrate powers other products in the market. The competitive surface is the platform itself.
Category-defining benchmark community. The product's vocabulary, framework, and benchmarks become the way the category talks about itself.
Score your product against all 27 AI-native dimensions.
Free to start. Sign up in 60 seconds.