What are the five KYI pillars?
Know Your Inference scores inference systems across five pillars: Performance (25%), Economics (25%), Integration (20%), Strategy (20%), and Risk (10%). Each pillar is rated 0–100 from live evals, ledger data, and governance signals. The weighted composite produces a confidence level and a recommendation: proceed, cap, rightsize, or sunset. A composite floor of 65 is the default governance threshold; a score below it triggers enforcement levers per policy. Pillars are not independent checkboxes: weak economics can undercut strong latency, which is why KYI weights Performance and Economics equally.
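As a minimal sketch of the weighted composite described above, the snippet below applies the stated pillar weights to five pillar scores and compares the result with the default floor of 65. The function name, the dictionary layout, and the example scores are illustrative assumptions, not the o10 implementation, and the mapping from score to confidence level or recommendation is left out because this FAQ does not define it.

```python
# Pillar weights in percent, as stated in the text; they sum to 100.
WEIGHTS = {
    "performance": 25,
    "economics": 25,
    "integration": 20,
    "strategy": 20,
    "risk": 10,
}
COMPOSITE_FLOOR = 65  # default governance threshold from the text


def composite_score(pillars: dict) -> float:
    """Weighted average of the five pillar scores (each 0-100)."""
    if set(pillars) != set(WEIGHTS):
        raise ValueError("expected exactly the five KYI pillars")
    for name, score in pillars.items():
        if not 0 <= score <= 100:
            raise ValueError(f"{name} score out of range: {score}")
    return sum(WEIGHTS[name] * pillars[name] for name in WEIGHTS) / 100


# Hypothetical scores: strong performance, weak economics.
example = {
    "performance": 90,
    "economics": 40,
    "integration": 70,
    "strategy": 60,
    "risk": 50,
}
score = composite_score(example)
below_floor = score < COMPOSITE_FLOOR
print(score, below_floor)  # 63.5 True
```

In this made-up example a Performance score of 90 does not rescue an Economics score of 40: the composite lands at 63.5, under the floor.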
Is KYI a one-off audit?
No. KYI runs continuously in the o10 control plane — every routed call, eval result, and policy decision updates pillar scores and the composite recommendation. One-off audits go stale the week prompts, models, or retries change. Continuous KYI gives boards and regulators current evidence: an immutable ledger backs each score change with model, venue, policy, jurisdiction, and cost per call.
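The source does not say how the ledger is made immutable. One common way to make an append-only record tamper-evident is hash chaining, sketched below under that assumption. The field names mirror the ones listed above (model, venue, policy, jurisdiction, cost per call); the function names and example values are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder hash for the first record


def _digest(prev_hash: str, entry: dict) -> str:
    payload = json.dumps({"prev": prev_hash, "entry": entry}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_entry(ledger: list, entry: dict) -> dict:
    """Append an entry whose hash covers the previous record's hash."""
    prev_hash = ledger[-1]["hash"] if ledger else GENESIS
    record = {"prev": prev_hash, "entry": entry, "hash": _digest(prev_hash, entry)}
    ledger.append(record)
    return record


def verify(ledger: list) -> bool:
    """Recompute every hash; any edited entry breaks the chain."""
    prev_hash = GENESIS
    for record in ledger:
        if record["prev"] != prev_hash:
            return False
        if record["hash"] != _digest(prev_hash, record["entry"]):
            return False
        prev_hash = record["hash"]
    return True


ledger = []
append_entry(ledger, {
    "model": "model-a", "venue": "venue-1", "policy": "default",
    "jurisdiction": "EU", "cost_per_call": 0.004,
})
append_entry(ledger, {
    "model": "model-b", "venue": "venue-2", "policy": "default",
    "jurisdiction": "EU", "cost_per_call": 0.001,
})
intact = verify(ledger)
```

Changing any field of an earlier entry after the fact makes `verify` return `False`, which is the property an auditor would rely on.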
Who wrote Know Your Inference?
Shen Pandi authored the Know Your Inference framework, published on o10.io with live scoring in the control plane. The thesis: cheaper tokens miss the point if the system fails on integration, strategy, or risk, because up to 90% of an AI system's operational life is inference, where value, reliability, and risk are decided. The whitepaper defines the pillars and weights, and sets a level of board reporting rigor comparable to established IT governance frameworks.
What triggers enforcement?
A composite KYI score below 65, or an individual pillar breach as defined by governance policy, triggers enforcement levers: spend cap, auto-rightsizing to a cheaper eval-passing model, workload sunset, or escalation to board review. Enforcement is tied to measured signals (eval pass rate collapse, envelope breach, residency violation), not subjective review alone. Policy defines which lever applies; o10 executes it in the request path.
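The trigger logic above can be sketched as a small policy check. Everything about the `Policy` shape here is an assumption made for illustration (per-pillar floors, a lever name per pillar, a lever for a composite breach); the lever names are taken from the list above, and the real policy format is not described in this FAQ.

```python
from dataclasses import dataclass, field


@dataclass
class Policy:
    # Hypothetical policy shape; only the 65 default comes from the text.
    composite_floor: float = 65.0
    composite_lever: str = "board_review"
    pillar_floors: dict = field(default_factory=dict)
    pillar_levers: dict = field(default_factory=dict)


def triggered_levers(composite: float, pillars: dict, policy: Policy) -> list:
    """Return the levers the policy assigns to the measured breaches."""
    levers = []
    if composite < policy.composite_floor:
        levers.append(policy.composite_lever)
    for name, floor in policy.pillar_floors.items():
        if pillars[name] < floor:
            lever = policy.pillar_levers.get(name, policy.composite_lever)
            if lever not in levers:  # avoid applying the same lever twice
                levers.append(lever)
    return levers


policy = Policy(
    pillar_floors={"economics": 50, "risk": 40},
    pillar_levers={"economics": "spend_cap", "risk": "sunset"},
)
levers = triggered_levers(
    composite=63.5,
    pillars={"economics": 40, "risk": 50},
    policy=policy,
)
print(levers)  # ['board_review', 'spend_cap']
```

The function only decides which levers apply. Executing them in the request path is the control plane's job and is outside this sketch.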
How is KYI different from FinOps?
FinOps brings financial accountability to cloud spend and reports token totals, forecasts, and allocations — often a month late. KYI governs whether the inference system creates durable value across performance, economics, integration, strategy, and risk — with a board-signable recommendation. FinOps tells you what you spent; KYI tells you whether the workload should continue, scale, or stop — and o10 holds the levers.
Can KYI run without o10?
The KYI framework is portable — pillars, weights, and scoring logic can be applied in spreadsheets or GRC tools. Continuous scoring, enforce-mode levers, and per-call ledger evidence require a control plane in the inference path. Without live routing data, KYI becomes a periodic exercise that drifts from production reality within days of the next model or prompt change.
What evals feed performance?
Workload-specific eval suites replay production samples against candidate models: support QA, RAG faithfulness, code correctness, classification precision, clinical safety, and custom business metrics. The Performance pillar score reflects pass rates, latency percentiles, and drift detection, not vendor leaderboard rankings. When a model's pass rate slips below its floor, o10 stops routing to that model until it is revalidated.
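A minimal sketch of the pass-rate gate described above, assuming each eval suite run yields a list of pass/fail results and that a later run at or above the floor counts as revalidation. The `ModelGate` class, the 0.9 floor, and the model name are hypothetical.

```python
def pass_rate(results: list) -> float:
    """Fraction of eval cases that passed."""
    if not results:
        raise ValueError("no eval results")
    return sum(1 for ok in results if ok) / len(results)


class ModelGate:
    """Tracks which models are currently blocked from routing."""

    def __init__(self, floor: float):
        self.floor = floor
        self.blocked = set()

    def record_suite(self, model: str, results: list) -> float:
        rate = pass_rate(results)
        if rate < self.floor:
            self.blocked.add(model)
        else:
            # Assumption: a passing rerun counts as revalidation.
            self.blocked.discard(model)
        return rate

    def is_routable(self, model: str) -> bool:
        return model not in self.blocked


gate = ModelGate(floor=0.9)
first = gate.record_suite("model-a", [True] * 8 + [False] * 2)
routable_after_drop = gate.is_routable("model-a")
second = gate.record_suite("model-a", [True] * 19 + [False])
routable_after_rerun = gate.is_routable("model-a")
```

Here the first run passes 8 of 10 cases, so the model is blocked; the rerun passes 19 of 20 and the model becomes routable again.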
How do boards consume KYI?
Boards receive a PDF export with the composite score, pillar breakdown, confidence level, recommendation, and a ledger evidence summary, in the language finance and directors already use for vendor and risk decisions. Instead of raw token charts, directors see a recommendation (proceed, cap, rightsize, or sunset) with its justification. The immutable audit trail supports regulator and internal audit questions without reconstructing spend from invoices.
What is the relationship to routing?
Routing executes economics and performance — selecting the cheapest eval-passing model per call. KYI scores whether the entire supply chain (venues, integrations, strategy, risk) is sound above those routes. Routing without KYI optimizes cost; KYI without routing is a report. Together they answer: are we spending wisely, and is the system defensible to the board?
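The routing rule stated above, the cheapest model that still passes evals, can be sketched as a filter followed by a minimum. The `Candidate` fields, model names, costs, and pass rates are invented for illustration and do not describe real models or the o10 router.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    name: str
    cost_per_call: float  # in any consistent currency unit
    pass_rate: float      # fraction of eval cases passed, 0.0-1.0


def cheapest_passing(candidates: list, floor: float) -> Optional[Candidate]:
    """Pick the lowest-cost candidate whose pass rate meets the floor."""
    eligible = [c for c in candidates if c.pass_rate >= floor]
    return min(eligible, key=lambda c: c.cost_per_call, default=None)


candidates = [
    Candidate("large", cost_per_call=0.012, pass_rate=0.97),
    Candidate("medium", cost_per_call=0.004, pass_rate=0.93),
    Candidate("small", cost_per_call=0.001, pass_rate=0.81),
]
choice = cheapest_passing(candidates, floor=0.9)
print(choice.name)  # medium
```

The cheapest model overall ("small") is skipped because it fails the eval floor. If no candidate passes, the function returns `None`, and what happens next is a policy decision.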
Where is the interactive scorecard?
The live KYI interactive scorecard is at o10.io/kyi — five pillar inputs, composite score, confidence level, and recommendation update as you adjust weights and signals. It demonstrates the framework with the same pillar structure documented in this whitepaper. Production deployments run KYI continuously from live traffic rather than manual scorecard entry.