26  Model Cards / ML Governance

Audience: Kav AI ML team + technical-diligence reviewers + auditors. Purpose: document each detection model’s scope, evaluation, calibration, and drift so the PRD’s accuracy claims are governable, not just asserted. Reference: Google “Model Cards for Model Reporting”; Hugging Face model-card format. Pairs with the validation methodology doc in ../../evaluation/.

26.1 One card per model (RGB, thermal, OGI, cross-source engine, OOD detector)

  • Model details — name, version ID, owner, date, architecture.
  • Intended use — modality, asset classes, what it must NOT be used for.
  • Training data — campaigns/sources, size, labeling process, known gaps.
  • Evaluation — TPR/FPR vs. the PRD targets, on which benchmark, with NDT ground truth.
  • Calibration — confidence-calibration curve; how/when regenerated (PRD pilot Weeks 9–10).
  • OOD behavior — how out-of-distribution inputs are flagged/routed.
  • Drift & retraining cadence — triggers, monitoring (ties to FR-AI-04).
  • Limitations & ethical/safety notes — HITL requirement; Filter Skill > Stage 3.5 > cross-source.
  • Versioning/rollback — per PRD AI Engine section.

These cards are how the TPR > 98% / FPR < 2% claims become defensible. The numbers must match the validation methodology doc and the PRD Performance table.