Module 4 Assignment: Evaluation plans and acceptance criteria

Contents

Module 4 Assignment: Evaluation plans and acceptance criteria#

Scenario#

You are advising a delivery team deciding whether an AI project should move from experiment to deployment. The stakeholders are: project sponsor, product owner, ML lead, operations manager, and governance reviewer.

Task#

Answer the module question: What evidence authorizes forward movement?

Use the module lab and course readings to produce: deployment decision package with project charter, acceptance gates, risk log, and monitoring plan focused on evaluation plans and acceptance criteria: Define acceptance tests for model and workflow quality..

Required Evidence#

Define the decision or system boundary in one paragraph.
Identify the dataset, proxy data, or evidence source you used: synthetic project telemetry with scope volatility, evaluation results, risks, adoption readiness, and operational load.
Compare at least two alternatives, baselines, policies, or designs.
Report one quantitative result or structured scoring table.
Explain two failure modes and one mitigation for each.
State what additional evidence would be required before real deployment.

Submission#

Submit the completed notebook plus a 900-1200 word memo. The memo must include clear headings for context, method, evidence, risks, recommendation, and open questions.

# Assignment workspace for Module 4: Evaluation plans and acceptance criteria
module = 4
decision = "What evidence authorizes forward movement?"
artifact = "deployment decision package with project charter, acceptance gates, risk log, and monitoring plan focused on evaluation plans and acceptance criteria: Define acceptance tests for model and workflow quality."

alternatives = [
    {"option": "baseline_or_manual_process", "strength": "", "risk": "", "evidence": ""},
    {"option": "ai_assisted_or_advanced_option", "strength": "", "risk": "", "evidence": ""},
]

recommendation = {
    "decision": decision,
    "recommended_option": "",
    "minimum_evidence_before_pilot": [],
    "monitoring_metric": "",
    "rollback_trigger": "",
}

{"module": module, "artifact": artifact, "alternatives": alternatives, "recommendation": recommendation}

{'module': 4,
 'artifact': 'deployment decision package with project charter, acceptance gates, risk log, and monitoring plan focused on evaluation plans and acceptance criteria: Define acceptance tests for model and workflow quality.',
 'alternatives': [{'option': 'baseline_or_manual_process',
   'strength': '',
   'risk': '',
   'evidence': ''},
  {'option': 'ai_assisted_or_advanced_option',
   'strength': '',
   'risk': '',
   'evidence': ''}],
 'recommendation': {'decision': 'What evidence authorizes forward movement?',
  'recommended_option': '',
  'minimum_evidence_before_pilot': [],
  'monitoring_metric': '',
  'rollback_trigger': ''}}

Acceptance Criteria#

Your submission is complete only if another reviewer can reproduce your reasoning from the evidence you provide. You do not need production-grade data, but you must be explicit about proxy-data limits and what would change with real institutional data.