Module 4 Assignment: Evaluation plans and acceptance criteria

Scenario

You are advising a delivery team deciding whether an AI project should move from experiment to deployment. The stakeholders are: project sponsor, product owner, ML lead, operations manager, and governance reviewer.

Task

Answer the module question: What evidence authorizes forward movement?

Use the module lab and course readings to produce a deployment decision package containing a project charter, acceptance gates, a risk log, and a monitoring plan. The package should focus on evaluation plans and acceptance criteria: define acceptance tests for model and workflow quality.

Required Evidence

  • Define the decision or system boundary in one paragraph.

  • Identify the dataset, proxy data, or evidence source you used. For this module, the source is synthetic project telemetry with scope volatility, evaluation results, risks, adoption readiness, and operational load.

  • Compare at least two alternatives, baselines, policies, or designs.

  • Report one quantitative result or structured scoring table.

  • Explain two failure modes and one mitigation for each.

  • State what additional evidence would be required before real deployment.
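
One way to meet the "structured scoring table" requirement is a weighted score per alternative. The sketch below is only an illustration: the criteria names, the weights, and the 1-5 scores are all hypothetical placeholders, not results from the lab, and should be replaced with values you can justify from your own evidence.

```python
# Hypothetical criteria and weights (assumptions); weights sum to 1.0.
WEIGHTS = {"quality": 0.4, "operational_load": 0.3, "adoption_readiness": 0.3}

# Illustrative 1-5 scores only. These are NOT real results.
SCORES = {
    "baseline_or_manual_process": {
        "quality": 3, "operational_load": 4, "adoption_readiness": 5,
    },
    "ai_assisted_or_advanced_option": {
        "quality": 4, "operational_load": 3, "adoption_readiness": 3,
    },
}


def weighted_score(scores, weights):
    """Return the weighted sum of one option's criterion scores."""
    return round(sum(scores[name] * w for name, w in weights.items()), 2)


def scoring_table(all_scores, weights):
    """Return (option, total) rows sorted from highest to lowest total."""
    rows = [(opt, weighted_score(s, weights)) for opt, s in all_scores.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


table = scoring_table(SCORES, WEIGHTS)
for option, total in table:
    print(f"{option:<35} {total:.2f}")
```

Whatever weighting you choose, state it in the memo so that a reviewer can recompute the totals from the raw scores.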

Submission

Submit the completed notebook plus a 900-1200 word memo. The memo must include clear headings for context, method, evidence, risks, recommendation, and open questions.
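
Before submitting, it may help to check the memo against the length and heading requirements automatically. The following is a rough sketch, assuming the memo is available as plain text and that each heading sits on its own line; `check_memo` is a hypothetical helper name, and the word-counting rule is a simplification.

```python
import re

# Headings required by the assignment, in lowercase for comparison.
REQUIRED_HEADINGS = [
    "context", "method", "evidence", "risks", "recommendation", "open questions",
]


def check_memo(text, min_words=900, max_words=1200):
    """Report the word count and any required headings that are missing.

    Assumption: a heading is a line that starts with the heading text,
    optionally preceded by '#' markers and spaces.
    """
    word_count = len(re.findall(r"\w+(?:['-]\w+)*", text))
    lines = [line.strip().lower().lstrip("# ") for line in text.splitlines()]
    missing = [
        heading for heading in REQUIRED_HEADINGS
        if not any(line.startswith(heading) for line in lines)
    ]
    return {
        "word_count": word_count,
        "length_ok": min_words <= word_count <= max_words,
        "missing_headings": missing,
    }
```

A call such as `check_memo(open("memo.txt").read())` (the file name is an assumption) returns a small report. It checks form only and says nothing about the quality of the reasoning.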

# Assignment workspace for Module 4: Evaluation plans and acceptance criteria
module = 4
decision = "What evidence authorizes forward movement?"
artifact = "deployment decision package with project charter, acceptance gates, risk log, and monitoring plan focused on evaluation plans and acceptance criteria: Define acceptance tests for model and workflow quality."

alternatives = [
    {"option": "baseline_or_manual_process", "strength": "", "risk": "", "evidence": ""},
    {"option": "ai_assisted_or_advanced_option", "strength": "", "risk": "", "evidence": ""},
]

recommendation = {
    "decision": decision,
    "recommended_option": "",
    "minimum_evidence_before_pilot": [],
    "monitoring_metric": "",
    "rollback_trigger": "",
}

{"module": module, "artifact": artifact, "alternatives": alternatives, "recommendation": recommendation}
{'module': 4,
 'artifact': 'deployment decision package with project charter, acceptance gates, risk log, and monitoring plan focused on evaluation plans and acceptance criteria: Define acceptance tests for model and workflow quality.',
 'alternatives': [{'option': 'baseline_or_manual_process',
   'strength': '',
   'risk': '',
   'evidence': ''},
  {'option': 'ai_assisted_or_advanced_option',
   'strength': '',
   'risk': '',
   'evidence': ''}],
 'recommendation': {'decision': 'What evidence authorizes forward movement?',
  'recommended_option': '',
  'minimum_evidence_before_pilot': [],
  'monitoring_metric': '',
  'rollback_trigger': ''}}
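
The empty `minimum_evidence_before_pilot`, `monitoring_metric`, and `rollback_trigger` fields above are where acceptance gates become concrete. As a minimal sketch of one possible approach, each gate can be written as a metric, a direction, and a threshold, then checked against observed values. The metric names, thresholds, and observed numbers below are hypothetical and chosen only for illustration.

```python
# Hypothetical gates (assumptions): "min" means the observed value must be
# at least the threshold, "max" means it must be at most the threshold.
GATES = [
    {"metric": "task_accuracy", "direction": "min", "threshold": 0.85},
    {"metric": "human_override_rate", "direction": "max", "threshold": 0.20},
]


def evaluate_gates(observed, gates):
    """Return one result per gate; a missing metric counts as a failure."""
    results = []
    for gate in gates:
        value = observed.get(gate["metric"])
        if value is None:
            passed = False  # no evidence means the gate is not passed
        elif gate["direction"] == "min":
            passed = value >= gate["threshold"]
        else:
            passed = value <= gate["threshold"]
        results.append({**gate, "observed": value, "passed": passed})
    return results


# Illustrative observations only. These are NOT real evaluation results.
observed = {"task_accuracy": 0.88, "human_override_rate": 0.27}
results = evaluate_gates(observed, GATES)
authorized = all(r["passed"] for r in results)

for r in results:
    status = "PASS" if r["passed"] else "FAIL"
    print(f"{r['metric']:<22} {r['observed']!s:<6} {status}")
print("Forward movement authorized:", authorized)
```

Treating missing evidence as a failed gate is a deliberate design choice in this sketch: forward movement is authorized only by evidence that exists. The same structure could serve as a rollback trigger after deployment if the gates are re-evaluated on monitoring data.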

Acceptance Criteria

Your submission is complete only if another reviewer can reproduce your reasoning from the evidence you provide. You do not need production-grade data, but you must be explicit about proxy-data limits and what would change with real institutional data.