Skip to main content

Is your test suite lying to you?

180-minute Workshop

A practical mindset shift for testing GenAI system

Timetable

1:30 p.m. – 4:30 p.m. Wednesday 18th

Room

Room D3+D4 - Track 7: Workshops

Artificial Intelligence (AI)

Audience

Tester, Test Managers

Required

Laptop

Key-Learnings

  • Identify the five most common GenAI failure types and diagnose which pipeline layer caused each one
  • Reframe traditional test cases using a quality criteria framework built for probabilistic, non-deterministic systems
  • Articulate what a GenAI test strategy needs to contain and why?

You wrote a test. It passed. You ran it again. It failed. And you ran it again, IT passed!

Testing GenAI systems means accepting that the same input produces different outputs, errors don't throw exceptions, and a response that looks correct can be entirely fabricated. Every instinct we have built as testers can work against us here if we don't grow beyond them.

Throughout this workshop we follow one analogy: The Restaurant. Marco is a chef who has read every recipe ever written, extraordinary, but with no memory between shifts. Sofia fetches recipes before every service — get her retrieval wrong and Marco cooks from the wrong recipe. Warning: even if Sofia gets it right, Marco might still go off-script. Each character maps to a technical layer you will test: LLM, retrieval, input guardrails, output guardrails, evaluation.

Are you already invested in this story?

Then join me. We open with a running chatbot, five bugs, no instructions. A hallucination. A guardrail bypass. A retrieval failure. A prompt injection. A bias.

Together we build a one-page GenAI Test Strategy Canvas, one layer at a time. You walk out with seven pipeline layers, one failure mode and one test pattern each, mapped to a quality framework you can apply to any GenAI system on Monday.

Related Sessions

Wed, Nov 18 • 10:45 a.m. – 12:30 p.m.
Room E2+E3 - Track 5: Workshops

105-minute Workshop

Artificial Intelligence (AI) Testing Tools

Virtual Pass session
Tue, Nov 17 • 1:30 p.m. – 2:15 p.m.
Room F1 - Track 1: Talks

25-minute Talk

Artificial Intelligence (AI) Ethics in Tech Other

Thu, Nov 19 • 10:45 a.m. – 12:30 p.m.
Room E2+E3 - Track 5: Workshops

105-minute Workshop

Artificial Intelligence (AI) Collaboration & Communication

Virtual Pass session
Tue, Nov 17 • 10:45 a.m. – 11:30 a.m.
Room F3 - Track 3: Talks

25-minute Talk

Artificial Intelligence (AI) Career Development