In April 2026, a real AI from a real company started calling software bugs goblins. And gremlins. And raccoons. In serious answers. To paying customers.
The reason was simpler than it sounds. Months earlier, the company had added a "Nerdy" personality option. Human raters loved it when the AI used fantasy words, so that style got rewarded too heavily. The behaviour leaked into every mode. Then those weird outputs were scraped back into the training data for the next model.
The fix was a 3,500-word system prompt. One line told the AI to "never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures." Engineers had to paste it into the code twice to make it stick.
That's the whole lesson. AI absorbs the weirdness of whatever it was trained on. A small group of raters can shape how it sounds for everyone. And a sticky note saying "stop it" is rarely a fix.
