Discussion Questions
Add a Padlet or Mentimeter board
Embed a shared class board for student contributions and ideas.
A practical mental model for prompt engineers - without the math.
Before the lesson
Watch this 60-second clip for a fast vibe-check on the concept. Then dive into the full lesson below.
A large language model (LLM) is a giant function that, given a sequence of tokens, predicts the next token. Repeat that prediction step a few hundred times and you get a paragraph.
That's it. Everything else - chat, tools, reasoning, code - is built on top of "predict the next token."
Models read tokens, which are usually 3–4 characters in English.
You can inspect any prompt in OpenAI's tokenizer or with the tiktoken library.
Take three short prompts (a question, a one-line task, and a paragraph). Paste each into the OpenAI tokenizer. Record the token count, then estimate cost at $5 per 1M input tokens. What does this tell you about how to write prompts cheaply?
Quick self-check