Lloyds Banking Group interview · Data Scientist
The Author
in the Machine
A practical guide to LLMs: from AI and machine learning to transformers, attention, retrieval, and responsible banking systems.
The map
Four questions, one mental model
01 · The landscape
LLMs are one branch — not the whole tree
01 · The road to LLMs
Language models didn't start with ChatGPT
Each era learned more of the language from data — until attention let them finally scale.
02 · Meet the LLM
An LLM is autocomplete — scaled up beyond recognition
The same instinct as your phone keyboard — guess what comes next — trained on a vast amount of text. It's what sits behind ChatGPT, Copilot and Gemini.
02 · Tokens
It doesn't read words. It reads tokens.
Our sentence, the way the model actually sees it — common words stay whole, rarer words break into pieces, and punctuation is its own token.
So 9 words ≈ 11–13 tokens — the exact split is tokenizer-specific, but tokens are the unit of reading, pricing and context limits.
02 · Next-token prediction
All it really does: predict the next token
03 · The transformer moment
Older models read in a line.
The transformer reads everything at once.
It reads in order, so two distant words are many steps apart — the thread between them fades.
Every word can look at every other word directly — which is what let LLMs handle long context and scale.
03 · Attention
Attention asks: what should this word look back at?
03 · The context window
Think of it as the model's desk
Everything it can see at once must fit on one finite desk: instructions, the conversation, retrieved evidence, and room for the reply.
04 · The author
The model isn't the chatbot.
It's the author writing its next line.
Prompting is setting the scene for the author: role, task, evidence, format.
04 · Reliability
A fluent answer is not the same as a true one
Fluent
It can produce polished, plausible language even when the right evidence is missing.
Grounded
The answer is tied to trusted evidence on the context desk, ideally with citations.
To the model, every question feels like an exam — it would rather guess than say “I don't know.” In a bank, a polished but ungrounded answer is a risk, not a gain.
04 · Retrieval — RAG
Turn a closed-book exam into an open-book one
Retrieval-Augmented Generation indexes trusted documents, fetches the relevant passages, places them on the desk, then lets the author write a grounded answer.
apply here?”
documents
evidence
citations
04 · From model to product
The product is the system around the author
A ChatGPT-style tool is prompts, retrieval, tools, memory, UI, logs, evaluations, access controls and human workflows wrapped around a model.
Production reliability lives in the wrapper.
05 · In a bank
Where it helps — and how we keep it safe
Where LLMs fit
How we keep it safe
The takeaway
Five things to keep
What I'd bring
Make retrieval the strength, not the ceiling
Most retrieval systems plateau at finding the right context, not writing from it — fast vector search is broad but blunt. So I'd make retrieval two-stage.
comes in
fast but blunt
the real top 5
answer