Plain English explanations of the most important AI concepts — no technical background required.
AI (Artificial Intelligence) is software that can perform tasks that would normally require human intelligence — like understanding language, recognising images, or making decisions. Modern AI doesn't "think" the way humans do. Instead, it finds patterns in enormous amounts of data and uses those patterns to make predictions.
A Large Language Model (LLM) is a type of AI trained specifically on text. It reads trillions of words — books, websites, code, articles — and learns the patterns of language so well that it can generate new text that sounds human. When you type a question into ChatGPT or Claude, an LLM is reading your words and predicting, one word at a time, what a good response would look like.
It doesn't "know" things the way you know your own phone number. It has learned statistical patterns from a huge amount of text. When it seems to know a fact, it's really predicting what words are likely to follow your question, based on the patterns in its training data. This is why it can sometimes sound very confident while being completely wrong — a phenomenon called hallucination.
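The idea of "predicting what words are likely to follow" can be sketched with a toy model. This is a deliberate simplification (the four-sentence corpus and the plain word counts are assumptions for this sketch; a real LLM uses a neural network over billions of token patterns), but the core mechanic of continuing text with a statistically likely next word is the same:

```python
from collections import Counter, defaultdict

# A tiny illustrative corpus, standing in for the trillions of words
# a real LLM is trained on (this text is an assumption for the sketch).
corpus = "the cat sat on the mat . the cat slept . the cat purred . the dog barked .".split()

# For each word, count which words follow it and how often.
following = defaultdict(Counter)
for current, nxt in zip(corpus, corpus[1:]):
    following[current][nxt] += 1

def predict_next(word: str) -> str:
    """Return the word that most often followed `word` in the corpus."""
    return following[word].most_common(1)[0][0]

print(predict_next("the"))  # "cat" follows "the" more often than "dog" or "mat"
print(predict_next("sat"))  # "sat" was always followed by "on"
```

Ask this toy model about any word outside its tiny corpus and it has nothing sensible to say, yet it will never reply "I don't know" — a miniature version of why confident-sounding prediction is not the same as knowing.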
ChatGPT, Claude, and Gemini are all LLM-powered chat assistants made by different companies — OpenAI makes ChatGPT, Anthropic makes Claude, and Google makes Gemini. They work in the same fundamental way but have been trained differently, with different data, different safety approaches, and different strengths. Claude tends to be strong at writing and nuanced reasoning. ChatGPT is widely used and fast. Gemini is deeply integrated with Google's products.
Most AI tools (ChatGPT, Claude) run on remote servers — you send your question over the internet, their computers process it, and send back an answer. "Running locally" means the AI model is downloaded and runs entirely on your own computer. Tools like Ollama make this possible. The advantage is privacy (nothing leaves your machine) and no ongoing cost. The limitation is that local models are typically smaller and less capable than the frontier cloud models.
AI models don't read word by word — they break text into "tokens," which are roughly word fragments. The word "unbelievable" might become three tokens: "un", "believ", "able". One token is roughly 0.75 words on average. This matters because models have a limit on how many tokens they can process at once (the "context window"), and API pricing is usually based on token count.
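The "one token is roughly 0.75 words" rule of thumb is enough for a rough budget check. The helper below is an illustrative sketch, not an exact count (the function names and the 8,192-token window are assumptions for the example; real tokenizers such as OpenAI's tiktoken give exact counts, and context window sizes vary by model):

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~0.75 words-per-token rule of thumb."""
    words = len(text.split())
    return round(words / 0.75)  # i.e. about 4 tokens for every 3 words

def fits_in_context(text: str, context_window: int = 8192) -> bool:
    """Check whether the text is likely to fit within a model's context window."""
    return estimate_tokens(text) <= context_window

prompt = "Explain what a large language model is in one short paragraph."
print(estimate_tokens(prompt))   # 11 words, so roughly 15 tokens
print(fits_in_context(prompt))   # True
```

The same estimate is what API pricing pages implicitly ask you to do: multiply expected token counts (input plus output) by the provider's published per-token rate.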
Hallucination is when an AI generates text that sounds plausible and confident but is factually wrong — it might invent a fake citation, misquote a statistic, or get a date wrong. It happens because the model is predicting likely-sounding text, not checking facts. You should always verify important facts from AI responses against reliable sources, especially for medical, legal, or financial matters.
Whether AI will take people's jobs is genuinely debated. AI is already automating some tasks — routine writing, basic coding, data analysis, image generation. But most jobs involve complex judgement, relationships, physical presence, and creativity that AI handles poorly. The more likely near-term outcome is that people who know how to use AI effectively will be more productive than those who don't, rather than AI simply replacing roles wholesale.
Current AI tools have real risks: misinformation (AI-generated fake content), bias (models can reflect biases in their training data), misuse (fraud, spam, manipulation), and privacy risks. These are serious and worth understanding. The more speculative long-term risks — AI systems acting against human interests — are taken seriously by researchers at labs like Anthropic and DeepMind, and are an active area of safety research.
A curated collection of the best places to learn more about AI, large language models, and the broader landscape.
The 2017 Google research paper that introduced the Transformer architecture which underpins every modern LLM.
The clearest visual explanation of how transformers work. Essential reading for anyone wanting to understand the mechanics without heavy maths.
OpenAI's landmark 2020 paper introducing GPT-3 and the concept of few-shot learning at scale.
Free, hands-on course that takes you from basics to building real neural networks. No heavy maths prerequisites.
YouTube series by a former Tesla and OpenAI researcher that builds a neural network from scratch in Python. One of the best free resources available.
Free short courses on LLMs, prompt engineering, RAG, and agents. Taught by Andrew Ng and leading practitioners. Ideal for non-technical learners.
Weekly AI news and commentary from Andrew Ng. Clear, balanced, and accessible for non-technical readers.
Detailed weekly newsletter from the co-founder of Anthropic. Covers research, policy, and the strategic AI landscape.
Download and run open-weight models on your own computer. Free, private, no API costs.
The GitHub of AI models. Browse, download, and test thousands of open models. Essential reference for the open AI ecosystem.
LLMs 101 is an interactive mind map designed to help anyone — technical or not — build a solid mental model of how large language models work. It covers the mathematics, the training process, the major model architectures and families, and the prompting techniques that get the best results.
Each node in the map expands to reveal further detail. Clicking any node opens a sidebar with a full explanation and links to primary sources, so you can always go deeper.
Navigate to the Mind Map and start at the centre node, Large Language Models. Click it to reveal the four main branches, click any branch to expand its children, and click any leaf node to open the detail sidebar. Use the hamburger menu (top left) to navigate between the different pages of this site.
On the mind map, nodes with a + indicator have children that will expand on click. Nodes without children open the detail panel directly.
Pure HTML, CSS, and JavaScript — no frameworks, no build tools, no dependencies beyond Google Fonts. Works as a single file that can be opened directly in any browser or hosted on any static web server.
Typography: Cormorant Garamond (headings) and Jost (body). Colour palette inspired by the Sahel theme by Qode Interactive — combining Dark Goldenrod, Tan, Dust Storm, White Smoke, and Van Dyke Brown.
All factual claims in the node explanations are drawn from primary sources including original research papers, official model documentation, and reputable technical publications. Sources are linked directly in each node's detail panel.
Key references include: Vaswani et al. 2017 (Transformer), Brown et al. 2020 (GPT-3), Ouyang et al. 2022 (InstructGPT/RLHF), and the official model cards from Anthropic, OpenAI, Meta, Google DeepMind, Mistral, and Alibaba Qwen.
If you notice that any of the content is incorrect, citations are missing or wrong, or if you have any suggestions for improvement, please feel free to reach out to us at: