LLMs 101

Trends & explainers

What's actually happening
in AI right now

Short, plain-English explainers on the biggest shifts — written for people who want to understand AI without a technical background.

Latest explainers
Trend spotlight 15 May 2026 5 min read

The Autonomous Agent Era — From Chatbots to AI That Gets Things Done

The shift from "AI that answers questions" to "AI that completes tasks" is the biggest structural change in how we use these tools. Here's what changed, why it matters, and what it means for how you work.

The old way
You type a prompt → AI gives you a text response → You copy it into a document → You look up the next fact yourself → Repeat.
The new way
You give a single goal → AI plans 5 steps → searches the web → writes the content → checks its own facts → delivers a finished result.
Explainer 1 May 2026 4 min read

Why AI Prices Have Dropped 100x in Two Years

The cost of running frontier AI has fallen faster than almost any technology in computing history. GPT-3's 2020 pricing would cost roughly $20 for the same task that costs $0.002 today. Here's what drove it and what it changes.

2023 pricing
Processing 1 million words via GPT-4 cost roughly $60. Meaningful AI use was expensive enough to require budget approval.
2026 pricing
The same 1 million words now costs under $1 on most frontier models. At DeepSeek rates: under $0.15. AI cost is now negligible for most tasks.
Explainer 15 April 2026 4 min read

Why Every Major AI Now Has a "Thinking" Mode

OpenAI's o1, Claude's extended thinking, Gemini's deep research, DeepSeek R1 — every frontier model now offers a slower, more deliberate mode that reasons through problems before answering. What is it actually doing?

Standard mode
Model reads your prompt → immediately predicts the most likely response → delivers answer in seconds. Fast, fluent, occasionally wrong.
Reasoning mode
Model generates hundreds of "thinking" tokens working through the problem step by step before producing its final answer. Slower. Dramatically more accurate on hard problems.
Deep dive 1 April 2026 6 min read

What DeepSeek R1 Actually Proved — And Why It Shook the Industry

In January 2025, a Chinese lab released an AI model that matched OpenAI's best reasoning model — reportedly trained for $6 million, versus hundreds of millions. The AI industry had a week-long panic. Here's what actually happened and why it matters.

The assumption before R1
Frontier AI required hundreds of millions in compute. Only well-capitalised Western labs could compete at the top. The moat was hardware access.
After R1
Algorithmic efficiency can substitute for raw compute. The training cost advantage was real. The frontier is more reachable than assumed. The moat is narrowing.
Explainer 15 March 2026 3 min read

The Context Window Arms Race — What 1 Million Tokens Actually Means

Gemini reached a 1 million token context window. Claude supports 200,000. Even the smallest models now handle book-length documents. What changed, and does it actually matter for how most people use AI?

2023 context limits
Most models maxed at 4,000–8,000 tokens — a few pages of text. Long documents had to be chunked and passed in pieces. RAG was essential.
2026 context windows
1 million tokens = a 750,000-word novel, or an entire large codebase. Tasks that required complex RAG pipelines now just need one prompt.