Frontier LLM V3

Version 3.0 · 2026

The Frontier LLM Curriculum

12 Parts · 41 Chapters · From Fundamentals to Frontier Systems

📍 Choose a Learning Path

Novice Path

Ch 0 → Read straight through the stack

Engineer Track

Focus on Transformer + Production infrastructure

Researcher Route

Deep dive into Training, MoE, Reasoning

Executive View

Strategy, Competitive Moats, Frontier Thesis

🗺️Part 0 — Territory Map

⚡🎯

Ch 0

The Map of the Territory

The complete vertical stack — from raw data to AI-operated organizations

∑Part 1 — Mathematical Foundations

⚡🎯

Ch 1

Vectors

Words become points. Meaning becomes geometry.

Ch 2

Matrices

Nearly all of the computation in modern AI is matrix multiplication

Ch 3

Tensors

Multi-dimensional arrays that flow through every neural network

⚡🎯

Ch 4

Dot Product

The most important operation in AI — measures similarity between vectors

Ch 5

Information Theory

The mathematics of surprise, uncertainty, and prediction

🎯

Ch 6

Cross Entropy Loss

The heartbeat of training — measures how wrong the model is

Ch 7

Optimization

Finding the lowest valley in a billion-dimensional landscape

🧠Part 2 — Neural Networks

Ch 8

The Artificial Neuron

A mathematical simplification of biological neurons

Ch 9

Activation Functions

Without them, the entire network collapses to a single linear operation

Ch 10

Residual Connections

Skip connections that enabled 100+ layer networks

⚡Part 3 — Transformer Architecture

Ch 11

Tokenization

Converting text into numbers — the first step of every LLM

Ch 12

Embeddings

Tokens become vectors; vectors become meaning

⚡🎯

Ch 13

Attention

The invention that changed AI — every token can see every other token simultaneously

Ch 14

The Transformer Block

The repeating unit — stacked 32 to 120+ times in frontier models

Ch 15

Decoder-Only Architecture

The dominant architecture among modern LLMs

📈Part 4 — How Models Learn

⚡🎯

Ch 16

Backpropagation

The engine of learning — how errors flow backwards through the network

🎯Part 5 — Alignment Systems

🎯

Ch 17

RLHF

Reinforcement Learning from Human Feedback — what made ChatGPT useful

Ch 18

Constitutional AI

Anthropic's approach: principles over pure human labeling

Ch 19

DPO

Direct Preference Optimization — simpler and cheaper than RLHF

Ch 20

RLAIF

AI judging AI — scales where human feedback cannot

⚙️Part 6 — Inference Systems

Ch 21

What Is Inference?

Training creates the model. Inference makes the money.

⚡🎯

Ch 22

KV Cache

The single most important inference optimization — 10–100× speedup

Ch 23

FlashAttention

2–4× faster attention by reducing reads and writes to GPU memory, the memory bandwidth bottleneck

Ch 24

vLLM & Continuous Batching

Modern serving: keep GPUs as close to full utilization as possible

📊Part 7 — Evaluation Systems

Ch 27

Evaluation Systems

Without rigorous evaluation, you cannot know if you are improving

🖥️Part 8 — Distributed Training

Ch 31

Distributed Training

No single GPU can train a GPT-scale model; you need thousands working in concert

🔀Part 9 — Mixture of Experts

⚡🎯

Ch 33

Mixture of Experts (MoE)

Why DeepSeek can compete with 10× less compute

💭Part 10 — Reasoning Models

🎯

Ch 34

Reasoning Models

Trading latency for accuracy — think before you answer

🤖Part 11 — Agents, RAG & MCP

Ch 35

RAG — Retrieval-Augmented Generation

Giving models access to knowledge beyond their training cutoff

Ch 36

MCP — Model Context Protocol

The USB-C for AI — standardizing how models connect to the world

Ch 37

Agent Systems

From answering questions to taking autonomous action

🚀Part 12 — Building Frontier AI

Ch 38

The Competitive Landscape

Who the players are and what advantages they hold in 2026

Ch 39

The Minimum Frontier Stack

What you actually need to compete at the frontier

Ch 40

Real Competitive Moats

Not transformers anymore — everyone has transformers

Ch 41

The Frontier Thesis

Four stages of AI evolution: we are in the Stage 2→3 transition