Steven's Knowledge

Machine Learning Basics

Core ML vocabulary every engineer should share before talking about modern AI

Modern AI rides on top of decades of classical machine learning. Even when the day-to-day work is just calling a foundation model, the failure modes — overfitting, distribution shift, label noise, bad evaluation — are inherited from classical ML. This page is the shared vocabulary I want any engineer joining the AI conversation to have.

Supervised, Unsupervised, and Reinforcement Learning

The three classic paradigms still cleanly describe most modern systems:

  • Supervised — learn a mapping from inputs to known labels. Image classification, spam detection, most fine-tuning.
  • Unsupervised — find structure without labels. Clustering, dimensionality reduction, much of pre-training.
  • Reinforcement learning — learn a policy by interacting with an environment and receiving rewards. RLHF for LLMs, game-playing agents, robotics.

The Bias–Variance Tradeoff

Every model balances:

  • Bias — the error from the model being too simple to capture the underlying signal.
  • Variance — the error from the model being so flexible it captures noise.

Underfitting is high bias; overfitting is high variance. Regularization, more data, and simpler model classes pull toward bias; bigger models and more features pull toward variance.

Train / Validation / Test Discipline

The single most common ML mistake is contaminating evaluation:

  • Train — what the model learns from.
  • Validation — what you use to tune hyperparameters and pick model variants.
  • Test — touched once at the end to estimate real-world performance.

If you tune on the test set, you no longer have an honest estimate. The same logic applies to LLM eval sets.

Common Failure Modes

  • Distribution shift — production data drifts away from training data.
  • Label noise — training labels themselves are wrong.
  • Leakage — a feature secretly contains the answer.
  • Spurious correlations — the model learns the wrong shortcut.

Why This Still Matters

Even when "the model" is an API call to a frontier LLM, you are still building a machine learning system: choosing data, defining evaluation, watching for distribution shift. The classical lens is what keeps modern AI work honest.

On this page