Can advanced techniques be combined?

Yes — e.g. distill, then preference-tune, then quantize, then serve

What single discipline underlies every technique in the Academy?

Measuring against a baseline on a held-out gold set and keeping only what clears the gate

Track 4 · Advanced · Lesson 10

Capstone C: choosing the right technique, and graduating

After this capstone you can take any model goal and choose the right technique (or combination) from everything the Academy taught — and you're ready to claim your completion certificate.

Level: advanced Read time: ~10 min Prerequisites: Observability and drift in production

You've now seen the whole toolkit, from what a model is to running one in production. The last skill is the meta-skill: given a goal, knowing which tool to reach for. This lesson is that decision guide, and your graduation.

The decision guide: goal → technique

Start from the constraint, not the technique. Map your goal to the lesson that solves it:

GOAL                                          REACH FOR
-------------------------------------------   --------------------------------
Make a model do a specific task/format        SFT            (Tracks 1-2)
"Better"/tone/helpfulness, not one answer     DPO / ORPO     (4.5)
Match a big model's quality, but small         Distillation   (4.2-4.4)
Fit a device / cut serving cost                Quantization   (4.6, 4.8)
One model, several related tasks               Multi-task     (4.7)
The model needs facts it should look up        RAG / reroute  (Track 3.9)
Faster / cheaper endpoint                      Serving (vLLM) (4.8)
Catch quality decay in production              Drift checks   (4.9)
Don't know if a tweak helps                    A/B on a gold set (everywhere)

They compose — a worked example

Goal: a fast, on-device assistant that matches a frontier model's tone on your domain. No single technique gets there; you stack them:

Distill the frontier teacher into a small student (4.2–4.4) — size down at high quality retained.
Preference-tune the student with ORPO (4.5) — align the tone.
Quantize to GGUF Q4_K_M (4.6) — fit the device.
Serve efficiently and watch for drift (4.8–4.9) — keep it cheap and honest.
At every step, evaluate against a gold set and only keep changes that clear the gate.

The one rule under all of them

Every technique in this Academy is judged the same way: measure against a baseline on a held-out gold set, and keep only what clears the gate. That discipline — not any single method — is what separates a model that works from one that merely ran.

What you can do now

You started not knowing what a model was. You can now: explain next-token prediction and the Transformer; build, mask, and tokenize an SFT dataset; write a LoRA training loop by hand and read its curves; drive the entire BrewSLM lifecycle with full understanding of each stage; and choose among distillation, preference tuning, quantization, multi-task, RAG, and serving for a real constraint. That's the full arc from zero to shipping.

Graduate

That completes the BrewSLM Academy. Mark this lesson done, and if you've finished every track, head to your completion certificate to claim it. From here, the best teacher is a real project — pick a task you care about and ship it.

Key idea

Advanced ML is technique selection, then composition: start from the constraint, pick the tool, stack tools when one isn't enough, and judge every step against a gold set. You have the whole toolkit now — go build something and claim your certificate.

Key terms

technique selection: Choosing the method that matches a goal/constraint rather than defaulting to one.
composing techniques: Stacking methods (distill → preference-tune → quantize → serve) when one isn't enough.
decision guide: A goal→technique mapping across the Academy's methods.
the gold-set rule: Judge every change against a baseline on a held-out gold set; keep only what clears the gate.

Check yourself

Answers are saved to this browser.

Progress is stored locally in your browser.