Engineering blog

Engineering notes on small language model fine-tuning infrastructure.

These posts support the product and the Academy: dataset ingestion, task-aware evaluation, and design decisions behind local, auditable small language model workflows. No content mill, just the implementation details that matter.

2026 · Engineering · Dataset pipeline

Why the schema introspector beats hand-written converters

We shipped a 175-line BIO-to-spans converter for the Kaggle PII dataset. Then we threw it away and built a column sniffer. Here's why the sniffer wins.

Read

2026 · Engineering · Evaluation

Inside the task-aware eval handler dispatcher

One generic "pass rate" metric flatters classification and lies about NER. The story of a real eval bug, how the dispatcher fixes it, and what each task handler does that the others can't.

Read

2026 · Product · Design

Gamifying a dev tool without making it a toy

Most gamified developer tools feel infantilizing. We tried it anyway. The Lab Journal progression layer, the design rules we kept, and what we explicitly refused to ship.

Read

Want to write one?

Pitch a post via GitHub Issues

If you've used BrewSLM in production and found something interesting — a workflow insight, a metric you wish existed, a bug story worth retelling — open an issue on the repo with the title and a paragraph. We'll either point you at the right place to publish or run it here with attribution.

$ open https://github.com/TensorGreed/__SLM__/issues/new