Why the schema introspector beats hand-written converters
We shipped a 175-line BIO-to-spans converter for the Kaggle PII dataset. Then we threw it away and built a column sniffer. Here's why the sniffer wins.
ReadEngineering blog
These posts support the product and the Academy: dataset ingestion, task-aware evaluation, and design decisions behind local, auditable small language model workflows. No content mill, just the implementation details that matter.
We shipped a 175-line BIO-to-spans converter for the Kaggle PII dataset. Then we threw it away and built a column sniffer. Here's why the sniffer wins.
ReadOne generic "pass rate" metric flatters classification and lies about NER. The story of a real eval bug, how the dispatcher fixes it, and what each task handler does that the others can't.
ReadMost gamified developer tools feel infantilizing. We tried it anyway. The Lab Journal progression layer, the design rules we kept, and what we explicitly refused to ship.
ReadWant to write one?
If you've used BrewSLM in production and found something interesting — a workflow insight, a metric you wish existed, a bug story worth retelling — open an issue on the repo with the title and a paragraph. We'll either point you at the right place to publish or run it here with attribution.
$ open https://github.com/TensorGreed/__SLM__/issues/new