Qwen-Image

September 29, 2025

GPT4o’s image generation was a remarkable event, beyond the brief Ghiblification of all social media.GPT-4o offered significantly more steerability than earlier image generation models,, while offering image quality in the ball park of the best diffusion models. Qwen-Image gives a similar level of fidelity and accuracy and is an open-weights model with a pretty decent technical report: QwenLM/Qwen-Image.

Automation & Managerial Control

September 3, 2025

There’s a chart making the rounds that caused Tim Lee over at Understanding AI to rewrite his recent (excellent!) article aboutthe impact of AI on jobs. MIT’s Erik Brynjolfsson and colleagues found¹ that young workers in AI-exposed jobs² have seen their employment drop by 13% since ChatGPT arrived. Meanwhile, their older colleagues in the same fields are doing just fine.

Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of Artificial Intelligence — Stanford Digital Economy Lab ↩
Like programming and accountancy, knowledge work fields that have a large amount of machine interaction ↩

A Primer on Post-Training

September 2, 2025

A Primer on LLM Post-Training – PyTorch

Layouts

August 26, 2025

You could have invented CuTe hierarchical layout (but maybe not the rest of it?) : ezyang’s blog

The TPU book, on GPUs

August 19, 2025

[How to Think About GPUs

How To Scale Your Model](https://jax-ml.github.io/scaling-book/gpus/)

Extending Arcee’s FM context length

August 13, 2025

Extending AFM-4.5B to 64k Context Length

Rubrics

August 12, 2025

Pre-training is about making AI correct, post-training is about making AI helpful¹. That helpfulness is (primarily) shaped by reinforcement learning. RL for LLMs really took off with RLHF (RL from Human Feedback), which trained based on the score from a reward model.

Correct in predicting the next token, and helpful, honest and harmless, specifically. ↩

Constraints & Orchestrators

August 6, 2025

I recently read a few posts that helped connect the dots on why Python is a) so successful as the lingua franca of ML b) also seems likely to be successful in the future¹.

Beyond just sheer momentum, of course. ↩

Overthinking Everything

August 4, 2025

The Tools Are Made Up

July 30, 2025

It has been hard to keep up with the flurry of strong agentic open-source models coming out of Chinese labs recently, including Moonshot’s Kimi K2, Z.ai’s GLM 4.5, and Qwen3-Coder¹.

Which seems a very solid model, but they haven’t released a lot of extra details about how they got there. One interesting component of the release though was that they forked Gemini CLI to make a qwen-code tool that works with any OpenAI compatible API, and I had some success locally plugging it into the smaller Qwen3 (non-coder) releases in case you were looking for some offline agentic capabilities! ↩