A Meticulous Guide to Advances in Deep Learning Efficiency over the Years
November 29, 2024
Needless text
November 29, 2024
November 28, 2024
Training Language Models to Self-Correct via Reinforcement Learning – Aviral Kumar, Vincent Zhuang et al at Deepmind
November 27, 2024
I’ve been following the development of the higher order ops in PyTorch nightlies for a little bit, and got a chance to try out while_loop. The best examples right now are in the tests, but as another, here’s a mandlebrot example:
November 26, 2024
November 25, 2024
The error message you get actually tells you the fix, but I found it non-intuitive to what I was doing enough I was hesitant to actually just try the config:
November 20, 2024
I think I did know this at some point, but I was reminded today that unlike the (sometime) black box that is CUDA https://github.com/ROCm/ROCm is actually available on Github, which is operationally much nicer!
November 19, 2024
Whenever I am trying to do something fun (read: poorly considered) with torch.compile I find myself googling for the same handful of references, most of which are in the PyTorch Drive folder, so putting them here for my own reference.
November 15, 2024
Every time I have to set up a clean system I manage to mess up cuda somehow, so leaving this as an aide memoir. In general, there is a default nvidia-cuda-toolkit package that ships with Ubuntu-based systems, and you should ignore that. The right options are either:
November 5, 2024
A very good passing comment from Nikita at work. When folks train in eager mode they are generally able to leverage the full set of aten ops, with their attendant implementations in different backends.
November 2, 2024