Machine Learning
In my last post, I discussed how Andrej Karpathy fell a little short on building intuition around the attention mechanism in his Neural Networks: Zero to Hero series. In that post, I shared how I was doing my own diagrams of…
Machine Learning
I’ve been going through Andrej Karpathy’s Neural Networks: Zero to Hero playlist. I’ve spent the last few days on one of the later videos: Let’s build GPT: from scratch, in code, spelled out. As an aside, I should note that…