Machine Learning
In my last post, I discussed how Andrej Karpathy fell a little short on building intuition around the attention mechanism in his Neural Nets – Zero to Hero series. In that post I shared how I was doing my own diagrams of…
Machine Learning
I’ve been going through Andrej Karpathy’s Neural Nets: Zero to Hero playlist. I’ve spent the last few days on one of the later ones: Let’s build GPT: from scratch, in code, spelled out. As an aside, I should note that…
Machine Learning
What a difference a great teacher makes. Recently, I completed the Coursera Generative AI with Large Language Models course. As I alluded to on LinkedIn, it was a letdown. It didn’t explain the mechanics of how LLMs work. The closest…
Machine Learning
I started the nano-degree over a year ago last year. The company I was working for at the time was in some dire straits and I had a lot going on so I dropped it back then. The company ended…
Machine Learning
Back in this post I alluded to the fact that I hadn’t yet written out the approach I’m going to use to calculate costs not just for a single output as in Andrej Karpathy’s Pong from Pixels but from 48 outputs representing…
Machine Learning
Back when I first read Andrej Karpathy’s revelatory Deep Reinforcement Learning: Pong from Pixels and wrote about it back in this post, I saw that he kept referring to the log probability of any given movement rather than the actual probability….
Machine Learning
Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. I find that in general, I’m following along with his approach, and I feel that conceptually, it’s making sense. But as I set out to actually create my…
Machine Learning
So with these classifiers and unambiguous accuracy numbers when compared to ground truth, it’s dead simple to evaluate success. But what about this checkers game? Unlike the pong from pixels example, my game won’t be playing against a functioning, trained…
Machine Learning
It’s been a while since I wrote about my checkers project. For a few weeks there, I was able to give it a lot of consideration because I was doing the first course in the Deep Learning specialization, Deep Neural…
Machine Learning
My personal experience with Neural Networks is that everything became much clearer when I started ignoring full-page, dense derivations of backpropagation equations and just started writing code…(I don’t believe [a lot of math] is necessary and it can sometimes even…