Machine Learning
I’ve been going through Andrej Karpathy’s Neural Nets: Zero to Hero playlist. I’ve spent the last few days on one of the later ones: Let’s build GPT: from scratch, in code, spelled out. As an aside, I should note that…
I’ve been making slow progress on refactoring my code to play checkers in parallel. Actually, the goal is not merely to play in parallel, but something more like simultaneously. Parallel implies that games going on simultaneously have nothing to do…
Note: This post is the ultimate result of my quest to work out how to do softmax backpropagation in my hand-coded model for this project. The actual math for softmax backpropagation is not something that was specifically covered in my coursework….
Back in this post I alluded to the fact that I hadn’t yet written out the approach I’m going to use to calculate costs not just for a single output, as in Andrej Karpathy’s Pong from Pixels, but for 48 outputs representing…
It’s been a while since I posted anything here. In addition to doubling up on my AI course load, I’ve been doing some work on the side for a colleague who’s VP of Development for an AI startup that does…
Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. I find that in general, I’m following along with his approach, and I feel that conceptually, it’s making sense. But as I set out to actually create my…
Yesterday, I put some of the plumbing in place to eventually run a checkers game for one or two AIs. Specifically, the board is represented, the fundamentals of pieces are represented, and the system has a way, for any given…
The one disappointing thing in the results of the Python port of my NN was the time it took to learn the training set. I wasn’t sure what the reason was, and had speculated that it might have…
As I mentioned, my checkers-playing AI is on hold until I have a few more viable tools available to do what’s necessary. However, since I can’t stop thinking about the problem no matter what I do, it occurs to me…
Before moving on to my next project, I thought a good segue would be to discuss the actual results of porting my NN to Python and some open issues that I have yet to resolve. In general, the port seems…