Machine Learning
Back in this post I alluded to the fact that I hadn’t yet written out the approach I’m going to use to calculate costs not just for a single output, as in Andrej Karpathy’s Pong from Pixels, but for 48 outputs representing…
Machine Learning
This morning, I completed the “plumbing” for checkers. Specifically, with all the game mechanics in place, I now have a framework that will allow two “players” (a red player and a black player) to play against each other ad infinitum…
Machine Learning
There’s a gap in the courses I can take right now, so I have about two weeks to push forward with checkers, and I started on that again yesterday. As I was starting to put together some of the classes…
Machine Learning
This is one of those posts that is mostly intuition on the main points, and since I already find myself laughing at some of my early intuitions concerning AI after my first AI course, please take this with a grain…
Machine Learning
When I first read Andrej Karpathy’s revelatory Deep Reinforcement Learning: Pong from Pixels and wrote about it in this post, I saw that he kept referring to the log probability of any given movement rather than the actual probability….
Machine Learning
It’s been a while since I posted anything here. In addition to doubling up on my AI course load, I’ve been doing some work on the side for a colleague who’s VP of Development for an AI startup that does…
Machine Learning, Video Games
Last night, after working some more on my checkers algorithm, I decided to unwind before bed by playing a little Destiny 2. With checkers and Destiny 2 on my brain, I had a dream in which I was…
Machine Learning
Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. I find that in general, I’m following along with his approach, and I feel that conceptually, it’s making sense. But as I set out to actually create my…
Machine Learning
So with these classifiers and unambiguous accuracy numbers when compared to ground truth, it’s dead simple to evaluate success. But what about this checkers game? Unlike the Pong from Pixels example, my game won’t be playing against a functioning, trained…
Machine Learning
It’s been a while since I wrote about my checkers project. For a few weeks there, I was able to give it a lot of consideration because I was doing the first course in the Deep Learning specialization, Deep Neural…