TomBolton.io

Tom Bolton’s AI and Machine Learning Lab Notebook.

Machine Learning

A couple of days ago, I got all the machinery of my network in place and did my first test. I decided that for starters, using a network I wasn’t even sure worked for a problem I’ve never tackled before…

Machine Learning

Having established in my previous post that softmax looks like the way to go for my final activation layer it’s time to think about the cost function. And this one is trickier. Hypothesis: Use Mean Squared Error Cost Function The…

Machine Learning

Back in this post I alluded to the fact that I hadn’t yet written out the approach I’m going to use to calculate costs not just for a single output as in Andrej Karpathy’s Pong from Pixels but from 48 outputs representing…

Machine Learning

This morning, I completed the “plumbing” for checkers. Specifically, with all the game mechanics in place, I now have a framework that will allow two “players” (a red player and a black player) to play against each other ad infinitum…

Machine Learning

There’s a gap  in the courses I can take right now, so I have about two weeks to push forward with checkers and I started again on that yesterday. As I was starting to put together some of the classes…

Machine Learning

This is one of those posts that is mostly intuition on the main points, and since I already find myself laughing at some of my early intuitions concerning AI after my first AI course please take this with a grain…

Machine Learning

Back when I first read Andrej Karpathy’s revelatory Deep Reinforcement Learning: Pong from Pixels and wrote about it back in this post, I saw that he kept referring to the log probability of any given movement rather than the actual probability….

Machine Learning

It’s been a while since I posted anything here. In addition to doubling up on my AI course load, I’ve been doing some work on the side for a colleague who’s VP of Development for an AI startup that does…

Machine LearningVideo Games

Last night, after working some more on my checkers algorithm, before going to bed I decided to unwind by playing a little Destiny 2. With checkers and Destiny 2 on my brain, I had a dream in which I was…

Machine Learning

Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. I find that in general, I’m following along with his approach, and I feel that conceptually, it’s making sense. But as I set out to actually create my…