Machine Learning

Vector Checkers is Complete

A couple of days ago, I finished my implementation of a completely vectorized checkers gameplay engine. With the current implementation, the model does all game operations for both sides on an entire batch of games at once. The most computationally…

Tom Bolton

March 16, 2025

Updates

Machine Learning

My vectorization of checkers is moving along. Before I talk about that, though, one thing came up as part of that process that is worth mentioning. I’m building the whole thing in a Jupyter Notebook step by step so I…

Tom Bolton

February 8, 2025

I’m Vectorizing the Shit Out of Checkers

Machine Learning

It’s been a while, but since I last wrote, I managed to implement the AlphaGo Zero gameplay/learning algorithm. I updated my code to do most of the important things that AlphaGo Zero does. My model now outputs policy probabilities and…

Tom Bolton

January 19, 2025

Endgame

Machine Learning

I have been considering where this effort has brought me and where it might go. When I was selecting a project to work on, the first option I considered, just because it was absurdly simple, was Tic Tac Toe. After…

Tom Bolton

December 14, 2024

The Final Optimization

Machine Learning

I did my final optimization of my setup using Policy Gradient Loss with Reward. Rather than reward all moves of games equally, I implemented discounting whereby the move that generated the win is given the full 1 reward, and the…

Tom Bolton

December 11, 2024

Lots of Action

Machine Learning

I really can’t believe how much I’ve managed to do since the last post. I was still talking about dropout and projection layers and forgetting. My goodness has a lot happened. The Old Convolutional Model My previous model input the…

Tom Bolton

December 8, 2024

Dropout

Machine Learning

Dropout helped the model a lot, but not for helping with overfitting. Instead, adding dropout layers in the model radically improved the speed at which it was able to increase its win rate. Here is pre-dropout performance for the first…

Tom Bolton

December 1, 2024

A Lesson in Over-Fitting

Machine Learning

I have updated my model to be a conv net. In addition to the piece vectors, I am now feeding the 4x8x4 board state into parallel 3×3 and 5×5 conv layers with an 8×4 output and 16 channels each for…

Tom Bolton

November 29, 2024

The Five-year Memory Hole

Machine Learning

With the last update, my objective assessment of the model’s progress was that it could not beat a beginner checkers player (me), and that I had a lot of work to do. The model is comically simple. It is a…

Tom Bolton

November 26, 2024

A Checkers UI to Develop Intuition

Machine Learning

After seeing unexpected behavior from the model in a bootstrap setting, I had decided that it was important to do some evaluation of the performance of the different versions of the model against one another. So I set up an…

Tom Bolton

November 24, 2024