Machine Learning
I have updated my model to be a conv net. In addition to the piece vectors, I am now feeding the 4×8×4 board state into parallel 3×3 and 5×5 conv layers with an 8×4 output and 16 channels each for…
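As a rough sanity check on those shapes, here is a minimal sketch of the arithmetic for the two parallel branches. It is not the actual model code. It assumes the 4×8×4 board state means 4 input planes on an 8×4 grid, stride 1, zero padding chosen to preserve the 8×4 size, one bias per output channel, and that the branches are concatenated along the channel axis. All function names here are made up for illustration.

```python
# Shape and parameter arithmetic for two parallel conv branches.
# Assumptions (not from the post): 4 input planes on an 8x4 grid, stride 1,
# zero padding that preserves the 8x4 size, and a bias per output channel.

def same_padding(kernel: int) -> int:
    """Padding per side that preserves spatial size at stride 1 (odd kernels)."""
    return (kernel - 1) // 2

def conv_out(size: int, kernel: int, padding: int, stride: int = 1) -> int:
    """Standard convolution output-length formula."""
    return (size + 2 * padding - kernel) // stride + 1

def conv_params(in_ch: int, out_ch: int, kernel: int) -> int:
    """Kernel weights plus one bias per output channel."""
    return out_ch * in_ch * kernel * kernel + out_ch

IN_CH, ROWS, COLS, OUT_CH = 4, 8, 4, 16

branches = {}
for k in (3, 5):
    p = same_padding(k)
    branches[k] = {
        "padding": p,
        "out_shape": (OUT_CH, conv_out(ROWS, k, p), conv_out(COLS, k, p)),
        "params": conv_params(IN_CH, OUT_CH, k),
    }

# Channel count if the two branches are concatenated (an assumption).
combined_channels = sum(b["out_shape"][0] for b in branches.values())

for k, b in branches.items():
    print(f"{k}x{k} branch:", b)
print("combined channels:", combined_channels)
```

Under these assumptions, a 3×3 kernel needs 1 cell of padding and a 5×5 kernel needs 2 to keep the 8×4 output, and the 5×5 branch carries roughly three times as many weights as the 3×3 branch.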
Machine Learning
With the last update, my objective assessment of the model’s progress was that it could not beat a beginner checkers player (me), and that I had a lot of work to do. The model is comically simple. It is a…
Machine Learning
After seeing unexpected behavior from the model in a bootstrap setting, I decided it was important to evaluate the different versions of the model against one another. So I set up an…
Machine Learning
I’ve finished changing my code to support a model designed to win games rather than guess legal moves. The model is the same, but the reward function, some game details, and much of the administrative framework have changed. The actual…
Machine Learning
I started actually doing AI again. As I was watching all the Karpathy videos, I found myself wanting to stop watching videos and get back to building. I could have gone out and built Karpathy’s nano-gpt model, but he already…