Machine Learning
I have been considering where this effort has brought me and where it might go. When I was selecting a project to work on, the first option I considered, just because it was absurdly simple, was Tic Tac Toe. After…
Machine Learning
I did the final optimization of my setup using Policy Gradient Loss with Reward. Rather than rewarding all moves in a game equally, I implemented discounting, whereby the move that generated the win is given the full reward of 1, and the…
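As a rough illustration of the discounting idea in this excerpt (not the post's actual code), here is a minimal sketch. The function names, the discount factor `gamma`, and the REINFORCE-style form of the loss are all assumptions; the excerpt is cut off before it says how earlier moves are weighted.

```python
import math

def discounted_rewards(num_moves, final_reward=1.0, gamma=0.9):
    """One reward per move: the last (winning) move gets final_reward,
    and each earlier move is scaled down by another factor of gamma.
    gamma is a hypothetical value; the post does not state one here."""
    return [final_reward * gamma ** (num_moves - 1 - i) for i in range(num_moves)]

def policy_gradient_loss(move_probs, rewards):
    """Assumed REINFORCE-style loss: the negative mean of
    log(probability of the chosen move) weighted by that move's reward."""
    return -sum(math.log(p) * r for p, r in zip(move_probs, rewards)) / len(rewards)

# A three-move winning game with gamma = 0.5.
rewards = discounted_rewards(3, final_reward=1.0, gamma=0.5)
# Hypothetical probabilities the policy assigned to the moves it played.
loss = policy_gradient_loss([0.5, 0.25, 0.5], rewards)
```

With this weighting, the winning move contributes most to the gradient, while earlier moves still receive some credit.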
Machine Learning
I really can’t believe how much I’ve managed to do since the last post. I was still talking about dropout and projection layers and forgetting. My goodness, has a lot happened.
The Old Convolutional Model
My previous model input the…
Machine Learning
Dropout helped the model a lot, but not by reducing overfitting. Instead, adding dropout layers to the model radically improved the speed at which it was able to increase its win rate. Here is pre-dropout performance for the first…