TomBolton.io

Tom Bolton’s AI and Machine Learning Lab Notebook.

math

Machine Learning

Having established in my previous post that softmax looks like the way to go for my final activation layer, it’s time to think about the cost function. And this one is trickier. Hypothesis: Use Mean Squared Error Cost Function. The…

Machine Learning

When I first read Andrej Karpathy’s revelatory Deep Reinforcement Learning: Pong from Pixels and wrote about it in this post, I saw that he kept referring to the log probability of any given movement rather than the actual probability…

Machine Learning

Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. In general, I’m following along with his approach, and conceptually it’s making sense. But as I set out to actually create my…