Andrej Karpathy – TomBolton.io

Building Intuition

Machine Learning

In my last post, I discussed how Andrej Karpathy fell a little short on building intuition around the attention mechanism in his Neural Nets – Zero to Hero series. In that post I shared how I was doing my own diagrams of…

Tom Bolton

October 21, 2024

Mastering Attention

Machine Learning

I’ve been going through Andrej Karpathy’s Neural Nets: Zero to Hero playlist. I’ve spent the last few days on one of the later ones: Let’s build GPT: from scratch, in code, spelled out. As an aside, I should note that…

Tom Bolton

October 18, 2024

Andrej Karpathy to the Rescue (of Course)

Machine Learning

What a difference a great teacher makes. Recently, I completed the Coursera Generative AI with Large Language Models course. As I alluded to on LinkedIn, it was a letdown. It didn’t explain the mechanics of how LLMs work. The closest…

Tom Bolton

September 23, 2024

Checking in on Udacity’s Reinforcement Learning Nano-degree

Machine Learning

I started the nano-degree over a year ago last year. The company I was working for at the time was in some dire straits and I had a lot going on so I dropped it back then. The company ended…

Tom Bolton

January 16, 2021

Calculating Cost (and Other Important Details) Part 1 – Softmax Hypothesis

Machine Learning

Back in this post I alluded to the fact that I hadn’t yet written out the approach I’m going to use to calculate costs not just for a single output as in Andrej Karpathy’s Pong from Pixels but from 48 outputs representing…

Tom Bolton

August 9, 2018

Why Log Probabilities? Now I Know.

Machine Learning

Back when I first read Andrej Karpathy’s revelatory Deep Reinforcement Learning: Pong from Pixels and wrote about it back in this post, I saw that he kept referring to the log probability of any given movement rather than the actual probability….

Tom Bolton

July 19, 2018

Confirming my Intuition

Machine Learning

Andrej Karpathy’s Pong from Pixels has been my exemplar for implementing reinforcement learning. I find that in general, I’m following along with his approach, and I feel that conceptually, it’s making sense. But as I set out to actually create my…

Tom Bolton

June 12, 2018

Measuring Success, Part 1

Machine Learning

So with these classifiers and unambiguous accuracy numbers when compared to ground truth, it’s dead simple to evaluate success. But what about this checkers game? Unlike the pong from pixels example, my game won’t be playing against a functioning, trained…

Tom Bolton

June 11, 2018

Optimization, Metrics and Graphs (oh my)

Machine Learning

It’s been a while since I wrote about my checkers project. For a few weeks there, I was able to give it a lot of consideration because I was doing the first course in the Deep Learning specialization, Deep Neural…

Tom Bolton

June 8, 2018

A Revelation

Machine Learning

My personal experience with Neural Networks is that everything became much clearer when I started ignoring full-page, dense derivations of backpropagation equations and just started writing code…(I don’t believe [a lot of math] is necessary and it can sometimes even…

Tom Bolton

May 30, 2018