Kudos to @karpathy (OpenAI/Tesla/Stanford) for his incredible video showing the details on how to build GPT from scratch, in code: I found it useful to actually watch him tokenize a bunch of data, turn it into a tensor, use PyTorch, try various models,…