Simon Willison’s Weblog

llm.c (via) Andrej Karpathy implements LLM training—initially for GPT-2, other architectures to follow—in just over 1,000 lines of C on top of CUDA. Includes a tutorial about implementing LayerNorm by porting an implementation from Python.

Posted 9th April 2024 at 3:24 pm


Tags: c, ai, andrej-karpathy, generative-ai, llms, gpt-2
