Beyond the Parrot - Girish Gupta

Modern AI: anatomy, agency, and a world beyond the stochastic parrot.

June 18, 2026
From Latin Digits to Babylonian Cuneiform: Number Helices Across Scripts
Language models lay each number out as a point on a helix. Here I rebuild that helix across eight models and ten writing systems and bases, from Arabic to Babylonian cuneiform, to see how the geometry follows value, glyphs, base, and place value.
Read post →
June 16, 2026
Smuggling a Globe Into a Classifier
A recent BlueDot AI-safety puzzle hid one feature non-linearly inside a tiny classifier. Finding it was a lesson in geometry. For the open-ended task I turned the lesson around and sculpted a feature's geometry on purpose, training a model whose hidden layer is a globe that makes no difference to a single one of its outputs.
Read post →
February 24, 2026
Back in Cambridge with ERA
No one steps into the same River Cam twice, for the river is not the same and nor is the person. I was last at King's College, Cambridge twenty years ago, beginning my studies in physics.
Read post →
January 13, 2026
Introducing "Beyond the Parrot"
In 2021, Emily Bender and her collaborators coined the phrase "stochastic parrots" to describe Large Language Models (LLMs).
Read post →
January 13, 2026
What I Learned Building and Training an LLM from Scratch
In this post, I’ll share what surprised me most about building an LLM from scratch—where structure finally became visible. I wanted to write an autoregressive, transformer-based, decoder-only Large Language Model—like GPT, LLaMA, etc.—without too many abstractions and hand-holding.
Read post →
March 12, 2025
An Introduction to AI for Investigation: Theory and Practice
The following is an edited, Claude-generated summary of a Whisper-generated transcript of a guest lecture I gave at Berkeley's Human Rights Center on March 4, 2025.
Read post →