Girish Gupta
Beyond the Parrot

Modern AI: anatomy, agency, and a world beyond the stochastic parrot.

  • June 18, 2026

    From Latin Digits to Babylonian Cuneiform: Number Helices Across Scripts

    Language models lay each number out as a point on a helix. Here I rebuild that helix across eight models and ten writing systems and bases, from Arabic to Babylonian cuneiform, to see how the geometry follows value, glyphs, base, and place value.

    Read post →
  • June 16, 2026

    Smuggling a Globe Into a Classifier

    A recent BlueDot AI-safety puzzle hid one feature non-linearly inside a tiny classifier. Finding it was a lesson in geometry. For the open-ended task I turned the lesson around and sculpted a feature's geometry on purpose, training a model whose hidden layer is a globe that makes no difference to a single one of its outputs.

    Read post →
  • February 24, 2026

    Back in Cambridge with ERA

    No one steps into the same River Cam twice, for the river is not the same and nor is the person. I was last at King's College, Cambridge twenty years ago, beginning my studies in physics.

    Read post →
  • January 13, 2026

    Introducing "Beyond the Parrot"

    In 2021, Emily Bender and her collaborators coined the phrase "stochastic parrots" to describe Large Language Models (LLMs).

    Read post →
  • January 13, 2026

    What I Learned Building and Training an LLM from Scratch

    In this post, I’ll share what surprised me most about building an LLM from scratch—where structure finally became visible. I wanted to write an autoregressive, transformer-based, decoder-only Large Language Model—like GPT, LLaMA, etc.—without too many abstractions and hand-holding.

    Read post →
  • March 12, 2025

    An Introduction to AI for Investigation: Theory and Practice

    The following is an edited, Claude-generated summary of a Whisper-generated transcript of a guest lecture I gave at Berkeley's Human Rights Center on March 4, 2025.

    Read post →