Hidden Ingredients of AI Creativity
- Jul 2
- 2 min read
Updated: Jul 4

I've often discussed the phenomenon we call "hallucination" the surreal, seemingly nonsensical creativity displayed by AI, like multi-fingered hands generated by DALL·E. My intuition always suggested this wasn't merely a quirky glitch. And I'm glad I stuck to my instinct because Webb Wright’s recent Quanta piece confirmed it: these “mistakes” aren’t bugs, they’re features baked into the very math of diffusion models. Let me walk you through what this means and why it genuinely fascinates me.
1. Not bugs, but blurred brilliance
Diffusion AIs work by shredding images into noise and then stitching them back together. For years, we treated their odd outputs (extra digits, stretched limbs) as glitches. But research by Stanford’s Mason Kamb and Surya Ganguli shows that those quirks arise from two essential constraints: locality (working patch‑by‑patch) and translational equivariance (treating shifted patches consistently). Out of these constraints, creativity emerges mathematically! They even built a theoretical model "the ELS machine" that uses only these two properties. Astonishingly, it recreates diffusion‑model outputs with ~90% accuracy. What we thought was whimsical randomness is actually deterministic artistry woven by the model’s own architecture.
2. A mirror to human creativity
This insight resonated deeply with me. It’s as if the model’s constraints, its “rules” plus imperfect vision, force it into improvisation. In life, our own creative spark often emerges from constraints too: resource limits, tight deadlines, curious constraints of medium. These very limits force unexpected combinations and imaginative leaps. Could it be that human creativity, in part, is a non‑perfect solution to our own local and equivariant constraints? As Ben Hoover suggests, “Human and AI creativity may not be so different… we assemble things based on what we experience… AI is also just assembling… what it’s seen." This isn’t romanticizing AI. It’s a shared rhythm—architecture shaping imagination. And that fascinates me.
3. What about language?
Kamb and Ganguli focus on image diffusion models. But what about Large Lanaguage Models (LLMs) chatbots, story‑writers, code‑generators? They don’t rely on the same locality/equivariance mechanisms but they too display creativity. Maybe their “creativity” stems from attention patterns, tokenization quirks, or training dynamics. Exploring that frontier feels like staring into a room full of mirrors, in one direction you see diffusion models, another reveals LLMs, and all around is the possibility that creativity is structure inviting novelty.
For creators, artists, thinkers this matters because it suggests novelty is inevitable. We can shape and tune constraints to amplify the creative leap. What if we designed AI with intentional “quirks”? What if we trained human teams with structures that amplify serendipity? For the curious, it reconnects us to the humility that creativity (human or silicon) comes from our limits, our broken windows, our imperfections.
If AI's creativity and wildly imaginative outputs aren't magic but are baked into the very math of diffusion models then perhaps our creativity, too, is a shape-crafting dance with constraints. And if creativity is inevitable, maybe so is wonder.



Comments