Jeremy Howard On AI Coding: Why LLMs "Cosplay" Understanding

Cover

📺 Today’s recommended deep-dive video: https://www.youtube.com/watch?v=dHBEQ-Ryo24

The AI Slot Machine: Jeremy Howard on “Vibe Coding” and Understanding Debt

AI models are “cosplaying” intelligence, providing a dangerous illusion of control while eroding the very skills that make engineers valuable. Deep learning pioneer Jeremy Howard argues that the path to real productivity lies not in outsourcing thought to LLMs, but in building interactive, high-feedback environments where humans and machines challenge one another.

Core Question: How can we leverage generative AI without falling into “understanding debt” or sacrificing our long-term technical competence?

Highlights

AI models excel at “combinatorial creativity” but often fail at “transformative insight” when pushed outside their training data.
The “Slot Machine” effect of AI coding creates a dopamine loop of luck rather than intentional, reasoned engineering.
Modern AI was built on the back of transfer learning (ULMFiT), proving that next-word prediction implicitly forces a model to build structural abstractions.
The real existential threat of AI isn’t autonomous destruction, but the centralization of power and the “enfeeblement” of human expertise.

⏱️ Reading time: approx. 8 minutes · Saves you about 79 minutes vs. watching.

Want to take notes while watching? Click the image below and let AI Notebook capture the key points for you 👇

The Illusion of Understanding and the ULMFiT Revolution

From Wikipedia to Movie Reviews

Jeremy Howard reflects on the skepticism surrounding fine-tuning in 2017, noting that most researchers fundamentally missed the value of transfer learning. At the time, NLP was dominated by specialized models that required massive amounts of labeled data to perform even basic sentiment analysis.

Howard’s breakthrough with ULMFiT demonstrated that high-capacity models require heavy regularization rather than smaller architectures to remain flexible yet controlled.

He describes a three-stage process—pre-training, mid-training, and post-training—where he used a gaming GPU to train a model to predict the next word on Wikipedia. This experiment proved that for a machine to predict the next word effectively, it must implicitly build a hierarchy of structural abstractions about how the world functions, from basic grammar to the concept of political leadership. By fine-tuning this “world model” on a tiny set of movie reviews, he beat specialized models that teams had spent entire PhDs perfecting.

A functional architecture diagram showing the three stages of ULMFiT: 1. General Domain Pre-training (Wikipedia corpus), 2. Target Task LM Fine-tuning (domain-specific data), 3. Target Task Classifier Fine-tuning (labeled data with discriminative learning rates).

💡 Digging Deeper

Q: Why use different learning rates for different layers?
A: Different layers capture different levels of information; earlier layers are more general, while later layers are more task-specific. Discriminative learning rates prevent “blowing out” the general representations of the early layers during fine-tuning.

Q: Is “dead neurons” a permanent problem in fine-tuning?
A: No, it’s a lack of observation; if you actually look at your activations and gradients, you can almost always fix neurons that have drifted toward zero or infinity.

Q: How does regularization allow for larger models?
A: Instead of shrinking the model, which limits potential, you add constraints like dropout or weight decay so the model stays flexible but avoids overfitting the noise.

“Vibe Coding” vs. Real Engineering

The Danger of the Slot Machine

Jeremy Howard critiques “vibe coding” as a form of high-stakes gambling where developers pull the lever of a prompt and hope for a win. This stochastic process creates a “loss disguised as a win” scenario, where the occasional successful code generation masks the reality that the developer no longer understands the underlying system.

Most developers are currently trading their long-term growth for the temporary dopamine hit of a successful, yet unvetted, AI-generated pull request.

Howard observes that while coding—the act of typing syntax—is effectively a “style transfer” problem for AI, software engineering remains a distinct discipline focused on design and abstraction. He argues that LLMs are “cosplaying” intelligence within the distribution of their training data, meaning they often fail disastrously when asked to solve genuinely novel architectural problems. This creates a terrifying “understanding debt” where organizations ship codebases that no human can explain, maintain, or debug when the protocol eventually changes.

A comparison table comparing "Coding" (Syntax, Style Transfer, Completion, Combinatorial Creativity) vs "Software Engineering" (Architecture, Abstraction, Contextual Design, Transformative Insight, Validation).

💡 Digging Deeper

Q: What is “desirable difficulty” in learning?
A: It is the concept that memories only form when the brain has to work hard to retrieve information; AI removes this friction, potentially preventing long-term skill acquisition.

Q: Why does Howard prefer “slope” over “intercept” in career growth?
A: Intercept is where you are now, but slope is how fast you are learning; focusing on AI results only cares about the intercept, leading to eventual obsolescence.

Q: Can AI do clean-room implementations?
A: Not really; things like “clean-room C compilers” in Rust are often just nonlinear averages of existing LVM code found in the training data, not original architectural designs.

Reclaiming the Interactive Loop

Beyond the Terminal

The solution to AI-induced enfeeblement isn’t to ban the technology, but to move beyond the “inhumane” text-file-and-terminal interface that has dominated for decades. Howard points to the roots of computing—Smalltalk, Lisp, and Mathematica—as the gold standard for human-computer interaction.

True productivity emerges when humans can manipulate objects in real-time, receiving immediate feedback that allows them to refine their mental models.

By putting the human, the AI, and the interpreter into a single rich dynamic environment like a notebook, we transform AI from a distant oracle into a collaborative tool. Howard’s mission with NBDEV is to allow developers to build production software inside these living documents, ensuring that every step is tested and understood. This interactive loop provides “reality pushback,” where the computer corrects both the human and the AI, preventing the hallucination-filled marathons common in toolsets like Claude Code.

A process map showing the interactive programming loop: 1. Human defines small component, 2. AI generates implementation, 3. Interpreter provides immediate visual/data feedback, 4. Human refines mental model and constraints, 5. Component is integrated into production codebase.

💡 Digging Deeper

Q: Are notebooks bad for version control?
A: Only if you don’t use the right tools; specialized merge drivers allow Git to treat notebooks as cell-based JSON objects, making them perfectly reproducible.

Q: How does Howard avoid bugs without a debugger?
A: By building in tiny, interactive steps where each piece of code is verified against real data immediately, leaving no room for bugs to hide in large, untested blocks.

Q: What is the “interactive stateful environment” advantage?
A: It allows the human brain to outsource low-level state tracking to the computer, freeing up cognitive resources for high-level design and architectural reasoning.

Key Takeaways

The transition from manual coding to AI-assisted development is not merely a change in tooling, but a fundamental shift in how we manage knowledge. We must resist the urge to centralize AI power or outsource our competence, as a company’s only real value is the “slope” of its employees’ learning. If we allow ourselves to become mere prompt-engineers for black-box systems, we are setting ourselves up for a massive collapse of technical infrastructure when these systems inevitably hit their limits.

Innovation is ultimately about “adaptivity”—the ability to find new constraints and build mental models that don’t yet exist in a training corpus. By utilizing interactive, exploratory programming environments, we can use AI to accelerate the boring parts of development while sharpening our own engineering “muscles.” The goal is a synthesis where the human provides the “why” and the “design,” while the machine handles the “how” under strict, human-verified constraints.

Q&A

Q1: Is Jeremy Howard worried about AI taking over the world?
A: He believes the real danger is the centralization of power in the hands of a few companies or governments, which could use that monopoly to enfeeble the rest of society.

Q2: What is the “Slot Machine” effect in coding?
A: It’s the addictive, stochastic loop where developers pull the lever of a prompt, get a win, and feel in control, even though they don’t understand the generated output.

Q3: Why did ULMFiT change the industry?
A: It was the first major demonstration that pre-training on a general corpus (like Wikipedia) could produce a “world model” that is easily fine-tuned for specialized tasks.

Q4: What is “Understanding Debt”?
A: It occurs when a company relies on AI-generated code that no one on the team fully understands, leading to an inability to maintain or evolve the product over time.

Q5: Why does Jeremy Howard dislike traditional software engineering workflows?
A: He finds “dead” text-file environments inhumane and prefers “live” environments like Smalltalk or notebooks where you can touch and manipulate data in real-time.

Q6: Can AI learn to be creative?
A: AI is excellent at “combinatorial creativity” (mixing existing ideas), but it struggles with “transformative creativity” or moving outside the distribution of its training data.

Q7: What is the key to using AI as a “learning superpower”?
A: Using it in an environment that provides “desirable difficulty” and immediate feedback, rather than one that simply gives you the final answer without effort.

TeraBox Blog | 1TB Free Cloud Storage & All-in-One AI Space

Jeremy Howard on AI Coding: Why LLMs “Cosplay” Understanding

The AI Slot Machine: Jeremy Howard on “Vibe Coding” and Understanding Debt