What productive struggle actually means
Productive struggle is engaging with a task that's hard enough to require real effort — organizing your thoughts, hitting a wall, trying a different approach, failing, adjusting — yet still achievable with support. It's not frustration. It's not busywork. It's the zone where the brain builds capacity.
The key word is "productive." Not all difficulty helps. A confusing interface is unproductive friction. A language barrier is unproductive friction. But the effort of working through a genuine intellectual challenge — that builds the neural pathways that make future thinking possible.
AI collapses this zone. Why struggle with a hard problem when you can get the answer in eight seconds?
Four things that break
Bellwether identifies four cognitive areas where AI-enabled shortcuts undermine learning:
1. Memory and processing
When students actively wrestle with difficult material, they encode it into long-term memory. The effort of organizing, connecting, and struggling with information is what moves it from working memory into lasting knowledge. When AI produces the answer, the student skips that encoding entirely. The work looks done. Nothing was retained.
2. Attention and engagement
There's a "flow zone" where challenge is high enough to prevent boredom but manageable enough to avoid giving up. This zone is where sustained, deep attention happens. AI collapses it. Why maintain focus on a hard problem when the answer is immediate? Students build cognitive endurance through challenging tasks — remove the challenge and the endurance never develops.
3. Motivation and mindset
Success on a hard task triggers intrinsic motivation — the feeling of "I figured this out." That self-satisfaction drives deeper engagement with the next challenge. AI gives you the answer without the satisfaction. Students who used ChatGPT for creative tasks reported less effort but also less enjoyment and lower perceived value. Easier was not better. It was emptier.
4. Metacognition and self-regulation
Struggle forces you to notice what you don't understand. "I thought I knew this, but I can't explain it." That moment of reckoning — the awareness of your own gaps — is a skill that only develops through friction. AI lets students skip it entirely. Bellwether calls it "metacognitive laziness," and it's not the students' fault. It's a design problem.
The data behind it
The report compiles research from multiple studies:
Undergraduates using AI for research experienced lower cognitive load but produced lower-quality arguments than those using traditional search. Easier process, worse outcome.
High school math students using basic ChatGPT showed improved short-term performance but worse long-term retention. They got better at the test and worse at the math.
Students receiving ChatGPT support in writing improved their essay scores without gaining transferable knowledge. The essay improved. The writer didn't.
Nearly half (47%) of university student-AI interactions were "direct" — seeking answers with minimal engagement. Not asking for help understanding. Asking for the answer.
When the default experience is frictionless, the brain takes the path of least resistance. Every brain does. This isn't about discipline. It's about design.
The finding that changes everything
Here's what makes this report actionable, not just alarming:
Basic ChatGPT
Students used ChatGPT with no constraints. Short-term performance improved, but long-term retention got worse; the gains didn't persist.
Configured as tutor
ChatGPT was set up to refuse direct answers and prompt problem-solving instead. Students maintained their performance gains over time.
Same tool. Different design. Opposite outcome.
When AI was configured to challenge instead of answer — to push students to think through problems rather than skip past them — the cognitive benefits persisted. The AI became a training partner, not a shortcut. It preserved the productive struggle instead of eliminating it.
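The report doesn't publish the exact configuration, but "tutor mode" is at heart a system-prompt design choice rather than a different model. A minimal sketch of that idea, assuming a standard chat-message format; the prompt wording, `TUTOR_SYSTEM_PROMPT`, and `tutor_messages` are illustrative, not the study's actual setup:

```python
# Hypothetical sketch of a tutor configuration: the system message, not the
# model, is what forbids direct answers and redirects toward problem-solving.
TUTOR_SYSTEM_PROMPT = (
    "You are a math tutor. Never state the final answer to a problem. "
    "Ask the student what they have tried, point out where their reasoning "
    "breaks down, and suggest the next step for them to attempt themselves."
)

def tutor_messages(student_question: str) -> list[dict]:
    """Wrap a student's question in the tutor configuration."""
    return [
        {"role": "system", "content": TUTOR_SYSTEM_PROMPT},
        {"role": "user", "content": student_question},
    ]

# Example: the same question, now framed so the model must coach, not solve.
messages = tutor_messages("What is the solution to 3x + 5 = 20?")
```

The resulting `messages` list can be passed to any chat-completion API; the point of the report's finding is that this small design layer, not the underlying tool, determines whether the struggle is preserved or skipped.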
What this means for schools
The report is clear: design and implementation determine outcomes. Simply providing AI access creates default reliance. The responsibility extends beyond teachers to ed tech developers, system leaders, and researchers.
Three things have to change:
How much cognitive effort AI should alleviate versus preserve. Some friction is unproductive and should be removed — formatting, access barriers, language barriers. Other friction is productive and must be protected — formulating arguments, evaluating evidence, making decisions under uncertainty.
Guardrails preventing over-scaffolding. When AI does too much of the cognitive work, it doesn't help the student — it replaces them. The task gets completed. The learning doesn't happen.
Task redesign ensuring AI supports rather than replaces thinking. This is not about restricting AI. It's about designing tasks where AI amplifies the intellectual challenge instead of resolving it. The student should finish the task having thought harder, not having thought less.
The question is not whether students will use AI. In Latin America, 92% already do. The question is whether anyone will redesign the educational experience so that AI strengthens thinking instead of replacing it.
Source: "Productive Struggle: How Generative AI Might Impact Learning" — Bellwether, June 2025.
bellwether.org/publications/productive-struggle/