Explainer

The Lever and the Lie

In 1938, B.F. Skinner put a rat in a box with a lever. When the rat pressed the lever, food appeared. That much is famous. What is less remembered is what happened next, and why Skinner himself was frightened by it.

By xbard 29 March 2026 10 min read

The Box Got an Upgrade

Every major social media platform on Earth runs on variable ratio reinforcement. This is not a metaphor. It is a design principle.

Post a video. Sometimes it gets 2,000 views. Sometimes 8,000. Sometimes, if the algorithmic gods are generous, 400,000. You will never know which press delivers the food. You will never know why one piece flies and another dies. You just keep pressing.

The same mechanism operates on viewers. Scroll through a feed. Most of what you see is mediocre, irrelevant, mildly irritating. But every few swipes, something lands. Something funny, something enraging, something that makes you feel seen. You keep scrolling for that hit. The ratio is variable. The reinforcement is unpredictable. The behaviour, endless scrolling, is the product.

Instagram, TikTok, YouTube, X. These are not platforms. They are Skinner boxes with better graphic design and a much worse privacy policy.

But what Skinner discovered about rats is only the first layer. What these systems do to the people inside them, both creators and audiences, goes deeper than compulsion. It reaches into identity itself.

The Brain That Rewards You for Talking

In 2012, Diana Tamir and Jason Mitchell at Harvard published research in the Proceedings of the National Academy of Sciences that answered a question nobody had thought to ask precisely: why do humans spend so much time talking about themselves?

The numbers are striking. In ordinary conversation, 30 to 40 percent of what people say is about their own experiences. On social media, the figure rises above 80 percent. Nobody asks for this information. Nobody needs it. And yet people cannot stop providing it.

Tamir and Mitchell put participants in an fMRI scanner and gave them a choice: answer questions about yourself or answer questions about someone else. When people talked about themselves, two brain regions activated, the nucleus accumbens and the ventral tegmental area. These are the core of the mesolimbic dopamine system, the same reward pathway triggered by food, sex, and addictive drugs. The brain treats self-disclosure as a reward on the same neurological level as eating when hungry.

In a follow-up experiment, they attached different payments to each option. Sometimes participants could earn more money by answering questions about someone else. People were willing to forfeit payment for the chance to talk about themselves instead. The brain valued self-disclosure so highly it would sacrifice financial reward to get it.

Now consider what happens when someone posts a video and a million people watch them share their thoughts. Every signal to the brain says: people are listening to you talk about yourself. The nucleus accumbens fires not once but thousands of times across days and weeks. It is applause without eye contact, psychologically disorienting but powerfully addictive.

This is the mechanism that turns ordinary people into content machines. Not greed. Not vanity. Neurochemistry, exploited at scale by systems designed to maximise the time you spend pressing the lever.

The Drift You Cannot See

Here is where it gets worse.

Phil Reed, a professor of psychology at Swansea University, has written about a phenomenon he calls dopamine overdose. The popular understanding of dopamine is that it makes you feel good. Reward, pleasure, satisfaction. That is not wrong, but it misses the primary function.

Dopamine's core job in the brain is to signal change. It helps you detect that something in your environment has shifted, something new, something different from what you expected. It operates by contrast. You notice a small increase in dopamine against a low baseline the way you notice a candle being lit in a dark room.

But when the baseline is already elevated, when you are flooded with dopamine from constant notifications, engagement metrics, and reward signals, small changes become invisible. It is like trying to spot a candle in broad daylight. The signal is there. You cannot see it against the noise.

Reed points to studies showing that artificially increased dopamine levels during discrimination tasks, tasks requiring you to unlearn something old and learn something new, actually impair your ability to detect the new pattern. Too much dopamine makes you worse at recognising that something has changed.

Apply this to a content creator in the middle of a growth spike. Views, subscribers, comments, shares, all driving dopamine from every direction. The baseline is elevated. And the very capacity that elevated baseline impairs is the ability to notice change, specifically, to notice that you are changing. Your content is shifting. Your tone is shifting. Your positions are drifting toward whatever the audience rewards. But the detection system that would normally flag this, the internal alarm that says something is different here, is drowned out by the reward signal.

The dopamine does not just feel good. It makes you blind to your own drift.

The Man Who Ate Himself

Nicholas Perry was a classically trained violinist. Soft-spoken. Thoughtful. Vegan. He wanted to be on Broadway.

In 2014, he started a YouTube channel sharing recipes and playing music. Gentle, wholesome content. Green smoothies and violin pieces. Almost nobody watched.

Then, as a joke, he tried mukbang, a genre originating in South Korea where a creator eats a large quantity of food on camera while talking to the audience. His first mukbang got 50,000 views in two weeks, more than everything he had posted before, combined. The lever had delivered food. Literally, in this case, sushi.

The algorithm's message was clear. Put the violin down. Eat more. Be louder. Be more extreme.

Over eight years, Perry gained over 250 pounds. He developed sleep apnea. He fractured his ribs. He became disabled. He bought a mobility scooter. He filmed himself crying and eating 20,000-calorie meals three times a day across five separate YouTube channels.

Here is a direct quote from him: "They like when I'm sick. They like when I'm upset. And they like when I'm hyper. So I just give them that."

He knew. He understood the mechanism perfectly. He could describe it in plain English. And he still could not stop. Because knowing you are in a Skinner box does not get you out of the Skinner box. The reinforcement schedule does not care whether you understand it. It works regardless of intelligence, regardless of insight, regardless of self-awareness. That is what made Skinner himself afraid of his own research.

The Loop That Closes

The conditioning of creators and the conditioning of audiences are not separate processes. They are a single loop, and it closes like this.

The algorithm rewards certain content. The creator makes more of that content. The dopamine reward makes the creator less able to notice they are changing. The audience reinforces the direction because that is what they are served, and the engagement metrics confirm it. The creator's beliefs reshape to match their behaviour. They become more extreme, more polarised, more dependent on the algorithm's feedback, but at no point does anyone involved feel manipulated.

It all feels like choice. It feels like authenticity. It feels like saying what you believe.

Which is, if you think about it, exactly what a perfectly designed conditioning system would feel like from the inside.

This is the part that should concern anyone who cares about public discourse, democratic deliberation, or the basic question of whether people are capable of forming their own views in an environment engineered to form views for them. The system does not need to tell you what to think. It just needs to reward certain patterns of thought and let the dopamine do the rest. The beliefs follow the behaviour. The identity follows the content. And the person at the centre of it, the creator or the viewer, experiences the entire process as self-expression.

The Silence That Protects the System

There is a comedian in London named Barry who recently made a video about exactly this. He described checking his analytics obsessively, thinking in terms of what would get clicks rather than what was true, feeling the pull of a growth spike toward producing more of whatever the algorithm had rewarded. He was transparent about it in a way that most creators are not, and the transparency itself was the point.

Because the studies he cited are not obscure. Keltner's research showing that the experience of power damages the brain's capacity for empathy. Kraus, Piff, Mendoza-Denton, and Keltner's work demonstrating that wealth produces solipsism, a first-person-only view of the world, not from malice but from an environment that stops requiring you to consider others. Wilkinson and Pickett's The Spirit Level, showing that inequality degrades outcomes for everyone, including the wealthy, across virtually every measure of health, crime, trust, and social mobility. Piketty's demonstration in Capital in the Twenty-First Century that when returns on capital exceed economic growth, wealth concentrates not because of individual greed but because of mathematics.

None of this is fringe. None of it is partisan. It is arithmetic and neuroscience, published in peer-reviewed journals, replicated across studies, available to anyone who looks.

And yet the algorithmic environment systematically selects against this kind of content. Nuance does not drive engagement. Complexity does not maximise watch time. What drives engagement is heat, not light. What maximises watch time is content that triggers an emotional response strong enough to override the impulse to close the app. The algorithm is not biased toward the left or the right. It is biased toward extremity, because extremity is what keeps the rat pressing the lever.

The result is an information environment where the most important findings in psychology, economics, and neuroscience, findings that explain the very system people are trapped inside, are structurally disadvantaged against content designed to provoke.

Staying silent about provable, peer-reviewed, repeatable facts because you are afraid of the algorithm, afraid of the comments, afraid of being called political, is itself a political act. Silence protects the system. Every time.

The Freedom That Isn't

Skinner gave an interview late in his life that has aged with uncomfortable precision. He said that what he called "the literature of freedom" had aggrandised the individual. You are free. Your values are the only values. Anyone who wants to change them is your enemy.

He thought this was dangerous, not because freedom is bad, but because the version of freedom being sold was a lie. A culture, he argued, brings a person under the control of the remoter consequences of their behaviour, what will happen ten years from now because of what you do today. Reject the culture, insist that you are the sole authority on what you do, and you lose contact with the future.

The social media environment has taken Skinner's warning and turned it into a product. It tells every user: you are free. You choose what to watch. You choose what to post. You choose what to believe. The algorithm merely reflects your preferences back to you. You are in control.

You are not in control. You are in a box with a lever, and the lever is shaped like a screen, and the food comes at random intervals calibrated by people whose only job is to make sure you keep pressing. The freedom is real in the same way that the rat's freedom to stop pressing is real. Technically available. Neurochemically implausible.

And the people who built the box know this. They have read Skinner. They have read Tamir and Mitchell. They have read Reed. They employ people who have spent their careers studying variable ratio reinforcement, dopamine pathways, and compulsive behaviour. The design is not accidental. The addictiveness is not a side effect. It is the product. Engagement is the product. Your attention, measured in seconds and sold to advertisers, is the product. Everything else, the content, the creators, the viewers, the discourse, is the lever.

What the Rat Cannot Do

The difference between a rat and a human is not immunity to conditioning. There is no such immunity. The reinforcement schedule works on anyone, regardless of intelligence, regardless of education, regardless of whether you have read this article or watched Barry's video or studied Skinner yourself.

The difference, the only meaningful difference, is the ability to name the mechanism while you are inside it. To say: I can feel this working on me, and I am going to act on something other than the reward signal anyway. Not because the feeling goes away. It does not. But because you have decided, consciously, what you are pressing the lever for, and it is not the food.

That is not a solution. It is a starting position. The solution would involve redesigning the box, regulating the companies that build them, and treating the deliberate exploitation of variable ratio reinforcement as what it is: a public health issue on the same level as gambling, tobacco, or any other industry that profits from addictive design while insisting its customers are making free choices.

But the companies that build the boxes also control the information environment in which regulation is discussed. The politicians who might regulate them are themselves inside the boxes, optimising their own content for engagement, pressing their own levers, drifting toward whatever the algorithm rewards. And the public whose support would be needed for regulation is scrolling, endlessly, looking for the next hit, in a system designed to make sure they never stop long enough to ask who built the box and why.

Skinner was afraid of what he found in 1938. He should have been. The rat is still pressing. The lever still works. And the people who own the box are doing very well.

This article draws on the work of B.F. Skinner (schedules of reinforcement, 1938), Diana Tamir and Jason Mitchell (self-disclosure and neural reward, PNAS, 2012), Phil Reed (dopamine and change detection, Swansea University), Dacher Keltner (power and empathy), Paul Piff et al. (wealth and social cognition), Richard Wilkinson and Kate Pickett (The Spirit Level, 2009), and Thomas Piketty (Capital in the Twenty-First Century, 2013). Video inspiration: Barry, Angel Comedy.

Overwatch Report is an independent publication. We have no financial positions in any entity mentioned.

Discuss this piece Discussion guidelines