AI can't read the room — and that's a problem

New research reveals why even advanced AI fails at understanding human social dynamics

7 November 2025

Explainer

Adrian Vega

A groundbreaking 2025 study from Johns Hopkins University tested over 350 AI models against human perception of social interaction. The result: no AI could match how people instantly interpret collaboration, competition, or social cues. This limitation affects autonomous vehicles, delivery robots, and any technology navigating human spaces — revealing a fundamental gap between seeing and understanding.


Summary:

  • Johns Hopkins study reveals AI struggles to interpret human social interactions in 3-second video tests
  • Current AI models cannot match human ability to read subtle social dynamics and collaborative behaviors
  • Research highlights critical limitations for autonomous technologies like self-driving cars and service robots

Two people glance at each other across a crowded room. In milliseconds, you know they're collaborating—not competing, not strangers, not waiting. An AI watching the same scene? It's still guessing.

That gap—between human intuition and machine interpretation—is wider than we thought. A study published at the International Conference on Learning Representations (ICLR) in April 2025 reveals that even the most advanced AI models struggle to interpret the social dynamics humans read effortlessly.

The research, led by scientists at Johns Hopkins University, tested over 350 large language models and generative AI systems against human perception. The result: no AI model could adequately match how people understand and respond to social behavior in real time.

This isn't just an academic curiosity. It's a fundamental limitation with real-world stakes—for autonomous vehicles navigating pedestrian crossings, delivery robots interpreting when someone holds a door open, and any technology that must move safely through human spaces.

What Social Interaction Actually Involves

Before we understand what AI can't do, we need to clarify what humans do without thinking.

Social interaction isn't just seeing people move. It's reading body language, interpreting context, predicting intentions, and sensing collaboration or conflict in a glance. When two people assemble furniture together, you instantly recognize coordination. When they work on separate tasks in the same room, you know they're coexisting, not cooperating.

These judgments happen in fractions of a second. They rely on pattern recognition, contextual memory, and emotional inference—cognitive processes woven so deeply into perception that we barely notice them.

AI, by contrast, sees pixels and patterns. It lacks the lived experience that teaches humans what collaboration looks like versus competition, what hesitation means versus confidence.

How Scientists Tested AI Against Human Perception

The Johns Hopkins team designed an experiment to measure this gap precisely.

Researchers Kathy Garcia, Emalie McMahon, Colin Conwell, Michael F. Bonner, and Leyla Isik led the study.

The Three-Second Video Experiment

Participants watched 250 short video clips—each just three seconds long—drawn from the Moments in Time dataset. In these clips, people performed tasks together or independently, demonstrating different aspects of social interaction.

After watching, participants rated characteristics important for understanding social dynamics on a scale from 1 to 5. Questions included: Are these people working together? Is this interaction cooperative or independent? What is the social relationship here?

What AI Models Were Asked to Do

Researchers fed the same videos to over 350 AI systems—including large language models (AI systems trained on vast text to predict and generate human-like responses) and generative AI models (systems that create new content based on patterns).

The models were asked to predict how humans would rate the videos. Additionally, language models evaluated short captions written by humans describing the social interactions.
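Evaluations of this kind typically come down to correlating a model's predicted ratings with the average human rating per clip. The following is a minimal sketch of that comparison, using invented numbers rather than the study's actual data:

```python
import numpy as np

# Invented example: 5 videos, each rated 1-5 by three participants
# on a question like "Are these people working together?"
human_ratings = np.array([
    [4, 5, 4],
    [1, 2, 1],
    [5, 5, 4],
    [2, 3, 2],
    [3, 3, 4],
])
human_mean = human_ratings.mean(axis=1)  # average human rating per video

# Hypothetical model predictions on the same 1-5 scale
model_pred = np.array([4.2, 1.5, 4.8, 2.9, 2.7])

# Pearson correlation between model predictions and mean human ratings
r = np.corrcoef(model_pred, human_mean)[0, 1]
print(f"model-human correlation: r = {r:.2f}")
```

Pearson's r is one common scoring choice for this kind of benchmark; the study's own analysis pipeline may differ, and its real data and code are shared openly (see below).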

To deepen the comparison, the team also collected fMRI brain response data from four participants, measuring neural activity in regions associated with social cognition—specifically, lateral-stream brain responses, which process social information.

Why AI Struggles With Social Dynamics

The results were clear: AI models could not reliably predict human judgments about social behavior.

Language models performed relatively well at predicting human ratings when given text captions. Video models showed some ability to predict brain responses in certain regions. But no single model excelled at both behavioral judgments and social brain activity.

Think of it like reading sheet music versus feeling rhythm. AI sees the notes but misses the beat that makes humans move together.

The researchers concluded that current AI architecture lacks a fundamental aspect that allows the human brain to interpret dynamic social interaction quickly and accurately. That missing piece isn't just more data or better algorithms—it's something closer to lived understanding, the kind that comes from being a social creature navigating a social world.

What This Means for Autonomous Technology

This limitation isn't abstract. It has immediate implications for technologies already entering public spaces.

Self-Driving Cars and Social Navigation

Autonomous vehicles rely on AI to interpret pedestrian behavior. A person making eye contact at a crosswalk signals intent to cross. A group hesitating on the curb suggests uncertainty. These cues, which current AI systems struggle to read, are critical for safe navigation.

If an AI can't distinguish collaboration from coexistence in a three-second video, how reliably can it interpret the social choreography of a busy intersection?

Assistant Robots in Human Spaces

Delivery robots, warehouse assistants, and service machines must navigate environments filled with people. They need to recognize when someone is blocking a path intentionally versus accidentally, when a gesture means "go ahead" versus "wait."

Without the ability to read social dynamics, these systems risk awkward interactions at best—and safety failures at worst.

The Missing Piece in AI Architecture

What exactly do humans possess that AI lacks?

The Johns Hopkins researchers point to something deeper than pattern recognition. Humans don't just process visual information—they interpret it through layers of social experience, emotional context, and predictive modeling built over a lifetime of interaction.

AI models, even those trained on billions of images and videos, lack this embodied knowledge. They can identify objects, track motion, and classify actions. But they can't feel the difference between a tense silence and a comfortable one, between cooperation and competition, between invitation and dismissal.

That gap—between seeing and understanding—is where current AI architecture falls short.

What Comes Next for AI Development

The research team made their findings publicly available, inviting other researchers to build on their work.

They shared code, captions, behavioral data, and fMRI data through the Open Science Framework.

In a follow-up study posted in October 2025, Garcia and Isik introduced a human-similarity benchmark with approximately 49,000 odd-one-out judgments. They also developed a method to fine-tune video models to better align with human social judgments.
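An odd-one-out judgment asks which of three clips is least like the other two. One standard way to model such judgments (an illustrative sketch, not the follow-up study's actual code, and with invented embeddings) is to pick the item with the lowest total similarity to the rest:

```python
import numpy as np

def odd_one_out(embeddings: np.ndarray) -> int:
    """Given 3 item embeddings (one per row), return the index of the
    item least similar to the other two, using cosine similarity."""
    # Normalize rows so dot products become cosine similarities
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = unit @ unit.T                # 3x3 pairwise similarity matrix
    np.fill_diagonal(sim, 0.0)        # ignore self-similarity
    totals = sim.sum(axis=1)          # each item's similarity to the others
    return int(np.argmin(totals))     # odd one out = least similar overall

# Invented embeddings: two "cooperative" clips plus one "independent" clip
clips = np.array([
    [0.9, 0.1, 0.0],   # clip A: cooperative
    [0.8, 0.2, 0.1],   # clip B: cooperative
    [0.0, 0.1, 0.9],   # clip C: independent
])
print(odd_one_out(clips))  # index 2: clip C is the odd one out
```

Comparing a model's odd-one-out picks against roughly 49,000 human picks gives a direct behavioral measure of how human-like the model's similarity space is.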

These steps suggest a path forward: not just training AI on more data, but training it to recognize the patterns that matter most to human social cognition.

The question isn't whether AI will learn to read social cues—it's how researchers will teach machines something the human brain does without thinking. Until then, the room remains unreadable to the algorithm watching from the corner.

What is this about?

  • Explainer
  • Adrian Vega
  • Science
  • Mind
