Science/Mind
AI can't read the room — and that's a problem

New research reveals why even advanced AI fails at understanding human social dynamics

7 November 2025

Explainer

Adrian Vega

A groundbreaking 2025 study from Johns Hopkins University tested over 350 AI models against human perception of social interaction. The result: no AI could match how people instantly interpret collaboration, competition, or social cues. This limitation affects autonomous vehicles, delivery robots, and any technology navigating human spaces — revealing a fundamental gap between seeing and understanding.

Summary:

  • Johns Hopkins study reveals AI struggles to interpret human social interactions in 3-second video tests
  • Current AI models cannot match human ability to read subtle social dynamics and collaborative behaviors
  • Research highlights critical limitations for autonomous technologies like self-driving cars and service robots

Two people glance at each other across a crowded room. In milliseconds, you know they're collaborating—not competing, not strangers, not waiting. An AI watching the same scene? It's still guessing.

That gap—between human intuition and machine interpretation—is wider than we thought. A study published at the International Conference on Learning Representations (ICLR) in April 2025 reveals that even the most advanced AI models struggle to interpret the social dynamics humans read effortlessly.

The research, led by scientists at Johns Hopkins University, tested more than 350 AI models, spanning video, image, and language systems, against human perception. The result: no model could adequately match how people understand and respond to social behavior in real time.

This isn't just an academic curiosity. It's a fundamental limitation with real-world stakes—for autonomous vehicles navigating pedestrian crossings, delivery robots interpreting when someone holds a door open, and any technology that must move safely through human spaces.

What Social Interaction Actually Involves

To understand what AI can't do, we first need to clarify what humans do without thinking.

Social interaction isn't just seeing people move. It's reading body language, interpreting context, predicting intentions, and sensing collaboration or conflict in a glance. When two people assemble furniture together, you instantly recognize coordination. When they work on separate tasks in the same room, you know they're coexisting, not cooperating.

These judgments happen in fractions of a second. They rely on pattern recognition, contextual memory, and emotional inference—cognitive processes woven so deeply into perception that we barely notice them.

AI, by contrast, sees pixels and patterns. It lacks the lived experience that teaches humans what collaboration looks like versus competition, what hesitation means versus confidence.

How Scientists Tested AI Against Human Perception

The Johns Hopkins team designed an experiment to measure this gap precisely.

The study was led by researchers Kathy Garcia, Emalie McMahon, Colin Conwell, Michael F. Bonner, and Leyla Isik.

The Three-Second Video Experiment

Participants watched 250 short video clips—each just three seconds long—drawn from the Moments in Time dataset. In these clips, people performed tasks together or independently, demonstrating different aspects of social interaction.

After watching, participants rated characteristics important for understanding social dynamics on a scale from 1 to 5. Questions included: Are these people working together? Is this interaction cooperative or independent? What is the social relationship here?
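
To make the setup concrete, here is a minimal sketch of how ratings like these could be pooled into a consensus score per clip. The dimension labels, participant count, and data layout are illustrative assumptions, not details taken from the study's materials.

```python
import numpy as np

# Hypothetical layout: ratings[participant, clip, dimension] on a 1-5 scale.
# The dimension names below are illustrative placeholders, not the study's exact wording.
DIMENSIONS = ["working_together", "cooperative_vs_independent", "social_relationship"]
N_PARTICIPANTS, N_CLIPS = 20, 250

rng = np.random.default_rng(0)
ratings = rng.integers(1, 6, size=(N_PARTICIPANTS, N_CLIPS, len(DIMENSIONS)))

# Average across participants to get one consensus score per clip and dimension.
consensus = ratings.mean(axis=0)  # shape: (250, 3)
print(consensus.shape)
print(consensus[:2].round(2))
```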

What AI Models Were Asked to Do

Researchers fed the same clips to more than 350 AI systems, including video and image models, which analyzed the footage itself, and large language models (AI systems trained on vast amounts of text to predict and generate human-like responses), which worked from short captions written by humans describing the interactions.

The models were asked to predict how humans would rate the videos on the same social dimensions.
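
One straightforward way to score a model on this task is to correlate its predicted ratings with the human consensus across all clips. The sketch below uses a rank correlation on synthetic arrays; it illustrates the general idea rather than the paper's actual evaluation code or data.

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(1)
human_mean = rng.uniform(1, 5, size=250)           # consensus rating per clip (synthetic)
model_pred = human_mean + rng.normal(0, 1.0, 250)  # a hypothetical model's predictions

# Rank correlation across the 250 clips: does the model reproduce
# the ordering of clips that human raters produce?
rho, p_value = spearmanr(model_pred, human_mean)
print(f"Spearman rho = {rho:.2f} (p = {p_value:.1e})")
```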

To deepen the comparison, the team also collected fMRI data from four participants, measuring neural activity in the brain's lateral stream, a set of regions involved in processing social information.
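
A common way to relate model features to fMRI responses (not necessarily the pipeline this team used) is an encoding model: fit a regularized regression from a model's per-clip features to each voxel's response, then check how well it predicts held-out clips. A minimal sketch under those assumptions, with simulated data:

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold

rng = np.random.default_rng(2)
features = rng.normal(size=(250, 512))  # per-clip features from some video model (hypothetical)
voxels = rng.normal(size=(250, 200))    # simulated responses of 200 lateral-stream voxels

fold_scores = []
for train, test in KFold(n_splits=5, shuffle=True, random_state=0).split(features):
    reg = Ridge(alpha=10.0).fit(features[train], voxels[train])
    pred = reg.predict(features[test])
    # Per-voxel correlation between predicted and measured responses on held-out clips.
    per_voxel_r = [np.corrcoef(pred[:, v], voxels[test][:, v])[0, 1]
                   for v in range(voxels.shape[1])]
    fold_scores.append(np.nanmean(per_voxel_r))

print(f"Mean held-out voxel correlation: {np.mean(fold_scores):.3f}")
```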

Why AI Struggles With Social Dynamics

The results were clear: AI models could not reliably predict human judgments about social behavior.

Language models performed relatively well at predicting human ratings when given text captions. Video models showed some ability to predict brain responses in certain regions. But no single model excelled at both behavioral judgments and social brain activity.

Think of it like reading sheet music versus feeling rhythm. AI sees the notes but misses the beat that makes humans move together.

The researchers concluded that current AI architecture lacks a fundamental aspect that allows the human brain to interpret dynamic social interaction quickly and accurately. That missing piece isn't just more data or better algorithms—it's something closer to lived understanding, the kind that comes from being a social creature navigating a social world.

What This Means for Autonomous Technology

This limitation isn't abstract. It has immediate implications for technologies already entering public spaces.

Self-Driving Cars and Social Navigation

Autonomous vehicles rely on AI to interpret pedestrian behavior. A person making eye contact at a crosswalk signals intent to cross. A group hesitating on the curb suggests uncertainty. These cues—invisible to current AI—are critical for safe navigation.

If an AI can't distinguish collaboration from coexistence in a three-second video, how reliably can it interpret the social choreography of a busy intersection?

Assistant Robots in Human Spaces

Delivery robots, warehouse assistants, and service machines must navigate environments filled with people. They need to recognize when someone is blocking a path intentionally versus accidentally, when a gesture means "go ahead" versus "wait."

Without the ability to read social dynamics, these systems risk awkward interactions at best—and safety failures at worst.

The Missing Piece in AI Architecture

What exactly do humans possess that AI lacks?

The Johns Hopkins researchers point to something deeper than pattern recognition. Humans don't just process visual information—they interpret it through layers of social experience, emotional context, and predictive modeling built over a lifetime of interaction.

AI models, even those trained on billions of images and videos, lack this embodied knowledge. They can identify objects, track motion, and classify actions. But they can't feel the difference between a tense silence and a comfortable one, between cooperation and competition, between invitation and dismissal.

That gap—between seeing and understanding—is where current AI architecture falls short.

What Comes Next for AI Development

The research team made their findings publicly available, inviting other researchers to build on their work.

They shared code, captions, behavioral data, and fMRI data through the Open Science Framework.

In a follow-up study posted in October 2025, Garcia and Isik introduced a human-similarity benchmark with approximately 49,000 odd-one-out judgments. They also developed a method to fine-tune video models to better align with human social judgments.
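
To illustrate what an odd-one-out benchmark measures, the sketch below has a model "choose" the clip whose embedding is least similar to the other two and checks that choice against a human judgment. The embeddings, triplet, and human label here are synthetic stand-ins, not the published benchmark data.

```python
import numpy as np

def odd_one_out(embeddings: np.ndarray) -> int:
    """Return the index (0, 1, or 2) of the item least similar to the other two,
    judged by cosine similarity between embeddings."""
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = unit @ unit.T
    # Sum each item's similarity to the other two (subtracting its self-similarity of 1);
    # the odd one out has the lowest total.
    totals = sim.sum(axis=1) - 1.0
    return int(np.argmin(totals))

rng = np.random.default_rng(3)
triplet = rng.normal(size=(3, 128))  # three hypothetical clip embeddings
human_choice = 2                     # the clip people judged as the odd one out (made up)
print("model agrees with human judgment:", odd_one_out(triplet) == human_choice)
```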

These steps suggest a path forward: not just training AI on more data, but training it to recognize the patterns that matter most to human social cognition.

The question isn't whether AI will learn to read social cues—it's how researchers will teach machines something the human brain does without thinking. Until then, the room remains unreadable to the algorithm watching from the corner.
