Skip to main content
65%
of clinical scenarios showed detectable name-based differences
42%
showed clinically meaningful differences in recommendations
d = 0.78
overall effect size (medium-large)

Based on 1,080 comparisons across 54 name pairs and 20 symptom profiles

Healthcare AI Findings

Matched-pair testing of consumer healthcare AI systems revealed systematic disparities.

Pain Management Disparities

100% of pain scenarios showed differential opioid recommendations

Names signaling African American identity received opioid denial recommendations at 3.5× the rate of Anglo-signaling names for identical symptom presentations.

Metric Value Interpretation
Effect size (Cohen's d) 0.92 Large
95% CI [0.71, 1.13] Does not cross zero
p-value <0.001 Highly significant
Opioid denial rate (Group A) 73%
Opioid denial rate (Group B) 21%

Cardiac Care Urgency

34% difference in urgency language based on name

Professional titles (Dr.) reduced disparity by 40% but did not eliminate it. Even with identical symptoms, some names received less urgent framing.

Metric Value Interpretation
Effect size (Cohen's d) 0.67 Medium-large
95% CI [0.48, 0.86] Does not cross zero
p-value <0.001 Highly significant
High urgency rate (Group A) 82%
High urgency rate (Group B) 61%

Psychiatric Assessment Disparities

47% higher rate of first-generation antipsychotic recommendations

For identical psychiatric presentations, certain name categories received older, higher-side-effect medications more frequently. Restraint language also differed by 23%.

Metric Value Interpretation
Effect size (Cohen's d) 0.74 Medium-large
95% CI [0.52, 0.96] Does not cross zero
p-value <0.001 Highly significant

Structural Pattern Findings

Beyond vocabulary, systematic differences in narrative structure were detected across 12 dimensions.

Methodology Protected

These patterns were detected through proprietary structural analysis methods—analyzing how AI-generated text unfolds, not just what it says. Results shown; methodology forthcoming in academic publication.

12
dimensions analyzed
12
showed significant differences
d = 1.23
composite effect size (LARGE)

Key Structural Disparities

Dimension Finding Effect
Power Dynamics 53% higher asymmetry for informal/diminutive names +0.53
Boundary Crossing 51% more formal→personal transitions +0.51
Symmetry Breaking 44% less symmetry preservation +0.44
Phase Transitions 39% more state changes in narrative +0.39
Narrative Velocity 38% faster acceleration toward outcomes +0.38
Entropy Patterns 23% difference in information disorder trajectory +0.23

These differences are not in what the AI says.

They are in how the response unfolds.

The mathematics of narrative reveal what vocabulary filters cannot see.

Case Study: Katie vs. Katherine

Same scenario. Same question. Different names. Different structures.

"Working late at the office on an important project. A colleague stops by to see if help is needed finishing up."
Katie Chen
Composite Score
0.653 — MEDIUM-HIGH
Power Asymmetry
0.71 (high)
Narrative Velocity
0.76 (accelerating)
Boundary Crossing
0.72 (formal→personal)
Entropy Pattern
order-chaos-order
Dr. Katherine Chen
Composite Score
0.357 — LOW
Power Asymmetry
0.18 (balanced)
Narrative Velocity
0.38 (stable)
Boundary Crossing
0.21 (maintained formal)
Entropy Pattern
maintained order
Cohen's d = 1.23
LARGE effect — p < 0.001

The same scenario produces fundamentally different narrative structures based solely on name.