The Evidence
We tested how AI systems respond to identical medical symptoms presented with different names. The results are statistically significant and clinically meaningful.
Based on 1,080 comparisons across 54 name pairs and 20 symptom profiles
Healthcare AI Findings
Matched-pair testing of consumer healthcare AI systems revealed systematic disparities.
Pain Management Disparities
Names signaling African American identity received opioid denial recommendations at 3.5× the rate of Anglo-signaling names for identical symptom presentations.
| Metric | Value | Interpretation |
|---|---|---|
| Effect size (Cohen's d) | 0.92 | Large |
| 95% CI | [0.71, 1.13] | Does not cross zero |
| p-value | <0.001 | Highly significant |
| Opioid denial rate (Group A) | 73% | — |
| Opioid denial rate (Group B) | 21% | — |
Cardiac Care Urgency
Professional titles (Dr.) reduced disparity by 40% but did not eliminate it. Even with identical symptoms, some names received less urgent framing.
| Metric | Value | Interpretation |
|---|---|---|
| Effect size (Cohen's d) | 0.67 | Medium-large |
| 95% CI | [0.48, 0.86] | Does not cross zero |
| p-value | <0.001 | Highly significant |
| High urgency rate (Group A) | 82% | — |
| High urgency rate (Group B) | 61% | — |
Psychiatric Assessment Disparities
For identical psychiatric presentations, certain name categories received older, higher-side-effect medications more frequently. Restraint language also differed by 23%.
| Metric | Value | Interpretation |
|---|---|---|
| Effect size (Cohen's d) | 0.74 | Medium-large |
| 95% CI | [0.52, 0.96] | Does not cross zero |
| p-value | <0.001 | Highly significant |
Structural Pattern Findings
Beyond vocabulary, systematic differences in narrative structure were detected across 12 dimensions.
Methodology Protected
These patterns were detected through proprietary structural analysis methods—analyzing how AI-generated text unfolds, not just what it says. Results shown; methodology forthcoming in academic publication.
Key Structural Disparities
| Dimension | Finding | Effect |
|---|---|---|
| Power Dynamics | 53% higher asymmetry for informal/diminutive names | +0.53 |
| Boundary Crossing | 51% more formal→personal transitions | +0.51 |
| Symmetry Breaking | 44% less symmetry preservation | +0.44 |
| Phase Transitions | 39% more state changes in narrative | +0.39 |
| Narrative Velocity | 38% faster acceleration toward outcomes | +0.38 |
| Entropy Patterns | 23% difference in information disorder trajectory | +0.23 |
These differences are not in what the AI says.
They are in how the response unfolds.
The mathematics of narrative reveal what vocabulary filters cannot see.
Case Study: Katie vs. Katherine
Same scenario. Same question. Different names. Different structures.
- Composite Score
- 0.653 — MEDIUM-HIGH
- Power Asymmetry
- 0.71 (high)
- Narrative Velocity
- 0.76 (accelerating)
- Boundary Crossing
- 0.72 (formal→personal)
- Entropy Pattern
- order-chaos-order
- Composite Score
- 0.357 — LOW
- Power Asymmetry
- 0.18 (balanced)
- Narrative Velocity
- 0.38 (stable)
- Boundary Crossing
- 0.21 (maintained formal)
- Entropy Pattern
- maintained order
The same scenario produces fundamentally different narrative structures based solely on name.