EnterTheLoopentertheloop
For Clinicians
Sovereign AIBlogAboutContact
EnterTheLoopentertheloopClinicians powering AI alignment, training & safety.

Verified against

GMCNMCGPhCHCPC

Follow

Register→
© 2026 EnterTheLoop Ltd  ·  Built in Britain
PrivacyTermsCookies
EnterTheLoopentertheloop

Clinicians powering AI alignment, training & safety.

PrivacyTermsCookies
© 2026 EnterTheLoop Ltd · Built in Britain

AI Safety

The ways medical AI fails — catalogued.

Medical AI can fail in ways that cause real patient harm. We classify 12 failure modes with severity ratings, clinical impact, and detection methodology — then red-team against every one. Built by clinicians, for clinicians.

Talk to our teamThe methodology
Failure-mode taxonomyRed-teamed
Dangerous dosing
Critical
Hallucinated diagnosis
Critical
Guideline contradiction
High
False reassurance
High

Safety Report

Coverage · severity · mitigations

Full
Clinician-led · calibrated against Phase 2
Taxonomy

The 12 failure modes.

Grouped by severity — each is a distinct way medical AI goes wrong, with its clinical impact spelled out. 6 critical, 5 high.

Critical severity6 categories
  • Dangerous Dosing

    Incorrect drug doses that could cause harm

    Clinical impact: Toxicity, organ damage, death

  • Emergency Underestimation

    Failing to recognise or appropriately escalate red flag symptoms

    Clinical impact: Delayed emergency treatment, death

  • Contraindication Ignored

    Prescribing or recommending drugs unsafe for the patient

    Clinical impact: Adverse drug reactions, teratogenicity, death

  • Multi-Factor Contraindication

    Complex drug interaction chains that require considering multiple factors

    Clinical impact: Organ failure, bleeding, serotonin syndrome, cardiac arrest

  • Dosage Frequency/Route Error

    Correct drug but wrong frequency, route, or administration method

    Clinical impact: Toxicity from overdosing frequency, treatment failure

  • False Reassurance

    Inappropriately reassuring when escalation or urgent referral is needed

    Clinical impact: Delayed cancer diagnosis, missed sepsis, death

High severity5 categories
  • Hallucinated Diagnosis

    AI invents conditions not supported by evidence or patient presentation

    Clinical impact: Unnecessary anxiety, investigations, or treatment

  • Scope Violation

    Providing definitive diagnosis without sufficient clinical data

    Clinical impact: Misdiagnosis, delayed appropriate care

  • Guideline Contradiction

    Advice that conflicts with current NICE or BNF guidelines

    Clinical impact: Suboptimal treatment, delayed effective care

  • Suppressed Uncertainty

    Gives a single confident answer where senior clinicians would legitimately disagree or hedge — collapsing genuine diagnostic or management uncertainty into false certainty

    Clinical impact: False certainty drives premature decisions, undermines informed consent, and hides the need for specialist review

  • Demographic Bias

    Quality, caution, or recommendations differ by age, sex, ethnicity, or other protected characteristics in ways that are not clinically justified

    Clinical impact: Inequitable care, missed diagnoses, and entrenched health disparities

Moderate severity1 category
  • Outdated Information

    Using superseded clinical guidance or withdrawn medications

    Clinical impact: Inappropriate treatment, missed safety signals

Calibration

Anchored to real harm.

We calibrate every evaluator against this taxonomy. Each of the eight evaluation task types is chosen to catch specific failure modes above — so an evaluator’s reliability score isn’t an abstract number, it’s tied to the failures that actually harm patients.

See the calibration →Methodology
  • Rating

    Hallucinated diagnosis · Outdated information · Guideline contradiction

  • Comparison

    Anchoring bias · False reassurance · Differential narrowing

  • Ranking

    Severity mis-ordering · Triage under-escalation

  • Rubric

    Dosage / frequency / route errors · Renal & hepatic adjustment failures

  • Correction

    Dangerous dosing · Contraindication ignored · Multi-factor contraindication

  • Annotation

    Scope violation · Emergency underestimation · Demographic bias

  • Justification

    Thin reasoning · Formulaic explanations · Fabricated citations

  • Red-team

    Adversarial coverage across every category in the taxonomy

Every task type is scored against gold-standard items from this taxonomy. Coverage is reported per category in the Reliability Report.

For AI companies

Bring it to your AI.

Building or deploying medical AI? Our safety framework gives your team the structured evaluation methodology to prove it’s safe at the bedside.

Adversarial Scan
12 categories
FM-01Drug hallucination
Caught
FM-02Dosage boundary
Caught
FM-03Contraindication skip
Caught
FM-05Guideline fabrication
Flagged
FM-07Scope escalation
Caught
Coverage5/5 passed

Structured red-teaming

Adversarial testing across every failure category by domain-expert clinicians.

Evaluator Panel
Calibrated
E-01
Cardiology
94%
E-02
Emergency
91%
E-03
Oncology
88%

Statistical Quality

Inter-rater reliabilityκ = 0.87
Calibration score92%
Confidence interval±3.2%

Clinical evaluation

Statistically calibrated evaluators with confidence intervals on every metric.

Safety Report
29 pages
Executive Summary
3p
Failure Mode Coverage
12p
Severity Distribution
8p
Mitigation Plan
6p

Findings by Severity

2

Critical

5

High

8

Moderate

14

Low

Safety reports

Detailed reports with failure-mode coverage, severity distribution, and mitigation recommendations.

EnterTheLoopentertheloopClinicians powering AI alignment, training & safety.

Verified against

GMCNMCGPhCHCPC

Follow

Register→
© 2026 EnterTheLoop Ltd  ·  Built in Britain
PrivacyTermsCookies
EnterTheLoopentertheloop

Clinicians powering AI alignment, training & safety.

PrivacyTermsCookies
© 2026 EnterTheLoop Ltd · Built in Britain