General-Purpose Beats Purpose-Built: a Nature Medicine head-to-head on clinical AI
GPT-5.2, Gemini 3.1 and Claude outscored OpenEvidence and UpToDate's own AI on medical knowledge, clinician alignment, and real physician queries. On live clinical questions the specialist tools barely matched a free Google AI summary. A note on why the medical label may be selling certainty, not accuracy.