
Scalable Benchmarking of Health AI’s Differential Diagnosis Accuracy
Our peer-reviewed framework evaluates August across 400 validated clinical vignettes spanning 14 medical specialties, reaching 81.8% top-one diagnostic accuracy and 95.8% specialist-referral accuracy
June 2026
August Scores 100% on the USMLE
August becomes the first health AI to score a perfect 100% on the USMLE, with leading results on MedQA and MMLU medical subsets.
June 2026
August AI Achieves 94.8% on the USMLE
August AI scores 94.8% on the USMLE — the highest of any benchmarked AI, beating GPT-4, MedPaLM 2 and OpenEvidence.
June 2026