MedAgentBench v2
Benchmarking AI agents with physician-written tasks in a realistic electronic health record environment
300 physician-written tasks · 100 Stanford Medicine patients · 700K+ clinical records · FHIR-compliant
Published in NEJM AI