We believe healthcare AI should be verifiable. We publish our benchmarks, open-source our evaluation code, and release models under permissive licences.
42 speech-to-text models ranked on medical conversations using Medical Word Error Rate.
A safety-first SOAP benchmark measuring hallucinations, evidence grounding, and clinical coverage.
An open 3B clinical model for structured SOAP notes, released under the MIT licence.