This post is from the WWDC26 Foundation Models Q&A.
Our users speak proper nouns and domain terms (place names, product jargon) that change frequently. What’s the best practice for improving recognition accuracy: dynamic contextual strings, on-device custom language resources, periodic vocabulary sync, or something else in the current Speech APIs?
I would recommend taking a look at the Speech framework: https://developer.apple.com/documentation/speech
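
As a rough illustration of the "dynamic contextual strings" option mentioned in the question, here is a minimal sketch that passes a fresh vocabulary list to a recognition request through `contextualStrings`, a property of `SFSpeechRecognitionRequest` in the Speech framework. The helper names `makeContextualStrings` and `makeRequest`, the sample terms, and the cap of 100 phrases are assumptions made for this example, not something stated in the answer above; check the documentation for current guidance on list size.

```swift
import Foundation
#if canImport(Speech)
import Speech
#endif

/// Hypothetical helper (not part of the Speech framework): trims whitespace,
/// drops empty entries, removes case-insensitive duplicates, and caps the list.
/// The default limit of 100 is an assumption for this sketch.
func makeContextualStrings(from terms: [String], limit: Int = 100) -> [String] {
    var seen = Set<String>()
    var result: [String] = []
    for term in terms {
        // Stop as soon as the cap is reached (also handles limit <= 0).
        if result.count >= limit { break }
        let trimmed = term.trimmingCharacters(in: .whitespacesAndNewlines)
        guard !trimmed.isEmpty else { continue }
        // Keep the first spelling seen for each case-insensitive key.
        if seen.insert(trimmed.lowercased()).inserted {
            result.append(trimmed)
        }
    }
    return result
}

#if canImport(Speech)
/// Hypothetical helper: builds a buffer-based request and attaches the
/// current vocabulary. Call it again whenever the synced term list changes.
func makeRequest(vocabulary: [String]) -> SFSpeechAudioBufferRecognitionRequest {
    let request = SFSpeechAudioBufferRecognitionRequest()
    request.contextualStrings = makeContextualStrings(from: vocabulary)
    return request
}
#endif
```

Because the strings are set per request, a periodically synced vocabulary could simply be fed into each new request. Whether this is enough on its own, or whether custom language model support in the Speech framework is a better fit, likely depends on how large the vocabulary is and how often it changes.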