Contextual Clues Can Help Improve Alexa’s Speech Recognizers

Automatic speech recognition systems, which convert spoken words into text, are an important component of conversational agents such as Alexa. These systems generally comprise an acoustic model, a pronunciation model, and a statistical language model. The role of the statistical language model is to assign a probability to the next word in a sentence, given the words that precede it. For instance, the phrases "Pulitzer Prize" and "pullet surprise" may have very similar acoustic profiles, but statistically, the first is far more likely to conclude a question that begins "Alexa, what playwright just won a … ?"
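To make the idea of "a probability for the next word, given the previous ones" concrete, here is a minimal sketch of a bigram language model with add-one smoothing. It is an illustration only, not the model Alexa uses: the toy corpus, the function names (`train_bigrams`, `next_word_prob`, `phrase_prob`), and the smoothing constant `alpha` are all assumptions made for this example, and a bigram model conditions on just one preceding word rather than the whole sentence.

```python
from collections import Counter, defaultdict

# Toy corpus, invented purely for illustration (not real training data).
CORPUS = [
    "what playwright just won a pulitzer prize",
    "who won a pulitzer prize this year",
    "the author won a pulitzer prize",
    "the farmer got a pullet surprise",
]


def train_bigrams(sentences):
    """Count how often each word follows each preceding word."""
    counts = defaultdict(Counter)
    vocab = set()
    for sentence in sentences:
        words = sentence.split()
        vocab.update(words)
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts, vocab


def next_word_prob(counts, vocab, prev, word, alpha=1.0):
    """Estimate P(word | prev) with add-alpha smoothing (alpha is an assumption)."""
    total = sum(counts[prev].values())
    return (counts[prev][word] + alpha) / (total + alpha * len(vocab))


def phrase_prob(counts, vocab, context_word, phrase):
    """Multiply next-word probabilities across a phrase, starting from one context word."""
    prob = 1.0
    prev = context_word
    for word in phrase.split():
        prob *= next_word_prob(counts, vocab, prev, word)
        prev = word
    return prob


counts, vocab = train_bigrams(CORPUS)

# Score the two acoustically similar endings after the context word "a".
p_prize = phrase_prob(counts, vocab, "a", "pulitzer prize")
p_surprise = phrase_prob(counts, vocab, "a", "pullet surprise")

print(f"P('pulitzer prize' | 'a')  = {p_prize:.4f}")
print(f"P('pullet surprise' | 'a') = {p_surprise:.4f}")
```

On this toy corpus the model assigns a higher probability to "pulitzer prize" than to "pullet surprise" after "a", simply because the first sequence was seen more often. A recognizer can combine a score like this with the acoustic evidence to choose between hypotheses that sound alike.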