When using Speech-to-Text with LUIS, phrases like "I need a tee shirt" frequently come through as "I need a teacher".
Should these utterances be binned together if I need to increase the catch rate of "tee shirt" and don't need to distinguish from "teacher"?
Short answer is, no, you shouldn't bin them together. Really, that is just a band-aid and more than likely will just lead to other issues down the road.
Instead, there are several steps you can take:
- Remind users to speak clearly and not too fast. This is generally good practice with any speech service.
- For Speech Services, consider adding Custom Speech. Essentially, you provide additional audio to further train the service on what to recognize. This can include providing a Pronunciation Model to help the service distinguish between similar sounding words.
- For LUIS, look at how many example utterances you have trained. Results tend to improve with more utterances. Recommendation is a minimum of five.
- For LUIS, you can consider adding a Phrase List. I think your mileage will vary more here than with the other suggestions as I think the issue is more speech driven that LUIS driven, but it could help and requires little to setup.
Hope of help!