Fuzzy EmulateRecognize on Windows Speech Recognition

2018-06-23 00:33:05

Microsoft C# API provide a SpeechRecognitionEngine to recognize Audio stream. One way to test recogition is to call method SpeechRecognizer.EmulateRecognize

According to documentation:

recognizers ignore case and character width when applying 
grammar rules to the input phrase

I'd like to know if there is a way to handle more fuzzy string because confidence is very low even for mispelled text ! Far from real life...

With Audio I could say Hello, Helo, Helllo with a good confidence

With Text the engine is very strict

EDIT: For what purpose ?

My speech engine is working fine, but I also want to trigger it from text input.

Let's say your on mobile phone and use HTML5 SpeechRecognition. I'd like to send the recognized text to engine to get the same behavior as speech

Ok I found the answer ! I should better read the documentation !

SpeechRecognizer.EmulateRecognize

Is really straightforward and test the given string but

SpeechRecognizer.SimulateRecognize

Will try to build a an 'idealized' audio representation of the input phrase (based on the engine's lexicon and acoustic model)

And so it works very well !

When you send audio to the recognizer, the SR engine does a lot of work to create a set of phonemes (via acoustic modeling) and then a set of strings (via phoneme modeling). During that process, much of the ambiguity gets eliminated. EmulateRecognize doesn't generate audio that gets processed via the SR engine; it skips all the modeling and just does a string match.

There's no way to work around this that doesn't involve a lot of work (eg, implementing a SAPI-compatible SR engine that only does EmulateRecognize ).

在SpeechSynthesizer.Speak（）中输入您的字符串并将其用作SpeechRecognitionEngine的输入？

链接地址: http://www.djcxy.com/p/64562.html

上一篇: System.Speech.Recognition; 背景控制或语音识别

下一篇: 基于Windows语音识别的模糊模拟识别