Which speech recognition system to use?

I want to implement following idea. I need my user to open web page or mobile application and say something. The speech of user will be recorded end sent to a local server, and I want the server to process it and generate some voice output.

My questions are next:

  • Which Speech Recognition API to use? In my case audio file will be sent to server, and after that it will be processed.

  • I need to recognize names and last names. This is a potential problem, because I don't live in English-speaking country, and names and last names could be rather strange to English-speaker. I want to implement this by adding required names and last names to "dictionary" of speech-recognizing system, so I need one that can expand its dictionary.

  • I need a free decision.

  • So, which Speech Recognizing API should I use in implementation of my idea?


    I have had success in the past using the CMU Sphinx toolkit for speech recognition. I had used a variation of it within in Android application. If I remember correctly it did allow the creation of a dictionary of specific words to look for so it may fit your needs.


    I recommend you have a look at the Speech API from Nuance. They have the best on the market and are available on any platform. We use them for our product LinguSocial, which offers near real-time voice call translation to mobiles & landlines and also machine translated video chats.

    链接地址: http://www.djcxy.com/p/34370.html

    上一篇: 如何使用System.Speech进行编程式多语言语音识别

    下一篇: 使用哪种语音识别系统?