Robust Affordable Speech Recognition
Recognition Accuracy
The Vestec Speech Engine delivers among the highest recognition accuracy in the industry. It is well suited for a wide variety of keywords, names, digits, numbers, dates, and yes/no speech grammars. In addition, the engine has been designed to operate in different audio channels – including, VoIP, cellular, and landline – in both “noisy” as well as “clean” environments.
Generally speaking, without grammar “tuning”, the speech engine delivers recognition accuracy in the 90% range for native speakers and in the 80% range for non-native speakers. Grammar “tuning” by the application developer typically improves recognition accuracy by 5-10%.
Accuracy Study
Vestec regularly benchmarks the recognition accuracy of its speech engine against leading competitive offerings and has consistently demonstrated superior performance. A recent accuracy study involved comparative tests with third-party data consisting of audio recordings of the five hundred most commonly spoken words in English. The recordings represented a diverse group of male and female voices, across different age groups, in a variety of channel environments. For both native and non-native speakers, Vestec speech engine outperformed a leading brand-name competitor by nearly 3% points in recognition accuracy.| RECOGNITION ACCURACY | Vestec Speech Engine | Improvement over Leading Competitive Engine |
|---|---|---|
| Keywords - Native Speakers | In the 90% range | +3.2% |
| Keywords - Non-native Speakers | In the 80% range | +3.0% |
NOTE: The above results are for "untuned" speech grammars using default engine settings. Grammar "tuning" typically results in an increase in recognition accuracy.