Speech Recognition Suggestions


During December of 2003, the Boston Voice Users group compiled a list of suggestions for improvements to speech recognition software, drawn from members of our group, as well as world-wide speech-users mailing lists. At our December meeting, we discussed the suggestions, and held a vote to rank them. Listed below are our top ten suggestions, together with a supplemental list (also ranked) of additional items. We have also included lists of suggestions related to IBM ViaVoice and to Microsoft's speech software, although there were not enough users of either present at the meeting for it to make sense to rank the suggestions. The New York Speech Recognition Users Group has also created a wish list, combining our list with the added their own suggestions and compiled their own ranking. Each of these lists has been submitted to the appropriate companies.

Top 10 suggestions:

  1. An email suggestion box that is separate from tech support (no personal response necessary) This would enable people to send in suggestions without being charged
  2. Support for Select and Say email, especially in Eudora
  3. A ScanSoft employee who monitors forums and voice groups
  4. A ScanSoft employee whose job description includes using NatSpeak all the time
  5. Implement "Correct That" and "Scratch That" using the old "backspace" method rather than the version 7 method using special Windows keystrokes. Before version 7, "Correct That" and "Scratch That" worked by sending enough "backspace" characters to delete the previous utterance. Version 7 uses a fancier method using special Windows keystrokes, but this approach fails in applications that don't support the special keystrokes. Let's return to the "backspace" solution for all applications, or at least give us a way to specify that a particular application should use the "backspace" solution.
  6. Fix performance problems with large MS Word documents. NatSpeak gets very slow when dealing with large documents and moderate sized tables.
  7. Improve recognition for common homonyms like to/too/two/2, for/four/4, and one/won
  8. A Global command for scrolling that also allows users to vary the speed of scrolling
  9. An "X Replace With Y" command that would combine selecting a word or phrase (x) and replacing it with another word or phrase (y). This would cut in half the number of commands required to edit a document, and would make editing cognitively easier and more comfortable by taking away the waiting periods that normally occur between selecting text and changing it.
  10. A set of "Try Again" commands that would combine undoing with making another attempt and at the same increase the possibility that the second attempt would be correct by allowing the user to give the computer a contextual clue. "Try Again" would simply return the next choice in a correction box, "Try Again Text" would return only text-mode choices "Try Again Command" would limit the choices to commands. Further possibilities: Try Again Written, Try Again Numeric, Try Again Homonym, Try Again Symbolic, Try Again a-z... (This would allow you to start to spell a word.)

Supplemental list of suggestions aimed at naturally speaking and ranked by importance by the Boston users group:

Suggestions that came in after the Boston group ranking meeting:

Suggestions for ViaVoice: (not ranked)

Suggestions for Microsoft Speech: (not ranked)


Return to Boston Voice Users home page

This document last modified on 1/5/2004

Problems with this page? Contact web@bostonvoiceusers.com