Computers, phones, laptops, tablets, and the software running on them can be very smart when it comes to brain-straining tasks like playing chess or filling out tax returns. They are far better than humans at such work, so people might expect them to be just as good at recognizing faces or understanding speech.
But even after more than five decades of trying to make these
smart devices do such seemingly simple things, most developers and programmers
have fallen short and concluded that just because a human can easily recognize
voices or faces does not mean software can do it as well.
To us humans, understanding speech seems easy. Whenever someone
speaks a word in English, it just pops into our heads as soon as they open
their mouths.
How Does Nuance Dragon Perform Speech Recognition? How Accurate Is It?
This unconscious nature of the understanding process makes it
extremely difficult for computer programmers to mimic.
To understand it better, let’s take an example of something
that computers, mobile phones, and laptops are very good at recognizing and
understanding: touch-tones.
Computers understand the blips and bloops on phone lines far
better than humans do, and there are a few reasons for that.
The touch-tone vocabulary has only twelve (12) “words” in it:
the digits 0 to 9 plus the (#) and (*) keys.
None of these words sound alike. When you tap 1, the tone is
clearly different from the tone that comes out when you tap 7.
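The twelve touch-tone “words” are easy for software because each key is a fixed pair of pure tones. As a minimal sketch (not taken from Dragon or from any phone firmware), the Python below uses the standard DTMF frequency table to generate the tone for a key and then identify it again with the Goertzel algorithm. The 8 kHz sample rate, the 50 ms duration, and all function names are assumptions chosen for illustration.

```python
import math

# Standard DTMF layout: every key is one row tone plus one column tone (in Hz).
ROW_FREQS = (697, 770, 852, 941)
COL_FREQS = (1209, 1336, 1477)
KEYPAD = ("123", "456", "789", "*0#")

SAMPLE_RATE = 8000  # assumed sampling rate, in samples per second


def key_frequencies(key):
    """Return the (row, column) tone pair for a single keypad character."""
    if len(key) == 1:
        for r, row in enumerate(KEYPAD):
            c = row.find(key)
            if c != -1:
                return ROW_FREQS[r], COL_FREQS[c]
    raise ValueError(f"not a touch-tone key: {key!r}")


def synthesize(key, duration=0.05):
    """Generate the two-tone signal for one key press as a list of samples."""
    f_row, f_col = key_frequencies(key)
    n = int(SAMPLE_RATE * duration)
    return [
        math.sin(2 * math.pi * f_row * t / SAMPLE_RATE)
        + math.sin(2 * math.pi * f_col * t / SAMPLE_RATE)
        for t in range(n)
    ]


def goertzel_power(samples, freq):
    """Measure signal power at one frequency with the Goertzel algorithm."""
    coeff = 2 * math.cos(2 * math.pi * freq / SAMPLE_RATE)
    s_prev = s_prev2 = 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def decode(samples):
    """Pick the strongest row tone and the strongest column tone."""
    row = max(range(len(ROW_FREQS)),
              key=lambda i: goertzel_power(samples, ROW_FREQS[i]))
    col = max(range(len(COL_FREQS)),
              key=lambda i: goertzel_power(samples, COL_FREQS[i]))
    return KEYPAD[row][col]


if __name__ == "__main__":
    for key in "17":
        print(key, key_frequencies(key), "decoded as", decode(synthesize(key)))
```

Keys 1 and 7 share the 1209 Hz column tone but differ in the row tone (697 Hz versus 852 Hz), which is why a decoder only has to compare a handful of fixed frequencies. Nothing comparable exists for human speech.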
Every phone also “says” these words the same way. If you press
the “5” button on any phone, you get the same tone every time. Human speakers
are different: a small child and an elderly man can sound very different when
they say the same word, and people living in different countries have
different accents and pronunciations.
Context is also meaningless for touch-tones.
To the phone, 1 is always 1 and 5 is always 5. How a tone is
interpreted does not depend on the number before it or the number after it.
In spoken English, however, context is everything. The phrases “go to New
York”, “go two New York”, and “go too New York” sound identical, and only
context reveals that the first is the one that makes sense.
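To make the role of context concrete, here is a toy sketch of choosing between words that sound alike by looking at their neighbours. It is not a description of how Dragon works internally. The mini-corpus is made up for illustration, and the names `CORPUS`, `bigram_counts`, and `pick_spelling` are hypothetical.

```python
from collections import Counter

# Hypothetical mini-corpus, invented purely for this illustration.
CORPUS = [
    "i want to go to new york",
    "we go to work by train",
    "she has two tickets to new york",
    "two new cars",
    "that is too far to go",
    "i want to go too",
]

# Three spellings that sound the same when spoken.
HOMOPHONES = ("to", "too", "two")


def bigram_counts(sentences):
    """Count how often each pair of adjacent words appears."""
    counts = Counter()
    for sentence in sentences:
        words = sentence.split()
        counts.update(zip(words, words[1:]))
    return counts


BIGRAMS = bigram_counts(CORPUS)


def pick_spelling(previous_word, next_word):
    """Choose the spelling seen most often between these two neighbours."""
    def score(candidate):
        return (BIGRAMS[(previous_word, candidate)]
                + BIGRAMS[(candidate, next_word)])
    return max(HOMOPHONES, key=score)


if __name__ == "__main__":
    print("go ___ new     ->", pick_spelling("go", "new"))
    print("has ___ tickets ->", pick_spelling("has", "tickets"))
```

With this corpus, the word between “go” and “new” comes out as “to”, while the word between “has” and “tickets” comes out as “two”. The sound is the same in both cases, so the neighbouring words are the only thing that changes the answer.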
By now you should have a sense of the challenges Dragon faces
as it works and learns to recognize speech. If you’re planning to use this
software anytime soon, speak with our experts to learn more about it.
https://www.dragonsupportservice.us/troubleshoot-speech-recognition-software-error/