Voice and SpeechRecognition

Voice and SpeechRecognition

An easy software solution to enable you to control your computer, dictate emails and letters, and have the computer read documents back to you.

What is Speech Recognition?

Speech Recognition is a technology that allows the computer to identify and understand words spoken by a person using a microphone or telephone. The ultimate goal of the technology is to beable to produce a system that can recognize with 100% accuracy all words that are spoken by any person.

Even after years of research in this area, the best speech recognition software applications still cannot recognize speech with 100% accuracy. Some applications are able to recognize over 90% of words when spoken under specific constraints regarding content and previous training to recognize the speaker's speech characteristics.

Computer software that understands your speech enables you to have conversations with the computer. These conversations would include you and the computer speaking as commands or in response to events, input, or other feedback.

Speaking is easier and more intuitive than selecting buttons and menu items. Human speech has evolved over many thousands of years to become an efficient method of sharing information and giving instructions.

Useful when accessing systems through pay telephones that do not have attached keyboards.

What is continuous speech recognition?

Continuous speech recognition is software that allows the user to speak at a conversational speed. Pauses between words are not needed. Using this software allows you to speak to your computer at a conversational page. The software then transcribes your speech into text as a form of dictation. Continuous speech is not used for command and control of you is software that understands the rules regarding grammar and applies those rules to the speech heard in order to develop a better transcript of the spoken text.

What is Discrete voice recognition ?

Discrete speech is spoken to the computer using a brief pause between each word. This allows the computer to attempt to understand each word in isolation. There are some weaknesses in this approach. Since each word is recognized in isolation, there is less accuracy than in systems that use rules of grammar to facilitate recognition.

Reasons for Trying e-Speaking's

Over 100 commands built-in

Ability to add more commands

Runs in Windows2000 and WindowsXP

Small file size

Utilizes latest technologies from Microsoft

Seemlesly integrate with Office

Voice commands of Mouse events

Based on SAPI and .NET technologies

Easy to Use

Using the Keyboard with Dictation

You can also program a key on the keyboard to activate/toggle the dictation mode. The use of a key in this manner can allow you to quickly change from command to dictation mode and back without having to issue a verbal command.

Activating Dictation using your Voice

You can also program voice commands to activate/toggle the dictation mode. You can assign a command phrase as well as the computer's audible response for both the 'starting' and 'stopping' of dictation.

Help. The computer isn't typing the right word.

You can train the computer to recognize the sounds of individual words. When in Dictation mode, say: Train Word. A Microsoft SAPI dialog will appear allowing you to add/edit words in your user Lexicon (i.e., the database that stores individual words and sounds). After you type the word in the box, press the "Record Pronounciation" button and say the word. You may have to repeat this step several times while the computer records different samples of the way the word sounds. Once it has been accepted by the computer, the word will be added to the list below indicating that it is in your Lexicon.

Important Things to Consider in Dictation

If you are new to dictation, it may help to begin by using notes or handwritten ideas. These aids will help you to gather your thoughts before speaking. With practice, you should get better at dictating to a machine and maybecome very good at dictating without the use of printed materials.

When dictating, try not to look at the computer screen since this may distract you from your thoughts and cause you to loose concentration. The computer will attempt to transcribe everything you say. So, make sure you think about it first before saying it.

Background noises

Voice and Speech recognition products are best used in quiet environments. Although the software can be trained to ignore most background noise, an unexpected noise such as telephone ring, others talking or laughing can end up in the computer attempting to recognize the extraneous sounds as if you were dictating text.

Advantages of Interacting With Computers Through Speech

Hands Speech does not require use of physical devices such as keyboards or pointing devices. Computing devices can become more compact as keyboard and mouse pointing devices take a less prominent role. Individuals with physical challenges may also benefit from the use of speech based applications.
Eyes Does not require the use of vision. Speech can be performed effectively in low light environments or by persons with low or no vision. Applications for this technology may include uses where the individual needs to keep their eyes on equipment or the environment if navigating a vehicle.
History People have been speaking for thousands of years and individuals start speaking at a young age.
Distance Speech systems can process speech from individuals that may be at some distance away from the computer.
Smaller Wireless and handheld devices are becoming increasingly smaller with corresponding smaller displays and input keys. A voice interface is not constrained by the physical size of the device.

Dictation

Dictation is a mode of speech recognition in which the user enters data

byreading directly to the computer. Dictation mode allows users to

dictate memos, letters, and e-mail messages, as well as to enter data

using a speech recognition dictation engine. Most recognizers are speaker-dependent. To ensure accurate recognition, a "speaker profile" is created

to ensure accurate recognition.

Dictation software allows you to create documents (email, wordprocessing, forms) by speaking instead of typing. Most dictation software allows you to speak in a natural way (conversational). To work effectively, these systems need to utilize databases of words and sounds (a lexicon). An initial training session is used to help the computer learn to interpret your voice and speech characteristics. This training (called "enrollement") will improve the accuracy of the dictation.

Pronounciation

Speak in a clear voice and pronounce each word. Speech recognition software is not perfect and may make mistakes in the translation of your voice.

HOW TO USE IT

Use a noise cancelling microphone
Use a digital microphone instead of an analog one
Make sure background noise is at a minimum
Perform the voice training / user enrollment
Practice at first by reading from a printed source (e.g., newspaper)
Speak clearly and slowly
Think about what you want to say before you say it
Train individual words that are not recognized

CLICK TO DOWNLOAD

Voice and SpeechRecognition

Friday 5 August 2011