Speech Recognition

In 1968's '2001: A Space Odyssey,' computers exchanged information with people by talking. Now we're near the year 2000 in the real world, and plenty of people are talking to their computers'though it's often a one-way conversation, using words we shouldn't print. Voice recognition technology is still a long way from Arthur C. Clarke's HAL 9000.

By Kevin JonahSpecial to GCNIn 1968's '2001: A Space Odyssey,' computers exchanged information with people by talking. Now we're near the year 2000 in the real world, and plenty of people are talking to their computers'though it's often a one-way conversation, using words we shouldn't print. Voice recognition technology is still a long way from Arthur C. Clarke's HAL 9000.Voice recognition software has nevertheless improved significantly, as have handwriting recognition, optical character recognition and other interfaces between people and machines. But those advances still aren't enough to get most people talking to their computers in kinder, gentler tones.There are plenty of potential applications for voice recognition. But that's the problem'they're still potential applications, for the most part.The simplest ones work best. Common sense will tell you that systems that call up data in response to a few well-used verbal commands, in the form of numbers or one- or two-syllable phrases, are more likely to work than, say, a system that tries to transcribe the minutes of the House Ways and Means Committee. The closest anyone has come to HAL'the disembodied, interactive, intelligent voice'is in technologies such as high-end telephony applications for customer service and specialized systems such as Microsoft's Auto PC. Auto PC apprehends human commands and responds with information in a synthesized voice.Still, Auto PC is a Windows CE device with a specific'that is, small'set of functions. We're not only a long way from HAL, but that less-classic model for the voice-interface computer, the talking car in 'Knight Rider,' isn't around the corner, either.Generally speaking, there are two modes of voice recognition: command and dictation.Command mode is when a user says words linked to commands, which are then interpreted and executed. You might use command mode to navigate menus in applications or activate macros.This is how voice recognition is used in Auto PC and the voice applications for phone systems. Each word is recognized independently and processed by the recognition software, then passed to the operating system or active application as a command, keystroke or predetermined set of commands.In dictation mode, the computer transcribes what you say into a word processing document, e-mail or other application, adding appropriate punctuation and spelling the words based on context.Dictation is a more difficult computing function than command mode. The software needs to know a larger set of words and to distinguish between words that sound alike but have different meanings'for example, two and to'from the context of the sentence. That's where most voice recognition software runs into trouble.Although great strides have been made in improving the recognition skills of desktop PC voice tools, those advances have increased the overall accuracy of voice recognition only slightly. And even an accuracy rate of 90 percent means that one word in 10 will be mangled'not exactly what you hope for when giving dictation.Unless you have a very good reason to use voice recognition for dictation, you're better off not doing it, unless you don't mind typing in the corrections later.Most voice recognition software does best at command mode, handling well-worn sets of verbal commands. Fortunately, this is the type of voice recognition that gets the most mileage in most voice recognition applications.That's a good thing, because there isn't a great deal of competition in the desktop PC voice recognition market to stimulate the push for accuracy. There are three major competitors in this market: Dragon Systems Inc., IBM Corp. and Lernout & Hauspie.Microsoft has made advancements with its voice recognition technology, Microsoft Speech API 4.0, but it's still a programmer's toolkit and not an integrated suite.For a high level of accuracy in your voice recognition software, you'll need to invest time in teaching it to understand you.Every voice recognition tool for the desktop PC has to be trained. The primary user has to speak a set of words into the computer's microphone to calibrate the recognition software to his or her voice characteristics. The training process can take an hour or more to complete. Add that to the time spent on the initial configuration of the software and on troubleshooting, and you're looking at spending a full workday to get voice recognition up and running on just one machine.That machine had better be fast, too. Most voice recognition tools have steep system requirements. Figure on needing at least a 300-MHz Pentium II or III to say anything more than 'See Dick, Jane and Spot.' Also plan to have at least 64M of RAM and plenty of hard-drive space to store all that voice training.Command mode voice recognition requires additional steps. You have to associate keyboard strokes and application commands with spoken phrases. But you may have less work to do with common applications. Some commands for Microsoft Office and other desktop PC suites may be in the command set, especially if you buy a suite bundled with the voice recognition tool.IBM packages a version of its ViaVoice with Lotus SmartSuite, and Dragon Systems does the same with Corel's WordPerfect suite. Some packages are tailored to specific requirements. Lernout & Hauspie's Now You're Talking on the Web is for use with Web browsers, for example.No matter what package you choose, remember that patience is the biggest user requirement with voice recognition software. And be careful'if you don't watch what you say, your computer might ask, 'What are you doing, Dave?' nXXXSPLITXXX-
It's not quite ready to take dictation, but the technology is accurate enough for other useful applications













Keep it simple














Lernout & Hauspie's Voice Xpress, priced at $150, has a feature that accepts dictation from travelers using a notebook PC or handheld device.




IBM's ViaVoice includes a document analyzer for adding words. The Pro Millennium version costs $180.



Two ways























Obedience school





Tips for Buyers


  • Make sure you know what you're looking for. Do you need command mode or dictation?
  • Use specialized software for enterprise-level tasks.
  • Throw hardware at the problem. To get responsive voice recognition, you'll need a fast system.
  • Adjust your expectations. Even products that have a 98 percent accuracy rate are going to make spectacularly bad mistakes sometimes.
  • Remember, desktop means dedicated. Desktop PC voice recognition software works well only for the user who has trained it.













Kevin Jonah, a Maryland network manager and free-lance writer, writes about computer technology.
















































CompanyProductPlatformsSystem requirementsHow product learns voicesTelephone recognitionSpecial featuresPrice
Dragon Systems Inc.
Newton, Mass.
617-965-5200
www.dragonsys.com
Dragon Systems
NaturallySpeaking
Preferred V4
Win9x, NT 4.0 (with Service Pack 3 or greater)200-MHz Pentium MMX; 48M of RAM for Win9x (reduced training time with 300 MHz and 64M of RAM), 64M of RAM for NT 4.0One initial script training; adjustments can be made during use NoIs capable of Internet browsing, select-and-say editing, dictation playback and text-to-speech conversion; provides multiple-user support$199
IBM Corp.
Armonk, N.Y.
914-499-1900
www.ibm.com/software/
speech/
IBM ViaVoice
Pro Millennium
Win9x, NT 4.0; Mac OS by year's end; toolkits available for Linux233-MHz Pentium with 256K Level 2 cache; 48M of RAM for Win9x, 64M of RAM for NT 4.0; 310M available hard drive spaceIn initial training process and during use; Analyze My Document option adds words to base vocabularyOffered through
ViaVoice
Directory Dialer
Includes specialized vocabularies such as cuisine and legal; comes in Standard, Web and Pro editions$180
Lernout & Hauspie
Burlington, Mass.
781-203-5000
www.lhsl.com
Voice Xpress
Professional
Win9x, NT 4.0Pentium II or equivalent; 48M of RAM for Win9x, 64M of RAM for NT (with Service Pack 3); 200M hard drive space; Sound Blaster 16 or compatible sound card (unless using USB headset microphone); noise canceling headset includedProfile is created after user reads enrollment script for five to 10 minutes; editing feature allows adjustmentsNoIncludes specializedvocabularies, headset microphone and mobile dictation feature that lets users record into some portable devices; comes in Standard, Mobile and Advanced editions$150
Now You're Talking
on the Web
Win9x, NT 4.0
200-MHz Pentium MMX or equivalent; 32M of RAM; 200M hard drive space; Sound Blaster 16 or 100 percent equivalent sound card (unless using USB microphone); noise canceling headset includedSameNoHas talking links (say or click on link); generates a number for each link for navigation; translates to chat-room jargon ("by the way" becomes "BTW")$50








X
This website uses cookies to enhance user experience and to analyze performance and traffic on our website. We also share information about your use of our site with our social media, advertising and analytics partners. Learn More / Do Not Sell My Personal Information
Accept Cookies
X
Cookie Preferences Cookie List

Do Not Sell My Personal Information

When you visit our website, we store cookies on your browser to collect information. The information collected might relate to you, your preferences or your device, and is mostly used to make the site work as you expect it to and to provide a more personalized web experience. However, you can choose not to allow certain types of cookies, which may impact your experience of the site and the services we are able to offer. Click on the different category headings to find out more and change our default settings according to your preference. You cannot opt-out of our First Party Strictly Necessary Cookies as they are deployed in order to ensure the proper functioning of our website (such as prompting the cookie banner and remembering your settings, to log into your account, to redirect you when you log out, etc.). For more information about the First and Third Party Cookies used please follow this link.

Allow All Cookies

Manage Consent Preferences

Strictly Necessary Cookies - Always Active

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data, Targeting & Social Media Cookies

Under the California Consumer Privacy Act, you have the right to opt-out of the sale of your personal information to third parties. These cookies collect information for analytics and to personalize your experience with targeted ads. You may exercise your right to opt out of the sale of personal information by using this toggle switch. If you opt out we will not be able to offer you personalised ads and will not hand over your personal information to any third parties. Additionally, you may contact our legal department for further clarification about your rights as a California consumer by using this Exercise My Rights link

If you have enabled privacy controls on your browser (such as a plugin), we have to take that as a valid request to opt-out. Therefore we would not be able to track your activity through the web. This may affect our ability to personalize ads according to your preferences.

Targeting cookies may be set through our site by our advertising partners. They may be used by those companies to build a profile of your interests and show you relevant adverts on other sites. They do not store directly personal information, but are based on uniquely identifying your browser and internet device. If you do not allow these cookies, you will experience less targeted advertising.

Social media cookies are set by a range of social media services that we have added to the site to enable you to share our content with your friends and networks. They are capable of tracking your browser across other sites and building up a profile of your interests. This may impact the content and messages you see on other websites you visit. If you do not allow these cookies you may not be able to use or see these sharing tools.

If you want to opt out of all of our lead reports and lists, please submit a privacy request at our Do Not Sell page.

Save Settings
Cookie Preferences Cookie List

Cookie List

A cookie is a small piece of data (text file) that a website – when visited by a user – asks your browser to store on your device in order to remember information about you, such as your language preference or login information. Those cookies are set by us and called first-party cookies. We also use third-party cookies – which are cookies from a domain different than the domain of the website you are visiting – for our advertising and marketing efforts. More specifically, we use cookies and other tracking technologies for the following purposes:

Strictly Necessary Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Functional Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Performance Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Social Media Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Targeting Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.