VOICETYPE DICTATION FOR OS/2 AND WINDOWS The high-accuracy speech recognition solution --------------------------------------------------------------------- Highlights IBM VoiceType Dictation offers: o A personal productivity investment - No more typing - Proven IBM technology - Work where you want o Capture precisely what you say - Accurate speech-to-text - Optional vocabularies improve accuracy o Work the way you think - Maintain your train of thought - Create text while busy A personal productivity investment Once the realm of science fiction and futurism, talking to a computer instead of typing is now a practical and affordable solution for a wide variety of professionals. With IBM VoiceType(TM) Dictation for OS/2((R)) or IBM VoiceType Dictation for Windows,(TM) people can create text quickly and efficiently simply by talking to a desktop personal computer, notebook or subnotebook PC. No more typing VoiceType Dictation transforms spoken words into accurate text that is available within seconds-without typing, and without the time-consuming, labor-intensive process of manually transcribing dictation. The result is increased productivity and faster turnaround. For people who need to use their hands and eyes and simultaneously create written information, this increase can be even more pronounced. For example, doctors can review X-rays or examine patients and dictate their findings directly into the computer. This means they can see and verify reports onscreen, then transfer them immediately into the medical records system. Nurses can update patient charts at the bedside-by voice input. And because no transcription is needed, the results can be available to other medical professionals sooner. The capabilities of speech recognition technology can be just as valuable in other professions. Attorneys can develop briefs while working with research materials. Reporters can dictate stories rather than using the keyboard. Executives can call up database information or dictate their electronic messages. Anyone can benefit VoiceType Dictation can speed information handling in almost any paper- and forms-intensive office environment, or wherever people need to dictate while working with their hands and eyes free for other tasks. By making speech recognition technology readily available to the millions of users of PCs, VoiceType Dictation represents the beginning of a major change in the way people interact with computers. What's more, VoiceType Dictation can enable those people who cannot use a keyboard to enter data, create text and run most business applications. This can help organizations meet the special needs of employees, plus make computer resources available to people who simply aren't comfortable with typing. Proven IBM technology VoiceType Dictation is pure IBM technology, based on more than two decades of IBM research and development. Developed in conjunction with IBM's Thomas J. Watson Research Center, VoiceType Dictation's speech recognition technology and high-accuracy language models were previously available only in the award-winning IBM Personal Dictation System product. Capture precisely what you say VoiceType Dictation for OS/2 and VoiceType Dictation for Windows run on IBM-compatible desktop, notebook or subnotebook personal computers with 486 or Pentium(TM) processors. It is an isolated-word, speaker-dependent system that utilizes sophisticated statistical language models and a speech adapter card-both designed by IBM-to rapidly analyze spoken input. Accurate speech-to-text The language models, an extensive 22,000-word general vocabulary and a detailed understanding of each user's voice, allow VoiceType Dictation to accurately convert speech to text as quickly as 70 to 100 words a minute- faster than most people can type. Users simply talk into a hand-held or headset microphone. The system analyzes their speech in seconds and displays it onscreen as text. Properly recognized words are automatically displayed correctly. Even like-sounding words that are difficult to distinguish-such as "to," "too" and "two" or "our" and "hour"-are easily understood. With voice commands, users can edit or correct dictated text while it is displayed in the VoiceType Dictation speech window. VoiceType Dictation can even "play back" a user's dictation for easier proofreading. Dictated text can then be transferred into OS/2 or Windows applications for processing. VoiceType Dictation can be ordered with ISA, Micro Channel((R)) or PCMCIA(TM) (Personal Computer Memory Card International Association) speech adapter cards. The cards handle voice processing and word recognition functions, independent of the computer's processor, and provide audio support for the microphone and speaker. Optional vocabularies improve accuracy You can optimize VoiceType Dictation and increase accuracy for various professions, including medicine and journalism, by taking advantage of specialty vocabularies, available as an option. Easily installed, these vocabularies include words, spellings, word probabilities and linguistic information that enable VoiceType Dictation to correctly "hear" industry-specific words. Optional vocabularies include: o VOICETYPE VOCABULARY FOR RADIOLOGY DICTATION. 19,000 words. Carefully constructed for radiologists to use while reading X-rays or films, this vocabulary includes medical terminology for radiology and related specialties. o VOICETYPE VOCABULARY FOR EMERGENCY MEDICINE DICTATION. 16,000 words. This vocabulary contains medical terms used in hospital emergency room environments. It can also be used by other doctors and health-care professionals. o VOICETYPE VOCABULARY FOR JOURNALISM DICTATION. 30,000 words. Constructed from the archives of major newspapers, this vocabulary includes a variety of significant proper names, as well as editorial terminology. You can also customize VoiceType Dictation's base vocabulary by adding up to 2,000 of your own words. Adapts to you VoiceType Dictation is a user-specific speech recognition solution. This allows the system to deliver the highest accuracy, while supporting an extensive vocabulary of general and specific terms and accommodating the natural variations in pronunciation from speaker to speaker. The initial enrollment process for each speaker takes about an hour and a half of the user's time. The process also helps individuals learn the "discrete" speech pattern-short, clear spaces between words-understood by VoiceType dictation. As people use the system, VoiceType Dictation continues to learn each person's speaking style, further increasing accuracy and instant word recognition. Create customized solutions easily The Voice Action Editor, along with built-in support for OS/2 and Windows applications and sophisticated APIs (application program interfaces), make it easy to develop customized speech recognition solutions built around IBM's VoiceType Dictation. The Voice Action Editor is a powerful yet simple voice-controlled macro application. Individual users can create dictation macros with the Voice Action Editor by prerecording repetitive phrases or "boilerplate" paragraphs and later recalling them with a single spoken word. With the same ease, developers or systems experts can build application-specific macros that invoke and control other programs. The system prompts you by displaying all possible voice or keyboard-activated commands. Plus, you can store a series of keystrokes as a voice-command macro. The possibilities are limited only by your imagination. The command "Good morning," for example, could automatically call up a calendar program. Or "Save report" could call up a dialogue box for naming, identifying and saving a radiology report, then transfer the file to the medical records system and place it in a work queue for text processing. Support for OS/2 and Windows applications VoiceType Dictation works in concert with existing OS/2, OS/2 Warp, and Windows operating systems and applications. Using voice, mouse or keyboard commands, you can transfer dictated data to most OS/2 and Windows applications that support a data transfer clipboard via cut-and-paste functions. Keystroke emulation macros enable voice command transfer of data to applications that do not support cut-and-paste. What's more, you can use voice commands to invoke many operating system and application functions. VoiceType Dictation includes automatic speech enabling of the OS/2 Workplace Shell((R)) and Presentation Manager((R)) application controls. This means users can select and operate Presentation Manager applications by speaking words that appear on menu bars, list boxes, push buttons, check boxes and radio buttons. Compatible OS/2 applications include: o WordPerfect((R)) for OS/2 V5.2 o Lotus((R)) 1-2-3((R)) for OS/2 V1.1 o Lotus Notes((R)) for OS/2 V3.0 o Microsoft((R)) Word((R)) for OS/2 V1.1 o OS/2 System Editor V2.1 o OS/2 Communications Manager 3270 Emulator V2.0 o DeScribe(TM) for OS/2 V4.0. Speech support for Windows and Windows applications allows users to select icons and tool-bar functions with voice commands. Compatible Windows applications include: o Intuit((R)) Quicken((R)) V3.0 o Lotus 1-2-3 V4.01 o Lotus Notes V3.0 o Lotus Ami Pro((R)) V3.01 o Lotus cc:Mail(TM) V2.0 o Microsoft Word V6.0 o Microsoft Excel(TM) V5.0 o Microsoft Works(TM) for Windows V3.0 o WordPerfect for Windows V6.0. Support for OS/2 and Windows applications is continually being expanded. Ask for a current list of compatible applications. Everything you need in a single package The IBM VoiceType Dictation solution includes: o 3.5-inch 1.44MB program diskettes o User's Guide with Program License Agreement o One IBM VoiceType Dictation Adapter (ISA, Micro Channel or PCMCIA) o One IBM-qualified microphone. Voice your request for more information To learn more about how IBM VoiceType Dictation can help you enhance your productivity, call 1-800-TALK-2-ME (1-800-825-5263), or ask your authorized IBM reseller about customized speech recognition solutions using IBM VoiceType Dictation. IBM VOICETYPE DICTATION AT A GLANCE --------------------------------------------------------------------- Hardware requirements o IBM-compatible personal computer with a for OS/2 minimum of a 486SX 25MHz microprocessor o 3.5-inch 1.44MB diskette drive o Minimum of 8MB of memory above operating system and application requirements o 33MB hard-disk space available (plus an additional 30MB during the enrollment process) --------------------------------------------------------------------- Hardware requirements o IBM-compatible personal computer with a for Windows minimum of a 486SX/SLC 25MHz with math coprocessor (486DX 33MHz processor (recommended) o Minimum of 12MB of memory (16MB recommended) o 33MB hard-disk space available (plus an additional 30MB during the enrollment process) o DOS 5.2 or higher with Windows 3.1/3.11 or Windows for Workgroups 3.1/3.11 (DOS 6.1 required for PCMCIA) --------------------------------------------------------------------- Additional requirements o Additional disk space for optional for OS/2 or Windows industry-specific vocabularies -VoiceType Vocabulary for Radiology Dictation: 16MB -VoiceType Vocabulary for Emergency Medicine Dictation: 16MB -VoiceType Vocabulary for Journalism Dictation: 14MB o An IBM-qualified microphone (shipped with the software) o One available ISA or Micro Channel expansion slot or PCMCIA Type II slot o An IBM VoiceType Dictation Adapter (shipped with the software) o Optional speaker and appropriate cables --------------------------------------------------------------------- Software requirements o OS/2 Version 2.1 or higher o DOS 5.2 or higher with Windows 3.1/3.11, or Windows for Workgroups 3.1/3.11 (DOS 6.1 is required for PCMCIA) o IBM VoiceType Dictation is a prerequisite for optional VoiceType vocabularies --------------------------------------------------------------------- Limitations o IBM VoiceType Dictation and its files should not be compressed with disk compression utilities such as Stacker(TM) and DoubleSpace(TM) o IBM VoiceType Dictation may not work with all processor upgrade boards --------------------------------------------------------------------- --------------------------------------------------------------------- Features Benefits --------------------------------------------------------------------- Voice-to-text capability o Facilitates the creation of documents o Saves time and increases productivity by eliminating transcription o Provides a solution for people who are unable (or unwilling) to use a keyboard --------------------------------------------------------------------- Hands-free, eyes-free o Enables users in technical, production and operation office environments to dictate text, produce reports or fill in forms while using their hands and eyes for other tasks o Physicians and other medical personnel can read X-rays or examine patients and describe findings simultaneously --------------------------------------------------------------------- Large general-language o Provides a ready-to-use language model vocabulary designed to serve a wide range of professional and business users o Helps ensure accurate word capture o Allows you to customize the system easily by adding up to 2,000 words to the vocabulary --------------------------------------------------------------------- Optional industry- o Ready-to-use solutions for radiology, specific vocabulary emergency medicine and journalism enhance applications o Specialized terminology and word-pattern models enable the system to frequently recognize words that are run together, slurred or partially obscured by background noise --------------------------------------------------------------------- Speaker enrollment o Maximizes dictation accuracy by creating a function model for every individual speaker --------------------------------------------------------------------- Clipboard data transfer o Transfer dictated text to OS/2 and Windows applications by cut-and-paste, using voice, keyboard or mouse commands o Use keyboard emulation to transfer data into applications that do not support cut-and-paste --------------------------------------------------------------------- Navigation controls o Use voice commands to operate OS/2, Windows and compatible applications o Includes automatic "speech enabling" of the OS/2 WorkPlace Shell and Presentation Manager controls, as well as Windows icons and tool bars --------------------------------------------------------------------- Voice Action Editor o Allows users and developers to define macros that perform a sequence of actions with a single voice command o Facilitates development of customized, application-specific solutions o Increases productivity by letting users insert prerecorded phrases or paragraphs of repetitive text with a single spoken word --------------------------------------------------------------------- PCMCIA VoiceType o Supports notebook computers for Dictation Adapter transportable dictation solutions o Helps make it easy for people to use the system, wherever they need to work --------------------------------------------------------------------- Multiple language o Available for OS/2 in U.S. English, U.K. support English, French, German, Italian and Spanish. Available for Windows in U.S. English. Support for languages is continually being expanded. --------------------------------------------------------------------- ((R))IBM, OS/2, Micro Channel, Workplace Shell and Presentation Manager are registered trademarks of International Business Machines Corporation. (TM) VoiceType is a trademark of International Business Machines Corporation. (TM) Windows is a trademark of Microsoft Corporation. All other products are trademarks or registered trademarks of their respective companies. Screens shown in this publication are prototypes. Actual screens generated by the product delivered by IBM may differ. ((C))International Business Machines Corporation 1994 IBM Corporation Department 507 1133 Westchester Avenue White Plains, NY 10604 Produced in the United States of America 10-94 All Rights Reserved References in this publication to IBM products or services do not imply that IBM intends to make them available outside the United States. G221-3915-02 / G2213915