Science Current Events | Science News | Brightsurf.com
 
Email a Friend Send to a friend
Printer Friendly Print Study evaluates transcription accuracy in men and women

Study evaluates transcription accuracy in men and women

May 07, 2007

There is a significantly higher rate of transcription error in women compared to men when using commercial voice recognition applications, according to a recent study.

"Our residency program and department recently made the transition to speech recognition from a digital dictation system," said Syed Ali, MD, lead author of the study. "This prompted us to ask research questions about how to increase our accuracy rates and what factors adversely impacted speech recognition," said Dr. Ali.




Ten radiology residents, five male and five female, were each trained on a commercial speech recognition application. Each resident was asked to dictate a standardized set of ten radiology reports containing a total of 2,123 words. Utilizing a commercial software solution, the generated reports were then compared with the original reports and error rates were calculated. The error rate was defined as the sum of the number of word insertions and deletions divided by the total word count for a given report.

According to the study, error rates in the male population ranged from 0.025 to 0.139 while the error rates in the females ranged from 0.015 to 0.206. The results show a higher rate of recognition error in the females compared to the males.

"The immediate impact of the study for radiologists is an increased level of awareness that women may need to spend more time training on the system than their male counterparts and may have to work somewhat harder to make the system successful," said Dr. Ali. "This could include using macros or actually altering dictation style to increase recognition rates," he said.

"Any efforts to improve recognition rates will have a positive impact on physicians and patients of course by reducing error rates and improving productivity," said Dr. Ali.

The full results of this study will be presented on Monday, May 7, 2007 during the American Roentgen Ray Society's annual meeting in Orlando, FL.

American Roentgen Ray Society



Related Speech Recognition Current Events and Speech Recognition News Articles Speech Recognition Current Events and Speech Recognition News RSS Speech Recognition Current Events and Speech Recognition News RSS
New NIST method reveals all you need to know about 'waveforms'
The National Institute of Standards and Technology (NIST) has unveiled a method for calibrating entire waveforms-graphical shapes showing how electrical signals vary over time-rather than just parts of waveforms as is current practice.

Drawing inspiration from nature to build a better radio
MIT engineers have built a fast, ultra-broadband, low-power radio chip, modeled on the human inner ear, that could enable wireless devices capable of receiving cell phone, Internet, radio and television signals.

Age-related difficulty recognizing words predicted by brain differences
Older adults may have difficulty understanding speech because of age-related changes in brain tissue, according to new research in the May 13 issue of The Journal of Neuroscience.

Cochlear implant recipients experience improvement in quality of life
Cochlear implant recipients experience a significant improvement in their quality of life, and have improved speech recognition, according to new research published in the March 2008 issue of Otolaryngology - Head and Neck Surgery.

Lend me your ears -- and the world will sound very different
Recognising people, objects or animals by the sound they make is an important survival skill and something most of us take for granted. But very similar objects can physically make very dissimilar sounds and we are able to pick up subtle clues about the identity and source of the sound.

Genes influence age-related hearing loss
A new Brandeis University study of twins shows that genes play a significant role in the level of hearing loss that often appears in late middle age.

MIT develops lecture search engine to aid students
Imagine you are taking an introductory biology course. You're studying for an exam and realize it would be helpful to revisit the professor's explanation of RNA interference. Fortunately for you, a digital recording of the lecture is online, but the 10-minute explanation you want is buried in a 90-minute lecture you don't have time to watch.

Computer scientists unravel 'language of surgery'
Borrowing ideas from speech recognition research, Johns Hopkins computer scientists are building mathematical models to represent the safest and most effective ways to perform surgery, including tasks such as suturing, dissecting and joining tissue.

Novel audio telescope heeds call of the wild ... birds
Researchers at the National Institute of Standards and Technology (NIST), Intelligent Automation, Inc. (Rockville, Md.) and the University of Missouri-Columbia have modified a NIST-designed microphone array to make an "audio telescope" that could help airports more efficiently avoid costly and hazardous bird-aircraft collisions by locating and identifying birds by their calls.

Your virtual assistant for personal financial advice
Added usability and intelligence has been brought to virtual assistants thanks to technology developed by European researchers, offering online users an entertaining, yet competent professional financial service.
More Speech Recognition Current Events and Speech Recognition News Articles
Statistical Methods for Speech Recognition (Language, Speech, and Communication)

Statistical Methods for Speech Recognition (Language, Speech, and Communication)
by Frederick Jelinek (Author)

"For the first time, researchers in this field will have a book that will serve as the bible' for many aspects of language and speech processing. Frankly, I can't imagine a person working in this field not wanting to have a personal copy." -- Victor Zue, MIT Laboratory for Computer Science

This book reflects decades of important research on the mathematical foundations of speech recognition. It focuses on underlying statistical techniques such as hidden Markov models, decision trees, the expectation-maximization algorithm, information theoretic goodness criteria, maximum entropy probability estimation, parameter and data clustering, and smoothing of probability distributions. The author's goal is to present these principles clearly in the simplest setting, to show the...

Speech and Language Processing (2nd Edition)

Speech and Language Processing (2nd Edition)
by Daniel Jurafsky (Author), James H. Martin (Author)

An explosion of Web-based language techniques, merging of distinct fields, availability of phone-based dialogue systems, and much more make this an exciting time in speech and language processing. The first of its kind to thoroughly cover language technology – at all levels and with all modern technologies – this book takes an empirical approach to the subject, based on applying statistical and other machine-learning algorithms to large corporations. Builds each chapter around one or more worked examples demonstrating the main idea of the chapter, usingthe examples to illustrate the relative strengths and weaknesses of various approaches. Adds coverage of statistical sequence labeling, information extraction, question answering and summarization,...

Dragon NaturallySpeaking 10 Preferred

Dragon NaturallySpeaking 10 Preferred
by Nuance Communications, Inc.

DRAGON NATURALLYSPEAKING DVD PREFERRED 10 US VAR

Fundamentals of Speech Recognition

Fundamentals of Speech Recognition
by Lawrence Rabiner (Author), Biing-Hwang Juang (Author)

Provides a theoretically sound, technically accurate, and complete description of the basic knowledge and ideas that constitute a modern system for speech recognition by machine. Covers production, perception, and acoustic-phonetic characterization of the speech signal; signal processing and analysis methods for speech recognition; pattern comparison techniques; speech recognition system design and implementation; theory and implementation of hidden Markov models; speech recognition based on connected word models; large vocabulary continuous speech recognition; and task- oriented application of automatic speech recognition. For practicing engineers, scientists, linguists, and programmers interested in speech recognition.



Speech Recognition: Theory and C++ Implementation

Speech Recognition: Theory and C++ Implementation
by Claudio Becchetti (Author), Lucio Prina Ricotti (Author)

Automatic Speech Recognition (ASR) is the enabling technology for hands-free dictation and voice-triggered computer menus. It is becoming increasingly prevalent in environments such as private telephone exchanges and real-time information services. Speech Recognition introduces the principles of ASR systems, including the theory and implementation issues behind multi-speaker continuous speech recognition. Focusing on the algorithms employed in commercial and laboratory systems, the treatment enables the reader to devise practical solutions for ASR system problems. It addresses in detail C++ programming techniques used to develop ASR applications, thus offering skills that will prove useful in any large C++ based software project. Possible extensions of the well-established ASR technology...

MacSpeech Dictate

MacSpeech Dictate
by MacSpeech

Welcome to the brand new MacSpeech Dictate, the premier speech recognition solution for the Macintosh. Written from the ground up for the Mac, MacSpeech Dictate¿s features, accuracy, and capabilities make it as fun, productive, and intuitive to use as the Mac itself. The all-new MacSpeech Dictate provides: Amazing Accuracy. MacSpeech Dictate will astonish you with its accuracy. You simply talk and leave the recognition to MacSpeech Dictate. Minimal Training Required. MacSpeech Dictate provides astounding accuracy and productivity. With just five minutes or less of training, you'll be using MacSpeech Dictate's superior capabilities. Essential Command Capabilities. Instead of using your mouse to select menu commands or your keyboard to type shortcuts, just speak a command. MacSpeech...

How to Build a Speech Recognition Application: Second Edition: A Style Guide for Telephony Dialogues

How to Build a Speech Recognition Application: Second Edition: A Style Guide for Telephony Dialogues
by Bruce Balentine (Author), David P. Morgan (Author)

Although this style guide has stood up fairly well over the two and a half years since it was first published, the voice user interface subject area has continued to expand and mature. In this edition we have taken more time to address voice portals and we have added an appendix on selecting and training "Voice Talent." It has become obvious over the past few years that selecting the talent for the application can have as much impact on the end user as the design itself. Furthermore, the application designer and the talent must work together to create a satisfying user experience.

We have also rewritten and expanded on Chapter 11, "Usability Testing and Performance Reporting." Most of the new material summarizes our experiences since the first edition was published. In addition, we...

The Application of Hidden Markov Models in Speech Recognition

The Application of Hidden Markov Models in Speech Recognition
by Mark Gales (Author), Steve Young (Author)

Hidden Markov Models (HMMs) provide a simple and effective framework for modelling time-varying spectral vector sequences. As a consequence, almost all present day large vocabulary continuous speech recognition (LVCSR) systems are based on HMMs. Whereas the basic principles underlying HMM-based LVCSR are rather straightforward, the approximations and simplifying assumptions involved in a direct implementation of these principles would result in a system which has poor accuracy and unacceptable sensitivity to changes in operating environment. Thus, the practical application of HMMs in modern systems involves considerable sophistication. The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the...

Garmin nüvi 855 4.3-Inch Widescreen Portable GPS Navigator with Speech Recognition

Garmin nüvi 855 4.3-Inch Widescreen Portable GPS Navigator with Speech Recognition
by Garmin



Dragon NaturallySpeaking 10 Standard

Dragon NaturallySpeaking 10 Standard
by Nuance Communications, Inc.

With Dragon NaturallySpeaking 10 Standard, you can talk to your computer and watch your spoken words instantly appear in documents, email and instant messages. You can even surf the Web just by speaking! Dragon NaturallySpeaking 10 turns your voice into text three times faster than most people type — with up to 99% accuracy. It learns to recognize your voice instantly, and continually improves the more you use it! Just use your voice to dictate and edit in virtually any Windows application, including Microsoft Word, Internet Explorer, Mozilla Firefox and AOL. This revolutionary and easy-to-use product gives you everything you need to get started, including a high-quality headset.

© 2009 BrightSurf.com