Skip to main content
Journal of the American Medical Informatics Association : JAMIA logoLink to Journal of the American Medical Informatics Association : JAMIA
. 1995 Jan-Feb;2(1):36–45. doi: 10.1136/jamia.1995.95202546

A continuous-speech interface to a decision support system: I. Techniques to accommodate for misrecognized input.

S Shiffman 1, W M Detmer 1, C D Lane 1, L M Fagan 1
PMCID: PMC116235  PMID: 7895134

Abstract

OBJECTIVE: Develop a continuous-speech interface that allows flexible input of clinical findings into a medical diagnostic application. DESIGN: The authors' program allows users to enter clinical findings using their own vernacular. It displays from the diagnostic program's controlled vocabulary a list of terms that most closely matches the input, and allows the user to select the single best term. The interface program includes two components: a speech-recognition component that converts utterances into text strings, and a language-processing component that matches recognized text strings with controlled-vocabulary terms. The speech-recognition component is composed of commercially available speech-recognition hardware and software, and developer-created grammars, which specify the language to be recognized. The language-processing component is composed of a translator, which extracts a canonical form from both recognized text strings and controlled-vocabulary terms, and a matcher, which measures the similarity between the two canonical forms. RESULTS: The authors discovered that grammars constructed by a physician, who could anticipate how users might speak findings, supported speech recognition better than did grammars constructed programmatically from the controlled vocabulary. However, this programmatic method of grammar construction was more time efficient and better supported long-term maintenance of the grammars. The authors also found that language-processing techniques recovered some of the information lost due to speech misrecognition, but were dependent on the completeness of supporting synonym dictionaries. CONCLUSIONS: The authors' program demonstrated the feasibility of using continuous speech to enter findings into a medical application. However, improvements in speech-recognition technology and language-processing techniques are needed before natural continuous speech becomes an acceptable input modality for clinical applications.

Full Text

The Full Text of this article is available as a PDF (1.1 MB).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Bergeron B., Locke S. Speech recognition as a user interface. MD Comput. 1990 Sep-Oct;7(5):329–334. [PubMed] [Google Scholar]
  2. Detmer W. M., Shiffman S., Wyatt J. C., Friedman C. P., Lane C. D., Fagan L. M. A continuous-speech interface to a decision support system: II. An evaluation using a Wizard-of-Oz experimental paradigm. J Am Med Inform Assoc. 1995 Jan-Feb;2(1):46–57. doi: 10.1136/jamia.1995.95202548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Feldman C. A., Stevens D. Pilot study on the feasibility of a computerized speech recognition charting system. Community Dent Oral Epidemiol. 1990 Aug;18(4):213–215. doi: 10.1111/j.1600-0528.1990.tb00060.x. [DOI] [PubMed] [Google Scholar]
  4. Hersh W. R., Greenes R. A. SAPHIRE--an information retrieval system featuring concept matching, automatic indexing, probabilistic retrieval, and hierarchical relationships. Comput Biomed Res. 1990 Oct;23(5):410–425. doi: 10.1016/0010-4809(90)90031-7. [DOI] [PubMed] [Google Scholar]
  5. Johnson K., Poon A., Shiffman S., Lin R., Fagan L. A history-taking system that uses continuous speech recognition. Proc Annu Symp Comput Appl Med Care. 1992:757–761. [PMC free article] [PubMed] [Google Scholar]
  6. Miller R. A., Pople H. E., Jr, Myers J. D. Internist-1, an experimental computer-based diagnostic consultant for general internal medicine. N Engl J Med. 1982 Aug 19;307(8):468–476. doi: 10.1056/NEJM198208193070803. [DOI] [PubMed] [Google Scholar]
  7. Wulfman C. E., Rua M., Lane C. D., Shortliffe E. H., Fagan L. M. Graphical access to medical expert systems: V. Integration with continuous-speech recognition. Methods Inf Med. 1993 Feb;32(1):33–46. [PubMed] [Google Scholar]

Articles from Journal of the American Medical Informatics Association are provided here courtesy of Oxford University Press

RESOURCES