Abstract
This paper introduces the session on advanced speech recognition technology. The two papers comprising this session argue that current technology yields performance that is only an order of magnitude away from human performance in error rate, and that incremental improvements will bring us to that desired level. I argue that, to the contrary, present performance is far removed from human performance and that a revolution in our thinking is required to achieve the goal. I further assert that, to bring about this revolution, more effort should be expended on basic research and less on premature attempts to commercialize a deficient technology.