Abstract
Research in speech recognition and synthesis over the past several decades has brought speech technology to a point where it is being used in "real-world" applications. However, despite the progress, the perception remains that the current technology is not flexible enough to allow easy voice communication with machines. The focus of speech research is now on producing systems that are accurate and robust but that do not impose unnecessary constraints on the user. This chapter takes a critical look at the shortcomings of the current speech recognition and synthesis algorithms, discusses the technical challenges facing research, and examines the new directions that research in speech recognition and synthesis must take in order to form the basis of new solutions suitable for supporting a wide range of applications.