- 1.Clow, J. & Oviatt, S. L. STAMP: A suite of tools for analyzing multimodal system processing, Proceedings of the International Conference on Spoken Language Processing, Sydney: ASSTA Inc., 1998, 2, pp. 277-280.Google Scholar
- 2.Cohen, P. R., Cheyer, A., Wang, M., & Baeg, S. C. An open agent architecture. AAAI '94 Spring Symposium Series on Software Agents, Menlo Park: AAAI press, 1994, pp. 1-8.Google Scholar
- 3.Cohen, P., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L. & Clow, J. Quickset: Multimodal interaction for distributed applications. Proceedings of the Fifth ACM International Multimedia Conference, New York, NY: ACM Press, 1997, pp. 31-4. Google ScholarDigital Library
- 4.Das, S., Bakis, R., Nadas, A., Nahamoo, D. & Picheny, M. Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system, Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing, 1993, 2, pp. 71-74.Google ScholarCross Ref
- 5.Dreher, J.J. & O'Neill, J.J. Effects of ambient noise on speaker intelligibility for words and phrases, Journal of the Acoustical Society of America, 1957, 29 (12), pp. 1320-1323.Google ScholarCross Ref
- 6.Gong, Y. Speech recognition in noisy environments, Speech Communication, 1995, 16, pp. 261-291. Google ScholarDigital Library
- 7.Hanley, T.D.& Steer, M.D. Effect of level of distracting noise upon speaking rate, duration and intensity, Journal of Speech and Hearing Disorders, 1949, 14, pp. 363-368.Google ScholarCross Ref
- 8.Iverson, P., Bernstein, L., & Auer, E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition, Speech Communication, 1998, 26 (1-2), pp. 45-63. Google ScholarDigital Library
- 9.Johnston, M., Cohen, P.R., McGee, D., Oviatt, S.L., Pittman, J.A. & Smith, I. Unification-based multimodal integration. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, San Francisco, CA.: Morgan Kaufmann, 1997, pp. 281-288. Google ScholarDigital Library
- 10.Junqua, J.C. The Lombard reflex and its role on human listeners & automatic speech recognizers, Journal of the Acoustical Society of America, 1993, 93 (1), pp.510-24.Google ScholarCross Ref
- 11.Kumar, S., Cohen, P.R., Levesque, H.J. The adaptive agent architecture: achieving fault-tolerance using persistent broker teams. To appear in The Fourth International Conference on Multi-Agent Systems (ICMAS 2000), Boston, MA, USA, July 7-12, 2000. Google ScholarDigital Library
- 12.Lockwood, P. & Boudy, J. Experiments with a non-linear spectral subtractor (NSS), hidden markov models and the projection for robust speech recognition in cars, Speech Communication, 1992,11(2-3), pp. 215-28. Google ScholarDigital Library
- 13.Lombard, E. Le signe de l'elevation de la voix, Annals Maladiers Oreille, Larynx, Nez, Pharynx, 1911, 37, pp. 101-119.Google Scholar
- 14.Oviatt, S. L. Ten myths of multimodal interaction, Communications of the ACM, Vol. 42, No. 11, November, 1999, pp. 74-81. Google ScholarDigital Library
- 15.Oviatt, S. L. Mutual disambiguation of recognition errors in a multimodal architecture. Proceedings of the Conference on Human Factors in Computing Systems (CHI'99), ACM Press: New York, N.Y., 1999, pp. 576-583. Google ScholarDigital Library
- 16.Pick, H.L., Siegel, G.M., Fox, P.W., Garber, S.R. & Kearney, J.K. Inhibiting the Lombard effect, Journal of the Acoustical Society of America, 1989, 85 (2), pp. 894-900.Google ScholarCross Ref
- 17.Potash, L.M. A signal detection problem and a possible solution in Japanese quail, Animal Behavior, 1972, 20, pp. 192-195.Google ScholarCross Ref
- 18.Schulman, R. Articulatory dynamics of loud and normal speech, Journal of the Acoustical Society of America, 1989, 85, pp. 295-312.Google ScholarCross Ref
- 19.Siegel, G.M., Pick, H.L., Olsen, M.G. & Sawin, L. Auditory feedback in the regulation of vocal intensity of preschool children, Developmental Psychology, 1976, 12, pp. 255-261.Google ScholarCross Ref
- 20.Sinott, J.M., Stebbins, W.C. & Moody,D.B. Regulation of voice amplitude by the monkey, Journal of the Acoustical Society of America, 1975, 58, pp. 412-14.Google ScholarCross Ref
- 21.Van Summers, W.V., Pisoni, D.B., Bernacki, R.H., Pedlow, R.I. and Stokes, M.A. Effects of noise on speech production: Acoustic and perceptual analyses, Journal of the Acoustical Society of America, 1988, 84, pp. 917-928.Google ScholarCross Ref
Index Terms
- Multimodal system processing in mobile environments
Recommendations
Mutual disambiguation of recognition errors in a multimodel architecture
CHI '99: Proceedings of the SIGCHI conference on Human Factors in Computing SystemsAs a new generation of multimodal/media systems begins to define itself, researchers are attempting to learn how to combine different modes into strategically integrated whole systems. In theory, well designed multimodal systems should be able to integrate ...
A mobile multimodal dialogue system for public transportation navigation evaluated
MobileHCI '06: Proceedings of the 8th conference on Human-computer interaction with mobile devices and servicesAs the technical capabilities of latest mobile devices are combined with mobile broadband internet access, we are ready to make use of free and natural speech in mobile services by utilizing optional and complementary means of input. In these kinds of ...
Integration and synchronization of input modes during multimodal human-computer interaction
ReferringPhenomena '97: Referring Phenomena in a Multimedia Context and their Computational TreatmentOur ability to develop robust multimodal systems will depend on knowledge of the natural integration patterns that typify people's combined use of different input modes. To provide a foundation for theory and design, the present research analyzed ...
Comments