Article

Free Access

Multimodal system processing in mobile environments

Author:
Sharon Oviatt

Department of Computer Science and Engineering, Oregon Graduate Institute of Science and Technology, 20000 N.W. Walker Road, Beaverton, Oregon

Department of Computer Science and Engineering, Oregon Graduate Institute of Science and Technology, 20000 N.W. Walker Road, Beaverton, Oregon
View Profile

UIST '00: Proceedings of the 13th annual ACM symposium on User interface software and technologyNovember 2000Pages 21–30https://doi.org/10.1145/354401.354408

Published:01 November 2000Publication History

UIST '00: Proceedings of the 13th annual ACM symposium on User interface software and technology

Pages 21–30

References

1.Clow, J. & Oviatt, S. L. STAMP: A suite of tools for analyzing multimodal system processing, Proceedings of the International Conference on Spoken Language Processing, Sydney: ASSTA Inc., 1998, 2, pp. 277-280.Google Scholar
2.Cohen, P. R., Cheyer, A., Wang, M., & Baeg, S. C. An open agent architecture. AAAI '94 Spring Symposium Series on Software Agents, Menlo Park: AAAI press, 1994, pp. 1-8.Google Scholar
3.Cohen, P., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L. & Clow, J. Quickset: Multimodal interaction for distributed applications. Proceedings of the Fifth ACM International Multimedia Conference, New York, NY: ACM Press, 1997, pp. 31-4. Google ScholarDigital Library
4.Das, S., Bakis, R., Nadas, A., Nahamoo, D. & Picheny, M. Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system, Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing, 1993, 2, pp. 71-74.Google ScholarCross Ref
5.Dreher, J.J. & O'Neill, J.J. Effects of ambient noise on speaker intelligibility for words and phrases, Journal of the Acoustical Society of America, 1957, 29 (12), pp. 1320-1323.Google ScholarCross Ref
6.Gong, Y. Speech recognition in noisy environments, Speech Communication, 1995, 16, pp. 261-291. Google ScholarDigital Library
7.Hanley, T.D.& Steer, M.D. Effect of level of distracting noise upon speaking rate, duration and intensity, Journal of Speech and Hearing Disorders, 1949, 14, pp. 363-368.Google ScholarCross Ref
8.Iverson, P., Bernstein, L., & Auer, E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition, Speech Communication, 1998, 26 (1-2), pp. 45-63. Google ScholarDigital Library
9.Johnston, M., Cohen, P.R., McGee, D., Oviatt, S.L., Pittman, J.A. & Smith, I. Unification-based multimodal integration. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, San Francisco, CA.: Morgan Kaufmann, 1997, pp. 281-288. Google ScholarDigital Library
10.Junqua, J.C. The Lombard reflex and its role on human listeners & automatic speech recognizers, Journal of the Acoustical Society of America, 1993, 93 (1), pp.510-24.Google ScholarCross Ref
11.Kumar, S., Cohen, P.R., Levesque, H.J. The adaptive agent architecture: achieving fault-tolerance using persistent broker teams. To appear in The Fourth International Conference on Multi-Agent Systems (ICMAS 2000), Boston, MA, USA, July 7-12, 2000. Google ScholarDigital Library
12.Lockwood, P. & Boudy, J. Experiments with a non-linear spectral subtractor (NSS), hidden markov models and the projection for robust speech recognition in cars, Speech Communication, 1992,11(2-3), pp. 215-28. Google ScholarDigital Library
13.Lombard, E. Le signe de l'elevation de la voix, Annals Maladiers Oreille, Larynx, Nez, Pharynx, 1911, 37, pp. 101-119.Google Scholar
14.Oviatt, S. L. Ten myths of multimodal interaction, Communications of the ACM, Vol. 42, No. 11, November, 1999, pp. 74-81. Google ScholarDigital Library
15.Oviatt, S. L. Mutual disambiguation of recognition errors in a multimodal architecture. Proceedings of the Conference on Human Factors in Computing Systems (CHI'99), ACM Press: New York, N.Y., 1999, pp. 576-583. Google ScholarDigital Library
16.Pick, H.L., Siegel, G.M., Fox, P.W., Garber, S.R. & Kearney, J.K. Inhibiting the Lombard effect, Journal of the Acoustical Society of America, 1989, 85 (2), pp. 894-900.Google ScholarCross Ref
17.Potash, L.M. A signal detection problem and a possible solution in Japanese quail, Animal Behavior, 1972, 20, pp. 192-195.Google ScholarCross Ref
18.Schulman, R. Articulatory dynamics of loud and normal speech, Journal of the Acoustical Society of America, 1989, 85, pp. 295-312.Google ScholarCross Ref
19.Siegel, G.M., Pick, H.L., Olsen, M.G. & Sawin, L. Auditory feedback in the regulation of vocal intensity of preschool children, Developmental Psychology, 1976, 12, pp. 255-261.Google ScholarCross Ref
20.Sinott, J.M., Stebbins, W.C. & Moody,D.B. Regulation of voice amplitude by the monkey, Journal of the Acoustical Society of America, 1975, 58, pp. 412-14.Google ScholarCross Ref
21.Van Summers, W.V., Pisoni, D.B., Bernacki, R.H., Pedlow, R.I. and Stokes, M.A. Effects of noise on speech production: Acoustic and perceptual analyses, Journal of the Acoustical Society of America, 1988, 84, pp. 917-928.Google ScholarCross Ref

Index Terms

Multimodal system processing in mobile environments

Recommendations

Mutual disambiguation of recognition errors in a multimodel architecture
CHI '99: Proceedings of the SIGCHI conference on Human Factors in Computing Systems

As a new generation of multimodal/media systems begins to define itself, researchers are attempting to learn how to combine different modes into strategically integrated whole systems. In theory, well designed multimodal systems should be able to integrate ...
Read More
A mobile multimodal dialogue system for public transportation navigation evaluated
MobileHCI '06: Proceedings of the 8th conference on Human-computer interaction with mobile devices and services

As the technical capabilities of latest mobile devices are combined with mobile broadband internet access, we are ready to make use of free and natural speech in mobile services by utilizing optional and complementary means of input. In these kinds of ...
Read More
Integration and synchronization of input modes during multimodal human-computer interaction
ReferringPhenomena '97: Referring Phenomena in a Multimedia Context and their Computational Treatment

Our ability to develop robust multimodal systems will depend on knowledge of the natural integration patterns that typify people's combined use of different input modes. To provide a foundation for theory and design, the present research analyzed ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
UIST '00: Proceedings of the 13th annual ACM symposium on User interface software and technology
November 2000
248 pages
ISBN:1581132123
DOI:10.1145/354401
Chairmen:
Mark Ackerman
Univ. of California, Irvine
,
Keith Edwards
Xerox Palo Alto Research Center, Palo Alto, CA
Copyright © 2000 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 November 2000
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
mobile interface design
multimodal architecture
mutual disambiguation
recognition errors
robust performance
speech and pen input
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate842of3,967submissions,21%
Upcoming Conference
UIST '24

Sponsor:

sigchi

sigchi

UIST '24: The 37th Annual ACM Symposium on User Interface Software and Technology

October 13 - 16, 2024

Pittsburgh , PA , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 49
  Total Citations
  View Citations
- 1,472
  Total Downloads
- Downloads (Last 12 months)67
- Downloads (Last 6 weeks)7
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Multimodal system processing in mobile environments

UIST '00: Proceedings of the 13th annual ACM symposium on User interface software and technology

References

Cited By

Index Terms

Recommendations

Mutual disambiguation of recognition errors in a multimodel architecture

A mobile multimodal dialogue system for public transportation navigation evaluated

Integration and synchronization of input modes during multimodal human-computer interaction

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Multimodal system processing in mobile environments

UIST '00: Proceedings of the 13th annual ACM symposium on User interface software and technology

References

Cited By

Index Terms

Recommendations

Mutual disambiguation of recognition errors in a multimodel architecture

A mobile multimodal dialogue system for public transportation navigation evaluated

Integration and synchronization of input modes during multimodal human-computer interaction

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media