Skip to main content
Top
Gepubliceerd in: Journal of Autism and Developmental Disorders 9/2023

12-07-2022 | S.I. :Impact of Assistive Technology in Special Education

RETRACTED ARTICLE: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities

Auteurs: Saswati Debnath, Pinki Roy, Suyel Namasudra, Ruben Gonzalez Crespo

Gepubliceerd in: Journal of Autism and Developmental Disorders | Uitgave 9/2023

Log in om toegang te krijgen
share
DELEN

Deel dit onderdeel of sectie (kopieer de link)

  • Optie A:
    Klik op de rechtermuisknop op de link en selecteer de optie “linkadres kopiëren”
  • Optie B:
    Deel de link per e-mail

Abstract

Education is a fundamental right that enriches everyone’s life. However, physically challenged people often debar from the general and advanced education system. Audio-Visual Automatic Speech Recognition (AV-ASR) based system is useful to improve the education of physically challenged people by providing hands-free computing. They can communicate to the learning system through AV-ASR. However, it is challenging to trace the lip correctly for visual modality. Thus, this paper addresses the appearance-based visual feature along with the co-occurrence statistical measure for visual speech recognition. Local Binary Pattern-Three Orthogonal Planes (LBP-TOP) and Grey-Level Co-occurrence Matrix (GLCM) is proposed for visual speech information. The experimental results show that the proposed system achieves 76.60 % accuracy for visual speech and 96.00 % accuracy for audio speech recognition.
Literatuur
go back to reference Galatas, G., et al. (2012). Audio-visual speech recognition using depth information from the Kinect in noisy video conditions. In Proceedings of International Conference on Pervasive Technologies Related to Assistive Environments, ACM, pp. 1–4 https://doi.org/10.1145/2413097.2413100 Galatas, G., et al. (2012). Audio-visual speech recognition using depth information from the Kinect in noisy video conditions. In Proceedings of International Conference on Pervasive Technologies Related to Assistive Environments, ACM, pp. 1–4 https://​doi.​org/​10.​1145/​2413097.​2413100
go back to reference Gao, J., et al. (2021). Decentralized federated learning framework for the neighborhood: A case study on residential building load forecasting. In Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, ACM pp. 453–459. https://doi.org/10.1145/3485730.3493450 Gao, J., et al. (2021). Decentralized federated learning framework for the neighborhood: A case study on residential building load forecasting. In Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, ACM pp. 453–459. https://​doi.​org/​10.​1145/​3485730.​3493450
go back to reference Ivanko, D., et al. (2021). An experimental analysis of different approaches to audio-visual speech recognition and lip-reading. In Proceedings of 15th International Conference on Electromechanics and Robotics, Springer, Singapore, pp. 197–209. https://doi.org/10.1007/978-981-15-5580-016 Ivanko, D., et al. (2021). An experimental analysis of different approaches to audio-visual speech recognition and lip-reading. In Proceedings of 15th International Conference on Electromechanics and Robotics, Springer, Singapore, pp. 197–209. https://​doi.​org/​10.​1007/​978-981-15-5580-016
go back to reference Kuncheva, I. (2004). Combining pattern classifiers: Methods and algorithms. Wiley. Kuncheva, I. (2004). Combining pattern classifiers: Methods and algorithms. Wiley.
go back to reference Mohanaiah, P., et al. (2013). Image texture feature extraction using GLCM approach. International Journal of Scientific and Research Publications,3(5), 85. Mohanaiah, P., et al. (2013). Image texture feature extraction using GLCM approach. International Journal of Scientific and Research Publications,3(5), 85.
go back to reference Revathi, A., & Venkataramani, Y. (2009). Perceptual features based isolated digit and continuous speech recognition using iterative clustering approach networks and communication. In First International Conference on Networks & Communications, NetCoM., IEEE, Chennai. https://doi.org/10.1109/NetCoM.2009.32 Revathi, A., & Venkataramani, Y. (2009). Perceptual features based isolated digit and continuous speech recognition using iterative clustering approach networks and communication. In First International Conference on Networks & Communications, NetCoM., IEEE, Chennai. https://​doi.​org/​10.​1109/​NetCoM.​2009.​32
Metagegevens
Titel
RETRACTED ARTICLE: Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
Auteurs
Saswati Debnath
Pinki Roy
Suyel Namasudra
Ruben Gonzalez Crespo
Publicatiedatum
12-07-2022
Uitgeverij
Springer US
Gepubliceerd in
Journal of Autism and Developmental Disorders / Uitgave 9/2023
Print ISSN: 0162-3257
Elektronisch ISSN: 1573-3432
DOI
https://doi.org/10.1007/s10803-022-05654-4

Andere artikelen Uitgave 9/2023

Journal of Autism and Developmental Disorders 9/2023 Naar de uitgave