Abstract Multimodal speech recognition has proved to be one of the most promising approaches to robust speech recognition, especially when the acoustic signal is corrupted by noise. Because the visual signal is unaffected by acoustic noise, it provides additional information that can enhance recognition accuracy in noisy environments. When the SNR of the acoustic signal is low, visual cues can compensate for the degraded audio and thus significantly improve recognition accuracy. A critical stage in designing a robust speech recognition system is choosing a reliable classification method from the large variety of existing classification techniques. This research introduces an Audio-Visual Speech Recognition (AVSR) model that uses both the audio and the visual speech modality to improve recognition accuracy in clean and noisy environments.