Search In this Thesis
   Search In this Thesis  
العنوان
Advanced techniques in speaker diarization for arabic TV brpadcast /
الناشر
Mohamed Salem Mohamed Elhady ,
المؤلف
Mohamed Salem Mohamed Elhady
هيئة الاعداد
باحث / Mohamed Salem Mohamed Elhady
مشرف / Mohsen Abdelrazeq Rashwan
مشرف / Sherif Mahdy Abdou
مناقش / Mohamed Fathy Abuelyazeed
مناقش / Mohamed Waleed Talaat Fakhr
تاريخ النشر
2017
عدد الصفحات
79 P. :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
هندسة النظم والتحكم
الناشر
Mohamed Salem Mohamed Elhady ,
تاريخ الإجازة
10/2/2017
مكان الإجازة
جامعة القاهرة - كلية الهندسة - Electronics and Communications Engineering
الفهرس
Only 14 pages are availabe for public view

from 98

from 98

Abstract

Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model