Search In this Thesis
   Search In this Thesis  
العنوان
Mining textual social media and crowdsourced questions /
الناشر
Iman Hassan Abdallah Mahmoud Saleh ,
المؤلف
Iman Hassan Abdallah Mahmoud Saleh
هيئة الاعداد
باحث / Iman Hassan Abdallah Mahmoud Saleh
مشرف / Osman Hegazy
مشرف / Neamat Eltazi
مشرف / Osman Hegazy
تاريخ النشر
2018
عدد الصفحات
91 Leaves :
اللغة
الإنجليزية
الدرجة
الدكتوراه
التخصص
Information Systems
تاريخ الإجازة
14/9/2019
مكان الإجازة
جامعة القاهرة - كلية الحاسبات و المعلومات - Information Systems
الفهرس
Only 14 pages are availabe for public view

from 111

from 111

Abstract

Different types of social media platforms are getting more and more popular nowadays. Community Question Answering (cQA) websites are particularly getting increasingly popular. Such websites allow users to ask questions and find answers to them.Therefore, they can be considered as a valuable source of information. Numerous questions are posted to these websites on a daily basis, forming huge archives of questions. These archives offer a large body of text that can be analyzed and exploited in several text mining tasks. In this thesis, two different research problems related toUserGeneratedQuestions (UGQ{u2019}s), found in cQA websites, are studied. The first research problem is identifying semantic concepts in these questions. Identifying such concepts enables understanding questions and finding answers to them automatically.This is particularly useful in websites where a large number of questions is posted and it can be diffi- cult to answer all of them. In this thesis, we perform a comprehensive study on the use of reranking combined with linguistic information, in the form of syntactic and semantic structures, to find semantic concepts in UGQ{u2019}s. We evaluate our methods and show that techniques presented can outperform the state of the art for that task. The second research problem studied in the thesis is also related to UGQ{u2019}s. Our goal is to organize huge archives of UGQ{u2019}s by organizing tags assigned to questions into communities of semantically related tags. We investigate different ways that can be utilized to find semantic relationships between tags, and the best way to model tags in the vector space to find similarities between them