Search In this Thesis
   Search In this Thesis  
العنوان
An efficient index structure for big data /
المؤلف
El-Meiligy, Manar Ali Mahmoud.
هيئة الاعداد
باحث / منار علي محمود علي المليجي
مشرف / علي ابراهيم الدسوقي
مشرف / سالي الغمراوي
مناقش / مفرح محمد سالم
مناقش / إبراهيم محمود محمد الحناوي
الموضوع
Big data. Computers and Systems Engineering.
تاريخ النشر
2021.
عدد الصفحات
p. 99 :
اللغة
الإنجليزية
الدرجة
ماجستير
التخصص
Computer Science (miscellaneous)
تاريخ الإجازة
1/1/2021
مكان الإجازة
جامعة المنصورة - كلية الهندسة - قسم الحاسبات والنظم
الفهرس
Only 14 pages are availabe for public view

from 99

from 99

Abstract

Due to the era of big data with the large amount of data, storing, indexing and querying multi-dimensional big data efficiently become an emerging topic. In recent years, daily production of data is rapidly increases that produces huge volumes of data that crucial need to index efficiently, which is one of common problems, especially in case of multi-dimensional big data. R-tree and its variants have proven to be efficient for indexing multi-dimensional big data. Unfortunately, R-tree suffers from the curse of dimensionality problem. Many researchers continue to use the R-tree in their studies as it is the most famous tree-like structure for indexing multi-dimensional data. However, the main issue of using R-tree is when increasing the dimensions of multi-dimensional data the performance of R-tree decreases rapidly. This thesis proposes solutions of the above-mentioned issues by introducing a new indexing structure called ParISSS. The name of ParISSS is taken from Parallel Indexing System Structure based on Spark, which is an efficient system for indexing multi-dimensional big data. ParISSS introduces six types of computing nodes, the reception-node is used to insert and index data, the normal-node manipulates store indexed data, the resolution-node handles distribute the reception-index to a normal-node, the representative-node receives queries from the user, and the reply-node and check-node are used to send the results to the user. In addition, BR*-tree structure is proposed as a storing structure in ParISSS, to improve the storing and searching processes. Set of algorithms are also introduced to distribute and store indexed data in BR*-tree structure efficiently. Finally, an accuracy evaluation equation is introduced to evaluate the system accuracy. Finally, set of comprehensive experiments are done to evaluate ParISSS and comparing its results with several existing indexing systems. The results obtained show the superiority of ParISSS over others.