Author: Radhya Sahal Yahya Abdullah/ Title: Multi-query optimizationfor big data /

Search In this Thesis

العنوان

Multi-query optimizationfor big data /

الناشر

Radhya Sahal Yahya Abdullah ,

المؤلف

Radhya Sahal Yahya Abdullah

تاريخ النشر

2018

عدد الصفحات

140 Leaves :

الفهرس

Only 14 pages are availabe for public view

from

159

from

159

Abstract

Multi-query optimization in Big Data becomes a promising research direction due to the popularity of massive data analytical systems (e.g., MapReduce, Flink). The multi-query is translated into multiple jobs. These multiple jobs are routinely submitted with similar taskswhich are included in different jobs to the underling Big Data analytical systems. These similar tasks are considered complicated and computation overhead. Therefore, there are some existing techniques that have been proposed for exploiting sharing tasks in Big Data multi-query optimization (e.g., MRShare and Relaxed MRShare). These techniques are heavily tailored relaxed optimizing factors of fine-grained reused-based opportunities. In accordance with Big Data multi-query optimization, the existing fine-grained techniques are only concerned with equal tuples size and uniform data distribution. These issues are not applicable to the real-world distributed applications which depend on coarse-grained reused-based opportunities, such as non-equal tuples size and non-uniform data distribution. These two issues have received more-attention in Big Data multi-query optimization, to minimize the data read from or written back to Big Data infrastructures