با سلام خدمت کاربران در صورتی که با خطای سیستم پرداخت بانکی مواجه شدید از طریق کارت به کارت (6037997535328901 بانک ملی ناصر خنجری ) مقاله خود را دریافت کنید (تا مشکل رفع گردد).
دسته بندی:
داده های بزرگ - big data
سال انتشار:
2018
عنوان انگلیسی مقاله:
MIA: Metric Importance Analysis for Big Data Workload Characterization
ترجمه فارسی عنوان مقاله:
MIA: تحلیل اهمیت متریک برای خصوصیات بار کار داده های بزرگ
منبع:
IEEE - IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL: 29, NO: 6, JUNE 2018
نویسنده:
Zhibin Yu , Wen Xiong , Lieven Eeckhout , Zhendong Bei , Avi Mendelson, and Chengzhong Xu
چکیده انگلیسی:
Data analytics is at the foundation of both high-quality products and services in modern economies and societies. Big data
workloads run on complex large-scale computing clusters, which implies significant challenges for deeply understanding and
characterizing overall system performance. In general, performance is affected by many factors at multiple layers in the system stack,
hence it is challenging to identify the key metrics when understanding big data workload performance. In this paper, we propose a novel
workload characterization methodology using ensemble learning, called Metric Importance Analysis (MIA), to quantify the respective
importance of workload metrics. By focusing on the most important metrics, MIA reduces the complexity of the analysis without losing
information. Moreover, we develop the MIA-based Kiviat Plot (MKP) and Benchmark Similarity Matrix (BSM) which provide more
insightful information than the traditional linkage clustering based dendrogram to visualize program behavior (dis)similarity. To
demonstrate the applicability of MIA, we use it to characterize three big data benchmark suites: HiBench, CloudRank-D and SZTS. The
results show that MIA is able to characterize complex big data workloads in a simple, intuitive manner, and reveal interesting insights.
Moreover, through a case study, we demonstrate that tuning the configuration parameters related to the important metrics found by MIA
results in higher performance improvements than through tuning the parameters related to the less important ones.
Index Terms: Big data, benchmarking, workload characterization, performance measurement, MapReduce/hadoop
قیمت: رایگان
توضیحات اضافی:
تعداد نظرات : 0