دانلود مقاله انگلیسی رایگان:یک رویکرد مبتنی بر امضای گروه برای تشخیص کارایی در پلت فرم داده های بزرگ - 2018
رمضان 2
دانلود مقاله انگلیسی داده های بزرگ رایگان
  • An Ensemble Signature-based Approach for Performance Diagnosis in Big Data Platform An Ensemble Signature-based Approach for Performance Diagnosis in Big Data Platform
    An Ensemble Signature-based Approach for Performance Diagnosis in Big Data Platform

    سال انتشار:

    2018


    عنوان انگلیسی مقاله:

    An Ensemble Signature-based Approach for Performance Diagnosis in Big Data Platform


    ترجمه فارسی عنوان مقاله:

    یک رویکرد مبتنی بر امضای گروه برای تشخیص کارایی در پلت فرم داده های بزرگ


    منبع:

    IEEE -2018 IEEE Symposium on Service-Oriented System Engineering


    نویسنده:

    Hong Kou ،Pengfei Chen


    چکیده انگلیسی:

    The big data platform always suffers from performance problems due to internal impairments (e.g. software bugs) and external impairments (e.g. resource hog). And the situation is exacerbated by the properties of velocity, variety and volume (3Vs) of big data. To recovery the system from performance anomaly, the first step is to find the root causes. In this paper, we propose a novel signature-based performance diagnosis approach to rapidly pinpoint the root causes of performance problems in big data platforms. The performance diagnosis is formalized as a pattern recognition problem. We leverage Maximum Information Criterion (MIC) to express the invariant relationships amongst the performance metrics in the normal state. Each performance problem occurred in the big data platform is signified by a unique binary vector named signature, which consists of a set of violations of MIC invariants. The signatures of multiple performance problems form a signature database. If the Key Performance Indicator (KPI) of the big data application exhibits model drift, our approach can identify the real culprits by retrieving the root causes which have similar signatures to the current performance problem. Moreover, considering the diversity of big data applications, we establish an ensemble approach to treat each application separately. The experiment evaluations in a controlled big data platform show that our approach can pinpoint the real culprits of performance problems in an average 84% precision and 87% recall when one fault occurs, which is better than several state-of-the-art approaches.
    Keywords: performance analysis; data analysis; distributed computing; software performance


    سطح: متوسط
    تعداد صفحات فایل pdf انگلیسی: 10
    حجم فایل: 466 کیلوبایت

    قیمت: رایگان


    توضیحات اضافی:




اگر این مقاله را پسندیدید آن را در شبکه های اجتماعی به اشتراک بگذارید (برای به اشتراک گذاری بر روی ایکن های زیر کلیک کنید)

تعداد نظرات : 0

الزامی
الزامی
الزامی
rss مقالات ترجمه شده rss مقالات انگلیسی rss کتاب های انگلیسی rss مقالات آموزشی