عنوان انگلیسی مقاله:
FPO tree and DP3 algorithm for distributed parallel Frequent Itemsets Mining
ترجمه فارسی عنوان مقاله:
الگوریتم درخت FPO و DP3 برای کاوش موارد متداول توزیع شده موازی
Sciencedirect - Elsevier - Expert Systems With Applications, 140 (2020) 112874: doi:10:1016/j:eswa:2019:112874
Van Quoc Phuong Huynh ∗, Josef Küng
Frequent Itemsets Mining is a fundamental mining model in Data Mining. It supports a vast range of ap- plication fields and can be employed as a key calculation phase in many other mining models such as Association Rules, Correlations, Classifications, etc. Many distributed parallel algorithms have been intro- duced to confront with very large-scale datasets of Big Data. However, the problems of running time and memory scalability still have not had adequate solutions for very large and “hard-to-mined”datasets. In this paper, we propose a distributed parallel algorithm named DP3 ( D istributed P re P ost P lus) which parallelizes the state-of-the-art algorithm PrePost + and operates in Master-Slaves model. Slave machines mine and send local frequent itemsets and support counts to the Master for aggregations. In the case of tremendous numbers of itemsets transferred between the Slaves and Master, the computational load at the Master, therefore, is extremely heavy if there is not the support from our complete FPO tree ( F requent P atterns O rganization) which can provide optimal compactness for light data transfers and highly efficient aggregations with pruning ability. Processing phases of the Slaves and Master are designed for memory scalability and shared-memory parallel in Work-Pool model so as to utilize the computational power of multi-core CPUs. We conducted experiments on both synthetic and real datasets, and the empirical results have shown that our algorithm far outperforms the well-known PFP and other three recently high-performance ones Dist-Eclat, BigFIM, and MapFIM.
Keywords: Frequent Itemsets Mining | Parallel | Distributed | Data Mining | Big Data | Prefix tree