با سلام خدمت کاربران در صورتی که با خطای سیستم پرداخت بانکی مواجه شدید از طریق کارت به کارت (6037997535328901 بانک ملی ناصر خنجری ) مقاله خود را دریافت کنید (تا مشکل رفع گردد).
دسته بندی:
داده های بزرگ - big data
سال انتشار:
2016
عنوان انگلیسی مقاله:
Big Data Analytics Integrating a Parallel Columnar DBMS and the R Language
ترجمه فارسی عنوان مقاله:
یکپارچه سازی تحلیل داده های بزرگ یک DBMS ستونی موازی و زبان R
منبع:
IEEE - 2016 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing
نویسنده:
Yiqun Zhang, Carlos Ordonez, Wellington Cabrera
چکیده انگلیسی:
Most research has proposed scalable and parallel
analytic algorithms that work outside a DBMS. On the other
hand, R has become a very popular system to perform machine
learning analysis, but it is limited by main memory and singlethreaded processing. Recently, novel columnar DBMSs have
shown to provide orders of magnitude improvement in SQL
query processing speed, preserving the parallel speedup of rowbased parallel DBMSs. With that motivation in mind, we present
COLUMNAR, a system integrating a parallel columnar DBMS
and R, that can directly compute models on large data sets stored
as relational tables. Our algorithms are based on a combination
of SQL queries, user-defined functions (UDFs) and R calls,
where SQL queries and UDFs compute data set summaries
that are sent to R to compute models in RAM. Since our
hybrid algorithms exploit the DBMS for the most demanding
computations involving the data set, they show linear scalability
and are highly parallel. Our algorithms generally require one
pass on the data set or a few passes otherwise (i.e. fewer passes
than traditional methods). Our system can analyze data sets
faster than R even when they fit in RAM and it also eliminates
memory limitations in R when data sets exceed RAM size. On
the other hand, it is an order of magnitude faster than Spark (a
prominent Hadoop system) and a traditional row-based DBMS.
Keywords: Mathematical model | Computational modeling | Random access memory | Data models | Load modeling | Layout | Numerical models
قیمت: رایگان
توضیحات اضافی:
تعداد نظرات : 0