Data Mining Strategies for Real-Time Control in New York City
استراتژی داده کاوی برای کنترل زمان واقعی در شهر نیویورک-2105
The Data Mining System (DMS) at New York City Department of Transportation (NYCDOT) mainly consists of four database systems for traffic and pedestrian/bicycle volumes, crash data, and signal timing plans as well as the Midtown in Motion (MIM) systems which are used as part of the NYCDOT Intelligent Transportation System (ITS) infrastructure. These database and control systems are operated by different units at NYCDOT as an independent database or operation system. New York City experiences heavy traffic volumes, pedestrians and cyclists in each Central Business District (CBD) area and along key arterial systems. There are consistent and urgent needs in New York City for real-time control to improve mobility and safety for all users of the street networks, and to provide a timely response and management of random incidents. Therefore, it is necessary to develop an integrated DMS for effective real-time control and active transportation management (ATM) in New York City. This paper will present new strategies for New York City suggesting the development of efficient and cost-effective DMS, involving: 1) use of new technology applications such as tablets and smartphone with Global Positioning System (GPS) and wireless communication features for data collection and reduction; 2) interface development among existing database and control systems; and 3) integrated DMS deployment with macroscopic and mesoscopic simulation models in Manhattan. This study paper also suggests a complete data mining process for real-time control with traditional static data, current real timing data from loop detectors, microwave sensors, and video cameras, and new real-time data using the GPS data. GPS data, including using taxi and bus GPS information, and smartphone applications can be obtained in all weather conditions and during anytime of the day. GPS data and smartphone application in NYCDOT DMS is discussed herein as a new concept. © 2014 The Authors. Published by Elsevier B.V. Selection and peer-review under responsibility of Elhadi M. Shakshu Keywords: Data Mining System (DMS), New York City, real-time control, active transportation management (ATM), GPS data
MISS-D: A fast and scalable framework of medical image storage service based on distributed file system
MISS-D: یک چارچوب سریع و مقیاس پذیر از خدمات ذخیره سازی تصویر پزشکی بر اساس سیستم فایل توزیع شده-2020
Background and Objective Processing of medical imaging big data is deeply challenging due to the size of data, computational complexity, security storage and inherent privacy issues. Traditional picture archiving and communication system, which is an imaging technology used in the healthcare industry, generally uses centralized high performance disk storage arrays in the practical solutions. The existing storage solutions are not suitable for the diverse range of medical imaging big data that needs to be stored reliably and accessed in a timely manner. The economical solution is emerging as the cloud computing which provides scalability, elasticity, performance and better managing cost. Cloud based storage architecture for medical imaging big data has attracted more and more attention in industry and academia. Methods This study presents a novel, fast and scalable framework of medical image storage service based on distributed file system. Two innovations of the framework are introduced in this paper. An integrated medical imaging content indexing file model for large-scale image sequence is designed to adapt to the high performance storage efficiency on distributed file system. A virtual file pooling technology is proposed, which uses the memory-mapped file method to achieve an efficient data reading process and provides the data swapping strategy in the pool. Result The experiments show that the framework not only has comparable performance of reading and writing files which meets requirements in real-time application domain, but also bings greater convenience for clinical system developers by multiple client accessing types. The framework supports different user client types through the unified micro-service interfaces which basically meet the needs of clinical system development especially for online applications. The experimental results demonstrate the framework can meet the needs of real-time data access as well as traditional picture archiving and communication system. Conclusions This framework aims to allow rapid data accessing for massive medical images, which can be demonstrated by the online web client for MISS-D framework implemented in this paper for real-time data interaction. The framework also provides a substantial subset of features to existing open-source and commercial alternatives, which has a wide range of potential applications.
Keywords: Hadoop distributed file system | Data packing | Memory mapping file | Message queue | Micro-service | Medical imaging
Challenges and recommended technologies for the industrial internet of things: A comprehensive review
چالش ها و فن آوری های پیشنهادی برای اینترنت اشیا صنعتی: مرور جامع-2020
Physical world integration with cyber world opens the opportunity of creating smart environments; this new paradigm is called the Internet of Things (IoT). Communication between humans and objects has been extended into those between objects and objects. Industrial IoT (IIoT) takes benefits of IoT communications in business applications focusing in interoperability between machines (i.e., IIoT is a subset from the IoT). Number of daily life things and objects connected to the Internet has been in increasing fashion, which makes the IoT be the dynamic network of networks. Challenges such as heterogeneity, dynamicity, velocity, and volume of data, make IoT services produce inconsistent, inaccurate, incomplete, and incorrect results, which are critical for many applications especially in IIoT (e.g., health-care, smart transportation, wearable, finance, industry, etc.). Discovering, searching, and sharing data and resources reveal 40% of IoT benefits to cover almost industrial applications. Enabling real-time data analysis, knowledge extraction, and search techniques based on Information Communication Technologies (ICT), such as data fusion, machine learning, big data, cloud computing, blockchain, etc., can reduce and control IoT and leverage its value. This research presents a comprehensive review to study state-of-the-art challenges and recommended technologies for enabling data analysis and search in the future IoT presenting a framework for ICT integration in IoT layers. This paper surveys current IoT search engines (IoTSEs) and presents two case studies to reflect promising enhancements on intelligence and smartness of IoT applications due to ICT integration.
Keywords: Industrial IoT (IIoT) | Searching and indexing | Blockchain | Big data | Data fusion Machine learning | Cloud and fog computing
A method of NC machine tools intelligent monitoring system in smart factories
روش ابزار و ماشین آلات NC سیستم نظارت هوشمند در کارخانه های هوشمند-2020
The construction of effectual connection to bridge the gap between physical machine tools and upper software applications is one of the inherent requirements for smart factories. The difficulties in this issue lies in the lack of effective and appropriate means for real-time data acquisition, storage and processing in monitoring and the post workflows. The rapid advancements in Internet of things (IoT) and information technology have made it possible for the realization of this scheme, which have become an important module of the concepts such as “Industry 4.0”, etc. In this paper, a framework of bi-directional data and control flows between various machine tools and upper-level software system is proposed, within which several key stumbling blocks are presented, and corresponding solutions are subsequently deeply investigated and analyzed. Through monitoring manufacturing big data, potential essential information are extracted, providing useful guides for practical production and enterprise decision-making. Based on the integrated model, an NC machine tool intelligent monitoring and data processing system in smart factories is developed. Typical machine tools, such as Siemens series, are the main objects for investigation. The system validates the concept and performs well in the complex manufacturing environment, which will be a beneficial attempt and gain its value in smart factories..
Keywords: CNC | Monitoring system | Data analysis | Machine tool | Smart factory
Towards DNA based data security in the cloud computing environment
به سمت امنیت داده های مبتنی بر DNA در محیط محاسبات ابری-2020
Nowadays, data size is increasing day by day from gigabytes to terabytes or even petabytes, mainly because of the evolution of a large amount of real-time data. Most of the big data is transmitted through the internet and they are stored on the cloud computing environment. As cloud computing provides internet-based services, there are many attackers and malicious users. They always try to access user’s confidential big data without having the access right. Sometimes, they replace the original data by any fake data. Therefore, big data security has become a significant concern recently. Deoxyribonucleic Acid (DNA) computing is an advanced emerged field for improving data security, which is based on the biological concept of DNA. A novel DNA based data encryption scheme has been proposed in this paper for the cloud computing environment. Here, a 1024-bit secret key is generated based on DNA computing, user’s attributes and Media Access Control (MAC) address of the user, and decimal encoding rule, American Standard Code for Information Interchange (ASCII) value, DNA bases and complementary rule are used to generate the secret key that enables the system to protect against many security attacks. Experimental results, as well as theoretical analyses, show the efficiency and effectivity of the proposed scheme over some well-known existing schemes.
Keywords: Cloud computing | DNA computing | Big data security | MAC address | Complementary rule | CloudSim
PRAISE-HK: A personalized real-time air quality informatics system for citizen participation in exposure and health risk management
PRAISE-HK: یک سیستم انفورماتیک شخصی با کیفیت هوا در زمان واقعی برای مشارکت شهروندان در معرض خطر و مدیریت ریسک سلامت-2020
Exposure to air pollutants causes a range of adverse health effects. These harmful effects occur whenever and wherever people come into direct contact with air pollution. Therefore, individual actions that reduce the frequency, duration, and severity of personal contact with air pollution can reduce health risks. We developed a system that empowers the public with personalized information on air quality and exposure health risk. This system, the Personalised Real-Time Air Quality Informatics System for Exposure – Hong Kong (PRAISE-HK, http://praise.ust.hk/), is embodied in an interactive mobile application. PRAISE-HK is based on real-time data on emissions, high resolution urban morphology, meteorology, physical and chemical processes affecting pollutant transport and transformations, extensive measurements of air pollution concentrations in typical locations such as homes, schools, offices, and transportation, and big data integration of sensor monitoring to accurately estimate current and short-term forecasted street-level air quality. The street-level air quality simulation has been validated against reference monitoring data. Ongoing and planned future enhancements to PRAISE-HK include prediction of personal exposure and health response. PRAISE-HK is an example of the use of collective intelligence in a smart city to engage citizens in learning about and managing their own exposure to air pollution.
Keywords: Air pollution | Personalized exposure | Individual health sensitivity | Citizen engagement
مروری بر تجمیع دستگاه های مدل سازی اطلاعات ساختمانی (BIM) و اینترنت اشیاء (IoT): وضعیت کنونی و روند آینده
سال انتشار: 2019 - تعداد صفحات فایل pdf انگلیسی: 13 - تعداد صفحات فایل doc فارسی: 56
تجمیع مدل سازی اطلاعات ساختمانی (BIM) با داده های زمان واقعی(بلادرنگ) دستگاه های اینترنت اشیاء (IoT)، نمونه قوی را برای بهبود ساخت وساز و بهره وری عملیاتی ارائه می دهد. اتصال جریان-های داده های زمان واقعی که بر گرفته از مجموعه هایی از شبکه های حسگرِ اینترنت اشیاء (که این جریان های داده ای، به سرعت در حال گسترش هستند) می باشند، با مدل های باکیفیت BIM، در کاربردهای متعددی قابل استفاده می باشد. با این حال، پژوهش در زمینه ی تجمیع BIM و IOT هنوز در مراحل اولیه ی خود قرار دارد و نیاز است تا وضعیت فعلی تجمیع دستگاه های BIM و IoT درک شود. این مقاله با هدف شناسایی زمینه های کاربردی نوظهور و شناسایی الگوهای طراحی رایج در رویکردی که مخالف با تجمیع دستگاه BIM-IoT می باشد، مرور جامعی در این زمینه انجام می دهد و به بررسی محدودیت های حاضر و پیش بینی مسیرهای تحقیقاتی آینده می پردازد. در این مقاله، در مجموع، 97 مقاله از 14 مجله مربوط به AEC و پایگاه داده های موجود در صنایع دیگر (در دهه گذشته)، مورد بررسی قرار گرفتند. چندین حوزه ی رایج در این زمینه تحت عناوین عملیات ساخت-وساز و نظارت، مدیریت ایمنی و بهداشت، لجستیک و مدیریت ساختمان، و مدیریت تسهیلات شناسایی شدند. نویسندگان، 5 روش تجمیع را همراه با ذکر توضیحات، نمونه ها و بحث های مربوط به آنها به طور خلاصه بیان کرده اند. این روش های تجمیع از ابزارهایی همچون واسط های برنامه نویسی BIM، پایگاه داده های رابطه ای، تبدیل داده های BIM به پایگاه داده های رابطه ای با استفاده از طرح داده های جدید، ایجاد زبان پرس وجوی جدید، فناوری های وب معنایی و رویکردهای ترکیبی، استفاده می کنند. براساس محدودیت های مشاهده شده، با تمرکز بر الگوهای معماری سرویس گرا (SOA) و راهبردهای مبتنی بر وب برای ادغام BIM و IoT، ایجاد استانداردهایی برای تجمیع و مدیریت اطلاعات، حل مسئله همکاری و محاسبات ابری، مسیرهای برجسته ای برای تحقیقات آینده پیشنهاد شده است.
کلمه های کلیدی: مدل سازی اطلاعات ساختمانی (BIM) | دستگاه اینترنت اشیاء (IoT) | حسگرها | ساختمان هوشمند | شهر هوشمند | محیط ساخته شده هوشمند | تجمیع.
|مقاله ترجمه شده|
iFusion: Towards efficient intelligence fusion for deep learning from real-time and heterogeneous data
iFusion: به سمت تلفیق اطلاعاتی کارآمد برای یادگیری عمیق از داده های واقعی و ناهمگن-2019
Deep learning has shown great strength in many fields and has allowed people to live more conveniently and intelligently. However, deep learning requires a considerable amount of uniform training data, which introduces difficulties in many application scenarios. On the one hand, in real-time systems, training data are constantly generated, but users cannot immediately obtain this vast amount of training data. On the other hand, training data from heterogeneous sources have different data formats. Therefore, existing deep learning frameworks are not able to train all data together. In this paper, we propose the iFusion framework, which achieves efficient intelligence fusion for deep learning from real-time data and heterogeneous data. For real-time data, we train only newly arrived data to obtain a new discrimination model and fuse the previously trained models to obtain the discrimination result. For heterogeneous data, different types of data are trained separately; then, we fuse the different discrimination models so that it is not necessary to consider heterogeneous data formats. We use a method based on Dempster-Shafer theory (DST) to fuse the discrimination models. We apply iFusion to the deep learning of medical image data, and the results of the experiments show the effectiveness of the proposed method.
Keywords: Information| fusion | Real-time data | Heterogeneous data | Deep learning
Real-time data text mining based on Gravitational Search Algorithm
داده کاوی متن در زمان واقعی بر اساس الگوریتم جستجوی گرانشی-2019
Short messages are one of the milestones on the web especially on social media (SM). Due to the widespread circulation of SM, it already turns into excessively painful capturing outmost relevant and significant information for certain users. One of the main motivations of this work is that many users may need an inclusive brief of all comments without reading the entire list of short messages for deci- sion making. In this work, mining in big social media data is formulated for the first time into a multi- objective optimization (MOO) task to extract the essence of a text. Since some users may demand the brief at any moment, several groups of dissimilar short messages are established based on graph coloring mechanism. Six interesting feature are formalized to exhibit more interactive messages. A Gravitational Search Algorithm (GSA) is employed to satisfy several important objectives for generating a concise sum- mary. The problem was picked by using the Normal Boundary Intersection (NBI) mechanism to trade-offamong different features. Additionally, to satisfy real-time needs, an inventive incremental grouping task is modelled to update the existing colors. From exhaustive experimental results, the proposed approach outperformed other strong comparative methods.
Keywords: Data text mining | Swarm Intelligence | Big-data | Gravitational search algorithm | Normal boundary intersection
Machine Learning based Digital Twin Framework for Production Optimization in Petrochemical Industry
چارچوب دوقلوی دیجیتال مبتنی بر یادگیری ماشین برای بهینه سازی تولید در صنعت پتروشیمی-2019
Digital twins, along with the internet of things (IoT), data mining, and machine learning technologies, offer great potential in the transformation of today’s manufacturing paradigm toward intelligent manufacturing. Production control in petrochemical industry involves complex circumstances and a high demand for timeliness; therefore, agile and smart controls are important components of intelligent manufacturing in the petrochemical industry. This paper proposes a framework and approaches for constructing a digital twin based on the petrochemical industrial IoT, machine learning and a practice loop for information exchange between the physical factory and a virtual digital twin model to realize production control optimization. Unlike traditional production control approaches, this novel approach integrates machine learning and real-time industrial big data to train and optimize digital twin models. It can support petrochemical and other process manufacturing industries to dynamically adapt to the changing environment, respond in a timely manner to changes in the market due to production optimization, and improve economic benefits. Accounting for environmental characteristics, this paper provides concrete solutions for machine learning difficulties in the petrochemical industry, e.g., high data dimensions, time lags and alignment between time series data, and high demand for immediacy. The approaches were evaluated by applying them in the production unit of a petrochemical factory, and a model was trained via industrial IoT data and used to realize intelligent production control based on real-time data. A case study shows the effectiveness of this approach in the petrochemical industry.
Keywords: digital twin | machine learning | internet of things | petrochemical industry | production control optimization