Investigating consumer preferences on product designs by analyzing opinions from social networks using evidential reasoning
بررسی ترجیحات مصرف کننده در طرح های محصول با تجزیه و تحلیل نظرات شبکه های اجتماعی با استفاده از استدلال مشهود-2020
The rapid growth of e-commerce and social networking sites has created various challenges for the extraction of user-generated content (UGC). In the era of big data, customer opinions from social media are utilized for investigating consumer preferences to support product redesigns. Opinion mining, including the various automatic text classification algorithms using sentiment analysis is a capable tool to deal with a large amount of comments on the social networking sites. In which, sentiment analysis is used to determine the contextual polarity within a comment by searching sentimental words. However, the inconsistency on choosing the sentiment words leads to the inaccurate interpretation of the opinion strength of sentiment words. An approach to summarize the UGC from social networking media using fuzzy and ER without the need to review all the comments is proposed in this paper. The inaccuracy on determination of the polarity of sentiment words and corresponding opinion strengths is rectified by fuzzy approximation and ER. The result is presented in ranking therefore the effort for result interpretation significantly reduced. The incorporation of sentiment analysis with ER to analyze the UGC for product designs is a new attempt in investigating consumer preferences. The proposed approach is shown to be handy, sufficient, and cost effective for the product design and re-design, particularly in the preliminary stage. This project can be further extended by employing alternative fuzzy approximate techniques in the fuzzy-ER approach to support the sentiment analysis to enhance the accuracy of sentiment values for determining the distribution assessments of ER.
Keywords: Opinion mining | Sentiment analysis | Evidential reasoning | Consumer preferences | Product design
Prediction of the ground temperature with ANN, LS-SVM and fuzzy LS-SVM for GSHP application
پیش بینی دمای زمین با شبکه های عصبی، LS-SVM و LS-SVM فازی برای استفاده GSHP-2020
Ground source heat pump (GSHP) system has received more and more attentions for its energy-conserving and environmental-friendly properties. Acquisition of the undisturbed ground temperature is the prerequisite for designing of GSHP system. Measurement by burying temperature sensors underground is the conventional means for obtaining the ground temperature data. However, this way is usually time consuming and high investment, and also easily encounter with certain technical difficulties. The rapid development of intelligent computation algorithm provides solutions for many realistic difficult problems. Basing on a great number of the measured data of the ground temperature from two boreholes with 100m depth located in Chongqing, ground temperature prediction models basing on artificial neural network (ANN) and support vector machine based on least square (LS-SVM) are established, respectively. And then, two kinds of validation works, i.e., holdout validation and k-fold validation are conducted toward the two models, respectively. Furthermore, a new method that correlating fuzzy theory with LS-SVM is proposed to solve the big computation burden problem encountered by LS-SVM model. By comparing with the above two models, it is concluded that the newly proposed model can not only improve the calculation speed obviously but also be able to promote the prediction accuracy, especially superior to the single LS-SVM model.
Keywords: Ground temperature | Fuzzy | Support vector machine | Ground source heat pump
Application of smart safety training and education in network teaching management
کاربرد آموزش ایمنی هوشمند و آموزش در مدیریت آموزش شبکه-2020
Aiming at the problems of poor resource scheduling and low degree of information fusion in the traditional network management method of intelligent security training and education optimization, an intelligent security training and education optimization network management model based on big data mining is proposed. Building intelligent safety training and education of big data fusion analysis model, using the method of association rules mining, complete the intelligent safety training and education statistics analysis, under the Internet environment using quantitative sensing fusion tracking method, the network teaching management information fusion processing, build large data information scheduling model based on network teaching management, fuzzy information fusion method to reconstruct 3 d information of the network teaching management, to establish the network teaching management big data spectral analysis model, the introduction of phase space reconstruction method, the network resource scheduling optimization of teaching management. The experimental results show that the proposed method has better resource scheduling performance, higher degree of information fusion, and can improve the ability of intelligent security training and education management.
Keywords: Smart safety training | Education | Network teaching management | Big data | Integration | Resource scheduling
Managing minority opinions in micro-grid planning by a social network analysis-based large scale group decision making method with hesitant fuzzy linguistic information
مدیریت نظرات اقلیت ها در برنامه ریزی خرد شبکه ای با استفاده از روش تصمیم گیری گروهی مقیاس بزرگ مبتنی بر تحلیل شبکه های اجتماعی با اطلاعات زبانی فازی مردد-2020
The growth of global electricity demand has put forward higher requirements for power distribution networks. The high cost of the large-scale power system and the voice for the use of renewable energy impel the birth of the micro-grid which plays a complementary role in the power generation of large-scale power system. The construction of micro-grid planning is complex and many stakeholders’ opinions should be considered for a comprehensive evaluation. Furthermore, the development of social big data techniques, such as e-marketplace and e-democracy, makes experts have social relationships among them. This study aims to develop a consensus model to manage minority opinions for largescale group decision making with social network analysis for micro-grid planning. To deal with the vague and uncertain features in complex micro-grid planning problems, experts are supposed to use hesitant fuzzy linguistic term sets to express their opinions. A social network analysis-based clustering method is introduced to classify experts. Besides, in a large-scale group decision making problem, the opinions of experts should be fully considered, especially the minority opinions. This model considers the minority opinions in a micro-grid planning problem and provides an approach to manage these opinions. Finally, we use an illustrative example concerning the micro-grid planning decision making in Ali district in Tibet to demonstrate the effectiveness and practicability of the proposed model.
Keywords: Micro-grid planning | Large-scale group decision making | Social network analysis | Minority opinions | Hesitant fuzzy linguistic term sets | Consensus
Veracity handling and instance reduction in big data using interval type-2 fuzzy sets
کنترل صحت و کاهش نمونه در داده های بزرگ با استفاده از بازها های مجموعه های فازی نوع 2-2020
Within the aspect of big data, veracity refers to the existing uncertainty in the dataset. The continuous flow of unstructured data with unwanted noise may bring abnormality in the dataset making them unusable. In this paper, we propose a novel method to handle the veracity characteristic of the big data using the concept of footprint of uncertainty (FOU) in interval type-2 fuzzy sets (IT2 FSs). The proposed method helps in handling the veracity issue in big data and reduces the instances to a manageable extent. We have compared the results with the existing clustering based methods and examined the relationship between the clusters and the FOUs by comparing their centroids and defuzzified values. To scrutinize the validity of our results, we have also performed a number of additional experiments by appending extra instances to the datasets. To check its consistency and efficacy, the proposed methodology is assessed from three different aspects. Experimental result validates that the proposed method can suitably handle the veracity issue in big datasets and is efficient in reducing the instances.
Keywords: Instance reduction | Big data veracity | Interval type-2 fuzzy sets | Cluster centroid | Footprint of uncertainty
Prediction of greenhouse gas emissions from Ontario’s solid waste landfills using fuzzy logic based model
پیش بینی انتشار گازهای گلخانه ای از محل های دفع زباله جامد انتاریو با استفاده از مدل مبتنی بر منطق فازی-2020
In this study, multi-criteria assessment technique is used to predict the methane generation from large municipal solid waste landfills in Ontario, Canada. Although a number of properties determine the gas generation from landfills, these parameters are linked with empirical relationships making it difficult to generate precise information concerning gas production. Moreover, available landfill data involve sources of uncertainty and are mostly insufficient. To fully characterize the chemistry of reaction and predict gas generation volumes from landfills, a fuzzy-based model is proposed having seven input parameters. Parameters were identified in a linguistic form and linked by 19 IF-THEN statements. When compared to measured values, results of the fuzzy based model showed good prediction of landfill gas generation rates. Also, when compared to other first order decay and second order decay models like LandGEM, the fuzzy based model showed better results. When plotting the LandGEM and Fuzzy model values to the actual measured data, the fuzzy model resulted in a better fit to actual data than the LandGEM model with a coefficient of determination R2 of 0.951 for fuzzy model versus 0.804 for LandGEM model. The results show how multi-criteria assessment technique can be used in modelling of complicated processes that take place within the landfills and somehow accurately predicting the landfill gas generation rate under different operating conditions
Keywords: Municipal solid waste | Landfill gas | Life-cycle assessment | Waste to energy | Greenhouse gas emissions | Fuzzy model
Dual incremental fuzzy schemes for frequent itemsets discovery in streaming numeric data
طرح های فازی افزایشی دوگانه برای کشف مکرر آیتم ها در جریان داده های عددی-2020
Discovering frequent itemsets is essential for finding association rules, yet too computa- tional expensive using existing algorithms. It is even more challenging to find frequent itemsets upon streaming numeric data. The streaming characteristic leads to a challenge that streaming numeric data cannot be scanned repetitively. The numeric characteristic requires that streaming numeric data should be pre-processed into itemsets, e.g., fuzzy- set methods can transform numeric data into itemsets with non-integer membership val- ues. This leads to a challenge that the frequency of itemsets are usually not integer. To overcome such challenges, fast methods and stream processing methods have been ap- plied. However, the existing algorithms usually either still need to re-visit some previous data multiple times, or cannot count non-integer frequencies. Those existing algorithms re-visiting some previous data have to sacrifice large memory spaces to cache those pre- vious data to avoid repetitive scanning. When dealing with big streaming data nowadays, such large-memory requirement often goes beyond the capacity of many computers. Those existing algorithms unable to count non-integer frequencies would be very inaccurate in estimating the non-integer frequencies of frequent itemsets if used with integer approxi- mation of frequency-counting. To solve the aforementioned issues, in this paper we propose two incremental schemes for frequent itemsets discovery that are capable to work efficiently with streaming nu- meric data. In particular, they are able to count non-integer frequency without re-visiting any previous data. The key of our schemes to the benefits in efficiency is to extract essen- tial statistics that would occupy much less memory than the raw data do for the ongoing streaming data. This grants the advantages of our schemes 1) allowing non-integer count- ing and thus natural integration with a fuzzy-set discretization method to boost robustness and anti-noise capability for numeric data, 2) enabling the design of a decay ratio for dif- ferent data distributions, which can be adapted for three general stream models: landmark, damped and sliding windows, and 3) achieving highly-accurate fuzzy-item-sets discovery with efficient stream-processing. Experimental studies demonstrate the efficiency and effectiveness of our dual schemes with both synthetic and real-world datasets.
Keywords: Incremental algorithm | Data stream mining | Frequent itemsets | Without re-visiting
The impact of big data on firm performance in hotel industry
تأثیر داده های بزرگ بر عملکرد شرکت در صنعت هتلداری-2020
Big data has increasingly appeared as a frontier of opportunity in enhancing firm performance. However, it still is in early stages of introduction and many enterprises are still un-decisive in its adoption. The aim of this study is to propose a theoretical model based on integration of Human-Organization-Technology fit and Technology- Organization-Environment frameworks to identify the key factors affecting big data adoption and its consequent impact on the firm performance. The significant factors are gained from the literature and the research model is developed. Data was collected from top managers and/or owners of SMEs hotels in Malaysia using online survey questionnaire. Structural Equation Modelling (SEM) is used to assess the developed model and Adaptive Neuro- Fuzzy Inference Systems (ANFIS) technique is used to prioritize adoption factors based on their importance levels. The results showed that relative advantage, management support, IT expertise, and external pressure are the most important factors in the technological, organizational, human, and environmental dimensions. The results further revealed that technology is the most important influential dimension. The outcomes of this study can assist the policy makers, businesses and governments to make well-informed decisions in adopting big data.
Keywords: Firm performance | Big data | Hotel industry | Fuzzy logic | Structural equation modelling
Assessment of traffic congestion with ORESTE method under double hierarchy hesitant fuzzy linguistic environment
ارزیابی تراکم ترافیک با استفاده از روش ORESTE در محیط زبانی فازی با سلسله مراتب مضاعف-2020
With the new generation of information technology development and the promotion of the Internet, local governments turn their attention to the construction of intelligent transportation systems. More and more cities began building intelligent transportation which has been widely used to monitor urban traffic. Experts can evaluate urban traffic congestion based on the information collected from the big data of intelligent transportation. In recent two years, double hierarchy hesitant fuzzy linguistic term set has been widely used to depict explicit evaluation information, which is straightforward and broadspectrum. When evaluating traffic congestion in a city, decision makers can utilize double hierarchy hesitant fuzzy linguistic term sets to express vague information. Moreover, the ORESTE method is an applicative method which can select a reliable alternative by subdividing alternatives and reduce the loss of information in the conversion process. In this paper, we propose a double hierarchy hesitant fuzzy linguistic ORESTE method and a new score function of double hierarchy hesitant fuzzy linguistic term set. The method raises a new perspective to reduce the error from other methods and the new score function derives a robust decision-making result. Then, we apply the double hierarchy hesitant fuzzy linguistic ORESTE method to solve a practical case involving choosing the congested city by evaluating the 5S traffic congestion model. Finally, we compare the double hierarchy hesitant fuzzy linguistic ORESTE method with other methods such as the classical ORESTE method and the double hierarchy hesitant fuzzy linguistic MULTIMOORA to illustrate the advantages of our method.
Keywords: Double hierarchy hesitant fuzzy linguistic | term sets | Double hierarchy hesitant fuzzy linguistic | ORESTE method | Score function | Traffic congestion
A knowledge-based expert system to assess power plant project cost overrun risks
یک سیستم خبره مبتنی بر دانش برای ارزیابی هزینه ریسک بیش ازحد پروژه نیروگاهی-2019
Preventing cost overruns of such infrastructure projects as power plants is a global project management problem. The existing risk assessment methods/models have limitations to address the complicated na- ture of these projects, incorporate the probabilistic causal relationships of the risks and probabilistic data for risk assessment, by taking into account the domain experts’ judgments, subjectivity, and un- certainty involved in their judgments in the decision making process. A knowledge-based expert system is presented to address this issue, using a fuzzy canonical model (FCM) that integrates the fuzzy group decision-making approach (FGDMA) and the Canonical model ( i.e. a modified Bayesian belief network model) . The FCM overcomes: (a) the subjectivity and uncertainty involved in domain experts’ judgment, (b) sig- nificantly reduces the time and effort needed for the domain experts in eliciting conditional probabilities of the risks involved in complex risk networks, and (c) reduces the model development tasks, which also reduces the computational load on the model. This approach advances the applications of fuzzy-Bayesian models for cost overrun risks assessment in a complex and uncertain project environment by addressing the major constraints associated with such models. A case study demonstrates and tests the application of the model for cost overrun risk assessment in the construction and commissioning phase of a power plant project, confirming its ability to pinpoint the most critical risks involved ̶ in this case, the complex- ity of the lifting and rigging heavy equipment, inadequate work inspection and testing plan, inadequate site/soil investigation, unavailability of the resources in the local market, and the contractor’s poor plan- ning and scheduling.
Keywords: Cost overruns | Risk assessment | Power plant projects | Fuzzy logic | Canonical model