Data Mining Strategies for Real-Time Control in New York City
استراتژی داده کاوی برای کنترل زمان واقعی در شهر نیویورک-2105
The Data Mining System (DMS) at New York City Department of Transportation (NYCDOT) mainly consists of four database systems for traffic and pedestrian/bicycle volumes, crash data, and signal timing plans as well as the Midtown in Motion (MIM) systems which are used as part of the NYCDOT Intelligent Transportation System (ITS) infrastructure. These database and control systems are operated by different units at NYCDOT as an independent database or operation system. New York City experiences heavy traffic volumes, pedestrians and cyclists in each Central Business District (CBD) area and along key arterial systems. There are consistent and urgent needs in New York City for real-time control to improve mobility and safety for all users of the street networks, and to provide a timely response and management of random incidents. Therefore, it is necessary to develop an integrated DMS for effective real-time control and active transportation management (ATM) in New York City. This paper will present new strategies for New York City suggesting the development of efficient and cost-effective DMS, involving: 1) use of new technology applications such as tablets and smartphone with Global Positioning System (GPS) and wireless communication features for data collection and reduction; 2) interface development among existing database and control systems; and 3) integrated DMS deployment with macroscopic and mesoscopic simulation models in Manhattan. This study paper also suggests a complete data mining process for real-time control with traditional static data, current real timing data from loop detectors, microwave sensors, and video cameras, and new real-time data using the GPS data. GPS data, including using taxi and bus GPS information, and smartphone applications can be obtained in all weather conditions and during anytime of the day. GPS data and smartphone application in NYCDOT DMS is discussed herein as a new concept. © 2014 The Authors. Published by Elsevier B.V. Selection and peer-review under responsibility of Elhadi M. Shakshu Keywords: Data Mining System (DMS), New York City, real-time control, active transportation management (ATM), GPS data
Abstract depiction of human figures in impressionist art and children’s picture books
چکیده ای از چهره های انسان در هنرهای تجسمی و کتاب های تصویری کودکان-2019
The human figure is important in art. I discuss examples of the abstract depiction of the human figure, from both impressionist painting and children’s book illustration, and the challenge faced in algorithmically mimicking what human artists can achieve. I demonstrate that there are excellent examples in both genres that provide insight into what a human artist sees as important in providing abstraction at different levels of detail. The challenge lies in the human brain having enormous knowledge about the world and an ability to make fine distinctions about other humans from posture, clothing and expression. This allows a human to make assumptions about human figures from a tiny amount of data, and allows a human artist to take advantage of this when creating art. The question for the computer graphics community is whether and how we could algorithmically mimic what a human artist can do. I provide evidence from both genres to suggest possible ways forward.
Keywords: Impressionism | Abstraction | Representation | Perception
2DToonShade: A stroke based toon shading system
2DToonShade: ضربه بر اساس سیستم سایه toon-2019
We present 2DToonShade: a semi-automatic method for creating shades and self-shadows in cel animation. Besides producing attractive images, shades and shadows provide important visual cues about depth, shapes, movement and lighting of the scene. In conventional cel animation, shades and shadows are drawn by hand. As opposed to previous approaches, this method does not rely on a complex 3D reconstruction of the scene: its key advantages are simplicity and ease of use. The tool was designed to stay as close as possible to the natural 2D creative environment and therefore provides an intuitive and user-friendly interface. Our system creates shading based on hand-drawn objects or characters, given very limited guidance from the user. The method employs simple yet very efficient algorithms to create shading directly out of drawn strokes. We evaluate our system through a subjective user study and provide qualitative comparison of our method versus existing professional tools and recent state of the art.
Keywords: Toon shading | Cel shading | Hand-drawn animation | Image-based rendering | Non-photorealistic-rendering
Shape analysis of 3D nanoscale reconstructions of brain cell nuclear envelopes by implicit and explicit parametric representations
تجزیه و تحلیل 3D بازسازی شکل در مقیاس نانو سلول های مغز پاکت های هسته ای توسط نمایندگی پارامتری ضمنی و صریح-2019
Shape analysis of cell nuclei is becoming increasingly important in biology and medicine. Recent results have identified that large variability in shape and size of nuclei has an important impact on many biological processes. Current analysis techniques involve automatic methods for detection and segmentation of histology and microscopy images, but are mostly performed in 2D. Methods for 3D shape analysis, made possible by emerging acquisition methods capable to provide nanometric-scale 3D reconstructions, are still at an early stage, and often assume a simple spherical shape. We introduce here a framework for analyzing 3D nanoscale reconstructions of nuclei of brain cells (mostly neurons), obtained by semiautomatic segmentation of electron micrographs. Our method considers two parametric representations: the first one customizes the implicit hyperquadrics formulation and it is particularly suited for convex shapes, while the latter considers a spherical harmonics decomposition of the explicit radial representation. Point clouds of nuclear envelopes, extracted from image data, are fitted to the parameterized models which are then used for performing statistical analysis and shape comparisons. We report on the analysis of a collection of 121 nuclei of brain cells obtained from the somatosensory cortex of a juvenile rat.
Keywords: Shape analysis | Nanoscale cell reconstruction | Nuclear envelopes | Cell classification
Unsupervised by any other name: Hidden layers of knowledge production in artificial intelligence on social media
بدون نظارت با هر نام دیگری: لایه های پنهان تولید دانش در هوش مصنوعی در رسانه های اجتماعی-2019
Artificial Intelligence (AI) in the form of different machine learning models is applied to Big Data as a way to turn data into valuable knowledge. The rhetoric is that ensuing predictions work well—with a high degree of autonomy and automation. We argue that we need to analyze the process of applying machine learning in depth and highlight at what point human knowledge production takes place in seemingly autonomous work. This article reintroduces classification theory as an important framework for understanding such seemingly invisible knowledge production in the machine learning development and design processes. We suggest a framework for studying such classification closely tied to different steps in the work process and exemplify the framework on two experiments with machine learning applied to Facebook data from one of our labs. By doing so we demonstrate ways in which classification and potential discrimination take place in even seemingly unsupervised and autonomous models. Moving away from concepts of non-supervision and autonomy enable us to understand the underlying classificatory dispositifs in the work process and that this form of analysis constitutes a first step towards governance of artificial intelligence.
Keywords: Artificial intelligence | machine learning | classification | social media| Facebook | discrimination | bias
When data is capital: Datafication, accumulation, and extraction
وقتی داده سرمایه است: داده سازی، انباشت و استخراج-2019
The collection and circulation of data is now a central element of increasingly more sectors of contemporary capitalism. This article analyses data as a form of capital that is distinct from, but has its roots in, economic capital. Data collection is driven by the perpetual cycle of capital accumulation, which in turn drives capital to construct and rely upon a universe in which everything is made of data. The imperative to capture all data, from all sources, by any means possible influences many key decisions about business models, political governance, and technological development. This article argues that many common practices of data accumulation should actually be understood in terms of data extraction, wherein data is taken with little regard for consent and compensation. By understanding data as a form capital, we can better analyse the meaning, practices, and implications of datafication as a political economic regime.
Keywords: Big Data | digital capitalism | value | political economy | Marx | Bourdieu
The social imaginaries of data activism
تصورات اجتماعی فعالانه داده ها-2019
Data activism, promoting new forms of civic and political engagement, has emerged as a response to problematic aspects of datafication that include tensions between data openness and data ownership, and asymmetries in terms of data usage and distribution. In this article, we discuss MyData, a data activism initiative originating in Finland, which aims to shape a more sustainable citizen-centric data economy by means of increasing individuals’ control of their personal data. Using data gathered during long-term participant-observation in collaborative projects with data activists, we explore the internal tensions of data activism by first outlining two different social imaginaries – technological and socio-critical – within MyData, and then merging them to open practical and analytical space for engaging with the socio-technical futures currently in the making. While the technological imaginary favours data infrastructures as corrective measures, the socio-critical imaginary questions the effectiveness of technological correction. Unpacking them clarifies the kinds of political and social alternatives that different social imaginaries ascribe to the notions underlying data activism, and highlights the need to consider the social structures in play. The more far-reaching goal of our exercise is to provide practical and analytical resources for critical engagement in the context of data activism. By merging technological and socio-critical imaginaries in the work of reimagining governing structures and knowledge practices alongside infrastructural arrangements, scholars can depart from the most obvious forms of critique, influence data activism practice, and formulate data ethics and data futures.
Keywords: Datafication | social imaginary | data activism | MyData | data ethics | socio-technical futures
Conceptual frameworks for social and cultural Big Data analytics: Answering the epistemological challenge
چارچوب مفهومی برای تجزیه و تحلیل داده های بزرگ اجتماعی و فرهنگی : پاسخ به چالش معرفت شناختی-2019
This paper aims to contribute to the development of tools to support an analysis of Big Data as manifestations of social processes and human behaviour. Such a task demands both an understanding of the epistemological challenge posed by the Big Data phenomenon and a critical assessment of the offers and promises coming from the area of Big Data analytics. This paper draws upon the critical social and data scientists’ view on Big Data as an epistemological challenge that stems not only from the sheer volume of digital data but, predominantly, from the proliferation of the narrow-technological and the positivist views on data. Adoption of the social-scientific epistemological stance presupposes that digital data was conceptualised as manifestations of the social. In order to answer the epistemological challenge, social scientists need to extend the repertoire of social scientific theories and conceptual frameworks that may inform the analysis of the social in the age of Big Data. However, an ‘epistemological revolution’ discourse on Big Data may hinder the integration of the social scientific knowledge into the Big Data analytics.
Keywords: Social and cultural Big Data analytics | social science | computational science | epistemological challenge | social media
Medical research, Big Data and the need for privacy by design
تحقیقات پزشکی، داده های بزرگ و نیاز به حریم خصوصی توسط طراحی-2019
Medical research data is sensitive personal data that needs to be protected from unauthorized access and unintentional disclosure. In a research setting, sharing of (big) data within the scientific community is necessary in order to make progress and maximize scientific benefits derived from valuable and costly data. At the same time, convincingly protecting the privacy of people (patients) participating in medical research is a prerequisite for maintaining trust and willingness to share. In this commentary, we will address this issue and the pitfalls involved in the context of the PEP project1 that provides the infrastructure for the Personalized Parkinson’s Project,2 a large cohort study on Parkinson’s disease from Radboud University Medical Center (Radboudumc), in cooperation with Verily life Sciences, an Alphabet subsidiary.
Keywords:Big Data | GDPR compliance | informed consent | medical cohort study | polymorphic encryption | privacy by design
Experiments with a data-public: Moving digital methods into critical proximity with political practice
آزمایشات با داده های عمومی: انتقال روش های دیجیتال به نزدیکی بحرانی با عمل سیاسی-2019
Making publics visible through digital traces has recently generated interest by practitioners of public engagement and scholars within the field of digital methods. This paper presents an experiment in moving such methods into critical proximity with political practice and discusses how digital visualizations of topical debates become appropriated by actors and hardwired into existing ecologies of publics and politics. Through an experiment in rendering a specific data-public visible, it shows how the interplay between diverse conceptions of the public as well as the specific platforms and data invoked, resulted in a situated affordance-space that allowed specific renderings take shape, while disadvantaging others. Furthermore, it argues that several accepted tropes in the literatures of digital methods ended up being problematic guidelines in this space. Among these is the prescription to shown heterogeneity by pushing back at established media logics.
Keywords: Digital methods | public engagement | pragmatism | controversy-mapping | critical proximity | multiplicity