No abstract available.
Efficient Storage of Big-Data for Real-Time GPS Applications
GPS applications need real-time responsiveness and are location-sensitive. GPS data is time-variant, dynamic and large. Current methods of centralized or distributed storage with static data impose constraints on addressing the real-time requirement of ...
Real Time Routing in Road Networks
Routing in road networks is an old problem with renewed interest. In this work, we focus on how to extend real timeliness to the routing problem. With the increasing availability of rich time dependent data in the form of current traffic, weather ...
Tahoe-LAFS Distributed Storage Service in Community Network Clouds
Community networks are successful large scale, decentralized IP networks, built and operated by citizens for citizens. Cloud computing infrastructures present in today's Internet hardly exist in community networks. But the demand for cloud storage, and ...
Event Pattern Discovery on IDS Traces of Cloud Services
The value of Intrusion Detection System (IDS) traces is based on being able to meaningfully parse the complex data patterns appearing therein as based on the pre-defined intrusion 'detection' rule sets. As IDS traces monitor large groups of servers, ...
A New Approach Based on Intelligent Water Drops Algorithm for Node Selection in Service-Oriented Wireless Sensor Networks
Issues related to Wireless Sensor Networks (WSNs) are inseparable part of concerns in Big Data, due to they can provide a large amount of real-time data to the processing units. A service-oriented wireless sensor network aims to manage the procedure for ...
Automating Deployment of Customized Scientific Data Analytic Environments on Clouds
Cloud computing has become a widely used solution for efficiently provisioning computational and storage resources. Meanwhile, it is essential to provide customizable scientific data analytic platforms for researchers to conduct their personalized data ...
A Domain-Driven, Generative Data Model for Big Pet Store
Generating large amounts of semantically-rich data for testing big data workflows is paramount for scalable performance benchmarking and quality assurance in modern machine-learning and analytics workloads. The most obvious use case for such a ...
Dynamic Workload Balancing for Hadoop MapReduce
Hadoop has two components which are HDFS and MapReduce. HDFS is a distributed file system for storing data for users of Hadoop and MapReduce is the framework that executes jobs from users. Hadoop stores user data based on space utilization of data nodes ...
A Stop Planning Method over Big Traffic Data for Airport Shuttle Bus
With the growing volume of the airport passengers, public transit is needed for healthy and sustainable city development, in which airport shuttle buses play a key role in satisfying the demand. In this paper, a two-phase airport shuttle bus stop ...
A Neural Network Based Pre-Selection of Big Data in Photon Science
One of the challenges of scientific data collection on a big data scale is the problem of storing all data. An example of this is femtosecond crystallography. Here, small crystals are illuminated by a pulsed X -- ray laser beam and the resulting ...
A Cloud Model for Distributed Transport System Integration
Public transport systems already encounter vast amounts of data per day, ranging from the GPS information of vehicles to sensor checks on real-time locations. Newer technologies such as social media, sensor networks or passenger counting systems, to ...
Efficient Pre-copy Live Migration with Memory Compaction and Adaptive VM Downtime Control
Virtual machine (VM) live migration is an important feature of the virtualization technique. Pre-copy method is typically used to support live migration in most virtualization environments. It is observed that pre-copy may not work in some situation ...
Practical Analysis of Big Acoustic Sensor Data for Environmental Monitoring
Monitoring the environment with acoustic sensors is an effective method for understanding changes in ecosystems. Through extensive monitoring, large-scale, ecologically relevant, datasets can be produced that can inform environmental policy. The ...
RAID-Aware SSD: Improving the Write Performance and Lifespan of SSD in SSD-Based RAID-5 System
Flash memory-based SSD RAID has an excellent I/O performance with high stability, which making it get more and more attention from companies and manufacturers, especially in I/O-intensive environments. However, frequently updating parity also makes the ...
Fault Tolerant Erasure Coded Replication for HDFS Based Cloud Storage
Businesses and individuals move their data to the cloud because fault-tolerant data storage is becoming more important. Currently fault-tolerance cloud storage file systems are available and being used widely. Hadoop Distributed File System (HDFS) has ...
Optimal Distributed Data Warehouse System Architecture
Many organizations look for a proper way to make better and faster decisions about their businesses. Data warehouse has unique features such as data mining and ad hoc querying on data collected and integrated from many of the computerized systems used ...
Secure Index Construction for Privacy-Preserving Large-Scale Image Retrieval
How to efficiently retrieve the images while preserving the user's privacy has gradually become a key problem in some applications such as Cloud storage, social networks. In this paper, a secure index used for image retrieval is constructed to protect ...
A Paralleled Big Data Algorithm with MapReduce Framework for Mining Twitter Data
Some recent studies have suggested that public opinions expressed in social media may be correlated with various social issues. To find out what actually can be discovered in social media data, we need data mining. Data mining approaches that can handle ...
Data-Intensive Workflow Optimization Based on Application Task Graph Partitioning in Heterogeneous Computing Systems
Stream based data processing model is proven to be an established method to optimize data-intensive applications. Data-intensive applications involve movement of huge amount of data between execution nodes that incurs large costs. Data-streaming model ...
Remote Monitoring System Enabling Cloud Technology upon Smart Phones and Inertial Sensors for Human Kinematics
Stroke is a common neurological condition which is becoming increasingly common as the population ages. This entails healthcare monitoring systems suitable for home use, with remote access for medical professionals and emergency responders. The mobile ...
High-Performance Processing of Large-Scale Parallel Applications in Heterogeneous Cloud Computing Data Centers
Efficient application processing is critical for achieving high performance in heterogeneous computing systems, i.e., Optimal System configuration and load distribution of some given types of applications, such that the average response time of tasks is ...
Cloud-Based Educational Big Data Application of Apriori Algorithm and K-Means Clustering Algorithm Based on Students' Information
The paper proposes a cloud-based framework to abstract and analyze the meaningful rules among great amount of students' raw information. The authors abstract a set of learning skills based on the course outline from The Open University of China. The ...