A highly efficient dynamic router for application-oriented network on chip
With the number of processor cores increasing in chip multiprocessors, the network on chip becomes a reliable structure with its perfect parallel communication performance. The traditional static router suffers a bad performance because of low buffer ...
A joint optimization method for NoC topology generation
The increasing demand on efficient intra-chip communication of multicore systems has driven the interconnection structure to evolve from bus/ring to Network-on-Chip (NoC). NoC design is fundamentally based on network topology generation and ...
Improving the energy efficiency and performance of data-intensive workflows in virtualized clouds
In recent years, deploying and running data-intensive workflows in cloud platform has become more and more popular in many areas. Unlike computation-intensive applications, a data-intensive workflow typically requires to deal with bulk data transferring ...
Resource optimization of container orchestration: a case study in multi-cloud microservices-based applications
An approach to optimize the deployment of microservices-based applications using containers in multi-cloud architectures is presented. The optimization objectives are three: cloud service cost, network latency among microservices, and time to start a ...
A multi-objective decision support framework for virtual machine placement in cloud data centers: a real case study
Cloud computing (CC) is the concept of accessing to computing resources: servers, networks, storage, and applications, on demand through a network. This new paradigm has led to the birth of several data centers worldwide offering cloud services across ...
Incentive-aware virtual machine scheduling in cloud computing
As cloud computing is a market-oriented utility, optimal virtual machine (VM) scheduling in cloud computing should take into account the incentives for both cloud users and the cloud provider. However, most of existing studies on VM scheduling only ...
On benchmarking task scheduling algorithms for heterogeneous computing systems
The task scheduling problem on heterogeneous computing systems has been broadly studied, and many heuristic algorithms are proposed to solve this problem. It is interesting to go for significant performance assessment and comparison among these ...
Improving all-reduce collective operations for imbalanced process arrival patterns
Two new algorithms for the all-reduce operation optimized for imbalanced process arrival patterns (PAPs) are presented: (1) sorted linear tree, (2) pre-reduced ring as well as a new way of online PAP detection, including process arrival time estimations,...
Real-time tsunami inundation forecast system for tsunami disaster prevention and mitigation
- Akihiro Musa,
- Osamu Watanabe,
- Hiroshi Matsuoka,
- Hiroaki Hokari,
- Takuya Inoue,
- Yoichi Murashima,
- Yusaku Ohta,
- Ryota Hino,
- Shunichi Koshimura,
- Hiroaki Kobayashi
The tsunami disasters that occurred in Indonesia, Chile, and Japan have inflicted serious casualties and damaged social infrastructures. Tsunami forecasting systems are thus urgently required worldwide. We have developed a real-time tsunami inundation ...
Priority-based capacity and power allocation in co-located WBANs using Stackelberg and bargaining games
To mitigate the interference in co-located wireless body area networks (WBANs), this paper proposes an inter-WBAN priority-based capacity allocation scheme based on the Nash bargaining game, and an intra-WBAN priority-based power control scheme based on ...
An automatic RFID reader-to-reader delegation protocol for SCM in cloud computing environment
Radio frequency identification (RFID) technology enables unique identification and tracking of the tag attached to an object. Widespread usage of RFID technologies in supply chain management (SCM) has drawn attention for developing security protocols to ...
System-wide trade-off modeling of performance, power, and resilience on petascale systems
While performance remains a major objective in the field of high-performance computing (HPC), future systems will have to deliver desired performance under both reliability and energy constraints. Although a number of resilience methods and power ...
Efficient implementation of space---time adaptive processing for adaptive weights calculation based on floating point FPGAs
Space---time adaptive processing (STAP) has an enormous computational complexity which has confined its practical applications. In this paper, we present an implementation based on field programmable gate array (FPGA) for the most computationally ...
Balancing the learning ability and memory demand of a perceptron-based dynamically trainable neural network
Artificial neural networks (ANNs) have become a popular means of solving complex problems in prediction-based applications such as image and natural language processing. Two challenges prominent in the neural network domain are the practicality of ...
A hybrid GPU cluster and volunteer computing platform for scalable deep learning
Deep learning is a very computing-intensive and time-consuming task. It needs an amount of computing resource much greater than a single machine can afford to train a sophisticated model within a reasonable time. Normally, GPU clusters are required to ...
Simulation of an inelastic dispersive phenomenon: stimulated Brillouin scattering in a single-mode fiber segment through parallelism
Stimulated Brillouin scattering (SBS) is one of the most important nonlinear phenomena because it limits the maximum transmission power in modern optical communication systems. Unfortunately, the simulation of SBS is time-consuming, since it requires ...
A heuristic search algorithm based on subspaces for PageRank computation
We studied a fast algorithm for the large-scale computation of PageRank. PageRank is what the Google search engine uses to simulate the importance of web pages. It is defined by the eigenvector of a particular stochastic matrix related to the graphs of ...
Design and performance evaluation of mixed multicast architecture for internet of things environment
Internet of things (IoT) has become one of the most important fields in computing arena. The environments of IoT require highly efficient, immediate and worldwide communication services. Accordingly, efficient multicast routing architecture is a ...
Learning-based dynamic scalable load-balanced firewall as a service in network function-virtualized cloud computing environments
Network function virtualization (NFV) is a network architecture which tries to provide communication services in clouds through virtualization techniques. Actually, NFV combines server and service and replaces a lot of network devices. NFV deploys ...
Brokerage-based dependability integration in cloud computing services
Cloud computing plays a significant role to store and analyze the exponentially growing volume of generated data. However, cloud computing is inherently a frequent environment for failures and its services undergo long downtime periods. These failures ...
SRAM- and STT-RAM-based hybrid, shared last-level cache for on-chip CPU---GPU heterogeneous architectures
Shared last-level cache (LLC) in on-chip CPU---GPU heterogeneous architectures is critical to the overall system performance, since CPU and GPU applications usually show completely different characteristics on cache accesses. Therefore, when co-running ...
Load balancing in reducers for skewed data in MapReduce systems by using scalable simple random sampling
MapReduce has demonstrated itself to be as a highly efficient programming model for processing massive dataset on the distributed system. One of the most important obstacles hindering the performance of MapReduce is data skewness. The presence of data ...
Physically based visual simulation of the Lattice Boltzmann method on the GPU: a survey
The rapid increase in performance, programmability, and availability of graphics processing units (GPUs) has made them a compelling platform for computationally demanding tasks in a wide variety of application domains. One of these is real-time ...
Fast induced sorting suffixes on a multicore machine
Sorting the suffixes of an input string is a fundamental task in many applications such as data compression, genome alignment, and full-text search. The induced sorting (IS) method has been successfully applied to design a number of state-of-the-art ...
Correction to: Dependability enhancing mechanisms for integrated clinical environments
The Acknowledgements published in the original article contain errors. The correct version is given below.