Abstract
The edge computing paradigm is featured by the ability to off-load computing tasks from mobile devices to edge clouds and provide high cost-efficient computing resources, storage and network services closer to the edge. A key question for workflow scheduling in the edge computing environment is how to guarantee user-perceived quality of services when the supporting edge services and resources are with unstable, time-variant, and fluctuant performance. In this work, we study the workflow scheduling problem in the multi-user edge computing environment and propose a Deep-Q-Network (DQN) -based multi-workflow scheduling approach which is capable of handling time-varying performance of edge services. To validate our proposed approach, we conduct a simulative case study and compare ours with other existing methods. Results clearly demonstrate that our proposed method beats its peers in terms of convergence speed and workflow completion time.
H. Liu and Y. Ma—Contribute equally to this article and should be considered co-first authors.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
The edge computing paradigm is emerging as a high performance computing environment with a large-scale, heterogeneous collection of autonomous systems and flexible computational architecture [1,2,3,4,5,6]. It provides the tools and technologies to build data or computational intensive parallel applications with much more affordable prices compared to traditional parallel computing techniques. Hence, there has been an increasingly growth in the number of active research work in edge computing such as scheduling, placement, energy management, privacy and policy, security, etc. Workflow scheduling in cloud and edge environment has recently drawn enormous attention thanks to its wide application in both scientific and economic areas. A workflow is usually formulized as a Directed-Acyclic-Graph (DAG) with several n tasks that satisfy the precedent constraints. Scheduling workflows over an edge environment is referred to as matching tasks onto edge services created on edge nodes. For multi-objective scheduling, objectives can sometimes be conflicting. e.g., for execution time minimization, fast services are more preferable than slow ones. However, fast services are usually more expensive and thus execution time minimization may contradict the cost reduction objective. It is widely acknowledged as well that to scheduling multi-task workflow on distributed platforms is an NP-hard problem. It is therefore extremely time-consuming to yield optimal schedules through traversal-based algorithms. Fortunately, heuristic and meta-heuristic algorithms with polynomial complexity are able to produce approximate or near optimal solutions of schedules at the cost of acceptable optimality loss [7,8,9,10,11,12]. Good examples of such algorithms are multi-objective particle swarm optimization (MOPSO) and non-dominated sorting genetic algorithm-II (NSGA-II).
Recently, as novel machine learning algorithms are becoming increasingly versatile and powerful, considerable research efforts are paid to using reinforcement learning (RL) and Q-learning-based algorithms [13,14,15] in finding near-optimal workflow scheduling solutions. Nevertheless, most existing contributions in this direction focused on scheduling workflows over centralized clouds. How to apply Q-learning-based algorithms and models to the problem of scheduling workflows upon distributed edge computing platforms is still to be clearly addressed. In this work, we propose a DQN-based multi-workflow scheduling method. The proposed model takes the probability mass functions (PMF) of historical performance data of edge services as the inputs and is capable of improving the scheduling plans via optimizing the probability of a workflow satisfying the completion-time constraint. We conduct a simulated experiment and compare our method with other baseline algorithms. The results show that our method outperforms baseline algorithms in terms of workflow completion time.
2 Related Work
Scheduling multi-workflows upon distributed infrastructures, e.g., grids, clouds and edge, is usually known to be NP-hard and thus traversal-based algorithms can be ineffective in terms of computational complexity. Instead, heuristic and meta-heuristic procedures with polynomial complexity can yield high-quality and sub-optimal solutions at the cost of a certain level of optimality degradation. For example, [16] leveraged a multi-objective bio-inspired procedure (MOBFOA) by augmenting the tradtional BFOA with Pareto-optimal fronts. Their method deals with the reduction of flow-time, completion duration, and operational cost. [17] considered a multi-objective genetic optimization (BOGA) and optimizd both electricity consumption and DAG reliability. [18] considered an augmented GA with the Efficient Tune-In (GA-ETI) mechanism for the optimization of turnaround time. [19] employed a non-dominated-sorting-based Hybrid PSO approach and aimed at minimizing both turnaround time and cost. [20] introduced a fuzzy dominance sort based heterogeneous finishing time minimization approach for the optimization of both cost and turnaround time of DAG executed on IaaS clouds.
Recently, deep reinforcement learning (DRL) methods shed new light on the problem we are interested in [21,22,23,24,25,26,27]. It was shown that the multi-agent training methods can be effective in dealing with multi-constraint and multi-objective optimization problems. For example, [28] employed a sequential cooperative game approach for heterogeneous workflow scheduling. [29] developed a reinforcement-learning-based method for multi-DAG execution with user-defined priorities specified at different times. [30] proposed a distributed load management and access control approach for the SaaS environment by using a fuzzy game-theoretic model. [31] proposed modified Q-learning method for turn-around time reduction and load balancing by considering weighted fitness value function. However, Q-learning-based algorithms and models intended for edge-infrastructure-based workflow scheduling is very rare. A highly efficient reinforcement-learning-based approach for scheduling and managing multi-workflows upon distributed, mobile, and resource-constrained edge services is in high need.
3 Model and System
3.1 System Architecture
As shown in Fig. 1, an edge computing environment can be seen as a collection of multiple edge servers usually deployed near base stations. By this way, users are allowed to off-load compute-intensive and latency-sensitive applications, e.g., Augmented Reality (AR), Virtual Reality (VR), Artificial Intelligence (AI), to edge servers. Within an edge computing environment, there exist n users in an edge computing environment, denote by \(U=\{u_{1},u_{2},...,u_{n}\}\), and m base stations, denote by \(B=\{b_{1},b_{2},...,b_{m}\}\). Each user has an application to be executed, and users mobile device is allowed to offload tasks on edge servers near the base station by wireless access point. For generality, we regard mobile applications as workflows, denote by a directed acyclic graph(DAG) \(W=(T,D)\), where \(T=\{t_{1},t_{2},...,t_{n}\}\) represents a set of tasks. Tasks have multiple types which have different size of input data. \(D=\{d_{i,j} | i,j\in [1,n]\}\) represents a set of precedence dependencies, where \(d_{i,j}=1\) means \(t_j\) can be executed only when \(t_i\) is completed, otherwise \(d_{i,j}=0\). \(S_{i}=\{s_{1},s_{2},...,s_{n}\}\) represents the list of servers which signal coverage covers user i, thus user i can offload tasks on these servers.
Users are allowed to offload tasks to the edge via wireless access points. The action profile of users can be expressed as \(a_{i}=\{s_{1},s_{2},...,s_{m}\}\), where \(s_{j}\) indicates server \(s_{j}\). For a server \(s_{j}\), a list of users who offload tasks to it can be represented as \(UL_{j}=\{i|s_{j}\in a_{i}\}\). For an action profile \(A=\{a_{1},a_{2},...,a_{n}\}\) of all users, the uplink data rate of wireless channel of user \(u_{i}\) to server \(s_{j}\) can be estimated by
where B is the channel bandwidth, \(p_{i}\) the transmit power of user \(u_{i}\), \(g_{i,j}\) the channel gain from user \(u_{i}\) to server \(s_{j}\), and \(\sigma \) the backgroud noise power. It can thus be seen from this equation, if too many users choose to offload its tasks to the same server, the uplink data rate decreases and further causes low offloading efficiency.
Assume user \(u_{i}\) chooses to offload its task \(t_{j}\) to server \(s_{k}\), according to the Eq. 1, the transmission time for offloading the input data of size \(C_{i,j,k}\) can be estimated by
We assume that all wireless channels obey the quasi-static block fading rule [32]. This rule means that the state of the channel remains unchanged during transmission. Thus, the probability distribution of the completion time of the task is
where TE is historical execution time, PMF(t) indicates the probability mass function of the historical execution time.
3.2 Promblem Formulation
Based on the above system model, we are interested in knowing the highest probability of meeting the completion-time constraints. The resulting scheduling problem can be described as follows:
subject to,
where \(C_i^g\) is a completion-time threshold for user \(u_{i}\) and \(CT_{i}\) the actual completion time of a user’s workflow.
4 Our Approach
4.1 Decomposition of the Global Constraint
For the evaluation of effectiveness of the actions by agents during the training process, we first have to decompose the global constraint to local ones. Given a workflow with n tasks, denoted by \(W=\{t_{1},t_{2},...,t_{n}\}\) and \(C^g\) as the global completion-time constraint, the local constraint of subtask can be represented by \(C^l=\{C_{1}^l,C_{2}^l,...,C_{n}^l\}\). We consider dividing the global constraint in proportion to the expected completion time of each part, where specific steps are as follows:
-
1.
Obtain the server list whose coverage reach user k, denoted by \(S_{k}=\{s_{1},s_{2},...,s_{n}\}\).
-
2.
For task \(t_{i}\), its completion time on server \(s_{j}\) is represented by a PMF that we mentioned above. The expected completion time \(e_{i,j}\) can be estimated by \(\{e_{i,j}|\int _0^{e_{i,j}}PMF(X)=0.5\}\).
-
3.
Task \(t_{i}\) has multiple candidate servers \(S_k\) to be scheduled into, the expected completion time of task \(t_{i}\) is \(E_i^t=avg(e_i)\), where \(e_i=\{e_{i,1},e_{i,2},...,e_{i,n}\}\).
-
4.
For any part \(p_{g}\), it consists of tasks \(T_g=\{t_1,t_2,...,t_n\}\). The expected completion time of this part is thus \(E_g^p=\max (E^t_{t_{1}},E^t_{t_{2}},...,E^t_{t_{n}})\)
-
5.
Eventually, we can divide the global constraint into smaller ones as follows:
$$\begin{aligned} C_{i}^l=C^g\cdot \frac{E_{i}^p}{\sum _{j=1}^{n}E_{j}^p} \end{aligned}$$(8)
4.2 Deep-Q-Network-based Solution to the Workflow Scheduling Problem
As mentioned earlier, we employ DQN for solving the optimization formulations given above. According to DQN, the value function updated by time difference can be expressed as:
where Q(s, a) is the state-action value function at current state, \(Q(s',a')\) is the state-action value function at the next state, \(\alpha \) is the update step size, R(a) is the reward derived based on the PMF of the workflow completion time according to (12) and \(\gamma \) is the reward decay factor. The loss function of deep q network can be computed by
where y presents the target Q network whose parameters are periodically replaced by evaluate Q network \(Q^*\). The DQN procedure is shown in Algorithm 1.
The DQN environment includes components of environment observation, action space, policy setting, and reward design [33]. Note that the former 3 components can be implemented by using the standard DQN setting, while the reward design one should be developed based on the optimization formulation and the constraint decomposition configuration given in the previous sections. The reward function is designed as:
where \(C_i^l\) is based on the decomposition of the global constraint given in (8).
5 Case Study
In this section, we conduct simulative case studies to prove the effectiveness, in terms of workflow completion time, network loss value, and convergence speed of the algorithm of our method. The types of server, workflow and task are randomly generated. We assume as well that edge servers and users are located according to the position dataset of [34] as illustrated in Fig. 2. Edge servers have 3 different types, i.e., type1, type2, and type3, in terms of their resource configuration and performance. User applications are expressed in the form of multiple workflows as given in Fig. 3, where each workflow task is responsible for executing a GaussCLegendre calculation with 8, 16, or 32 million decimal digits. The historical execution time for GaussCLegendre calculations over different types of edge servers are based on data from [35] shown in Fig. 4. For the comparison purpose, we compare our proposed method with other existing methods, i.e., NSPSO [36] and NSGA [37] as well.
5.1 Experiment Configuration
We test our methods and its peers by using a workstation with the Intel Core i7 CPU @ 2.80 GHz, NVIDIA GeForce GTX 1050 Ti, and 8 GB RAM configuration. Table 1 shows basic parameters used in the experiments.
5.2 Performance Evaluation
Based on the above configurations and datasets, we repeated invoking our proposed method to schedule workflows based on performance data of edge servers measured at 3 different time periods given in Fig. 4. It can be seen from Figs. 5 and 6 that the network loss decreases rapidly with time and the probability of satisfying global constraint increases with iterations.
As can be seen from Fig. 7, our method clearly outperforms baseline algorithms at all 3 time periods in terms of workflow completion time.
6 Conclusion
In this work, a novel probability-mass function and DQN-based approach to scheduling multi-workflows upon a distributed edge-computing environment is proposed. The proposed method is capable of handling time-varying performance of edge services through probability-mass functions of historical performance data and leveraging a Deep-Q-network framework for yielding high-quality workflow scheduling plans. To validate our proposed approach, we conduct a simulative case study based on a well-known edge-service-position dataset and demonstrate that our proposed method beats its peers in terms of the scheduling performance.
References
Chen, X., Liu, Z., Chen, Y., Li, Z.: Mobile edge computing based task offloading and resource allocation in 5g ultra-dense networks. IEEE Access 7, 184172–184182 (2019)
Ciobanu, R., Dobre, C., Balanescu, M., Suciu, G.: Data and task offloading in collaborative mobile fog-based networks. IEEE Access 7, 104405–104422 (2019)
Li, G., Lin, Q., Wu, J., Zhang, Y., Yan, J.: Dynamic computation offloading based on graph partitioning in mobile edge computing. IEEE Access 7, 185131–185139 (2019)
Luo, S., Wen, Y., Xu, W., Puthal, D.: Adaptive task offloading auction for industrial CPS in mobile edge computing. IEEE Access 7, 169055–169065 (2019)
Zhou, J., Fan, J., Wang, J., Zhu, J.: Task offloading for social sensing applications in mobile edge computing. In: Seventh International Conference on Advanced Cloud and Big Data, CBD 2019, Suzhou, China, 21–22 September 2019, pp. 333–338. IEEE (2019)
Chen, H., Zhu, X., Liu, G., Pedrycz, W.: Uncertainty-aware online scheduling for real-time workflows in cloud service environment. IEEE Trans. Serv. Comput. 1, 1 (2018)
Zhang, Y., Du, P.: Delay-driven computation task scheduling in multi-cell cellular edge computing systems. IEEE Access 7, 149156–149167 (2019)
Cao, H., Xu, X., Liu, Q., Xue, Y., Qi, L.: Uncertainty-aware resource provisioning for workflow scheduling in edge computing environment. In: 18th IEEE International Conference On Trust, Security and Privacy in Computing and Communications / 13th IEEE International Conference On Big Data Science And Engineering, TrustCom/BigDataSE 2019, Rotorua, New Zealand, 5–8 August 2019, pp. 734–739. IEEE (2019)
Deng, Y., Chen, Z., Yao, X., Hassan, S., Wu, J.: Task scheduling for smart city applications based on multi-server mobile edge computing. IEEE Access 7, 14410–14421 (2019)
Jian, C., Chen, J., Ping, J., Zhang, M.: An improved chaotic bat swarm scheduling learning model on edge computing. IEEE Access 7, 58602–58610 (2019)
Ma, Y., et al.: A novel approach to cost-efficient scheduling of multi-workflows in the edge computing environment with the proximity constraint. In: Wen, S., Zomaya, A.Y., Yang, L.T. (eds.) Algorithms and Architectures for Parallel Processing - 19th International Conference, ICA3PP 2019, Melbourne, VIC, Australia, 9–11 December 2019, Proceedings, Part I. Volume 11944 of Lecture Notes in Computer Science, pp. 655–668. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-38991-8_43
Peng, Q., Jiang, H., Chen, M., Liang, J., Xia, Y.: Reliability-aware and deadline-constrained workflow scheduling in mobile edge computing. In: 16th IEEE International Conference on Networking, Sensing and Control, ICNSC 2019, Banff, AB, Canada, May 9–11, 2019, pp. 236–241. IEEE (2019)
Bernal, J., et al.: Deep convolutional neural networks for brain image analysis on magnetic resonance imaging: a review. Artif. Intell. Med. 95, 64–81 (2019)
Bouwmans, T., Javed, S., Sultana, M., Jung, S.K.: Deep neural network concepts for background subtraction: a systematic review and comparative evaluation. Neural Netw. 117, 8–66 (2019)
Grekousis, G.: Artificial neural networks and deep learning in urban geography: a systematic review and meta-analysis. Comput. Environ. Urban Syst. 74, 244–256 (2019)
Kaur, M., Kadam, S.: A novel multi-objective bacteria foraging optimization algorithm (MOBFOA) for multi-objective scheduling. Appl. Soft Comput. J. 66, 183–195 (2018)
Zhang, L., Li, K., Li, C., Li, K.: Bi-objective workflow scheduling of the energy consumption and reliability in heterogeneous computing systems. Inf. Sci. 379, 241–256 (2017)
Casas, I., Taheri, J., Ranjan, R., Wang, L., Zomaya, A.Y.: GA-ETI: an enhanced genetic algorithm for the scheduling of scientific workflows in cloud environments. J. Comput. Sci. 26, 318–331 (2018)
Verma, A., Kaushal, S.: A hybrid multi-objective Particle Swarm Optimization for scientific workflow scheduling. Parallel Comput. 62, 1–19 (2017)
Zhou, X., Zhang, G., Sun, J., Zhou, J., Wei, T., Hu, S.: Minimizing cost and makespan for workflow scheduling in cloud using fuzzy dominance sort based HEFT. Fut. Generation Comput. Syst. 93, 278–289 (2019)
Bertsekas, D.P.: Feature-based aggregation and deep reinforcement learning: a survey and some new implementations. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–31 (2018)
Mao, H., Alizadeh, M., Menache, I., Kandula, S.: Resource management with deep reinforcement learning. In: Proceedings of the 15th ACM Workshop on Hot Topics in Networks - HotNets 2016, pp. 50–56 (2016)
Xue, L., Sun, C., Wunsch, D., Zhou, Y., Yu, F.: An adaptive strategy via reinforcement learning for the prisoner’s dilemma game. IEEE/CAA J. Automatica Sinica 5(1), 301–310 (2018)
Zhan, Y., Ammar, H.B., Taylor, M.E.: Theoretically-grounded policy advice from multiple teachers in reinforcement learning settings with applications to negative transfer. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. IJCAI’16, AAAI Press, pp. 2315–2321 (2016)
Wang, H., Huang, T., Liao, X., Abu-Rub, H., Chen, G.: Reinforcement learning for constrained energy trading games with incomplete information. IEEE Trans. Cybern. 47(10), 3404–3416 (2017)
Zheng, L., Yang, J., Cai, H., Zhang, W., Wang, J., Yu, Y.: MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence, pp. 1–2 (2017)
Lowe, R., Wu, Y., Tamar, A., Harb, J., Pieter Abbeel, O., Mordatch, I.: Multi-agent actor-critic for mixed cooperative-competitive environments. In: Guyon, I., et al. (eds.) Advances in Neural Information Processing Systems 30. Curran Associates, Inc. pp. 6379–6390 (2017)
Duan, R., Prodan, R., Li, X.: Multi-objective game theoretic scheduling of bag-of-tasks workflows on hybrid clouds. IEEE Trans. Cloud Comput. 2(1), 29–42 (2014)
Cui, D., Ke, W., Peng, Z., Zuo, J.: Multiple DAGs Workflow Scheduling Algorithm Based on Reinforcement Learning in Cloud Computing, pp. 305–311. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-0356-1_31
Iranpour, E., Sharifian, S.: A distributed load balancing and admission control algorithm based on Fuzzy type-2 and Game theory for large-scale SaaS cloud architectures. Future Generation Comput. Syst. 86 81–98 (2018)
Jiahao, W., Zhiping, P., Delong, C., Qirui, L., Jieguang, H.: A Multi-object Optimization Cloud Workflow Scheduling Algorithm Based on Reinforcement Learning, pp. 550–559. Springer, Cham (aug (2018). https://doi.org/10.1007/978-3-319-95933-7_64
Guo, S., Liu, J., Yang, Y., Xiao, B., Li, Z.: Energy-efficient dynamic computation offloading and cooperative task scheduling in mobile cloud computing. IEEE Trans. Mob. Comput. 18(2), 319–333 (2019)
Mnih, V., et al.: Playing atari with deep reinforcement. Learning 2055, 1–9 (2013)
Lai, P., et al.: Optimal edge user allocation in edge computing with variable sized vector bin packing. CoRR abs/1904.05553 (2019)
Li, W., Xia, Y., Zhou, M., Sun, X., Zhu, Q.: Fluctuation-aware and predictive workflow scheduling in cost-effective infrastructure-as-a-service clouds. IEEE Access 6, 61488–61502 (2018)
Beegom, A.S.A., Rajasree, M.S.: Non-dominated sorting based PSO algorithm for workflow task scheduling in cloud computing systems. J. Intell. Fuzzy Syst. 37(5), 6801–6813 (2019)
Mollajafari, M., Shahhoseini, H.S.: Cost-optimized ga-based heuristic for scheduling time-constrained workflow applications in infrastructure clouds using an innovative feasibility-assured decoding mechanism. J. Inf. Sci. Eng. 32(6), 1541–1560 (2016)
Acknowledgement
This work is supported in part by Science and Technology Program of Sichuan Province under Grant 2020 JDRC0067.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, H. et al. (2020). Scheduling Multi-workflows over Edge Computing Resources with Time-Varying Performance, A Novel Probability-Mass Function and DQN-Based Approach. In: Ku, WS., Kanemasa, Y., Serhani, M.A., Zhang, LJ. (eds) Web Services – ICWS 2020. ICWS 2020. Lecture Notes in Computer Science(), vol 12406. Springer, Cham. https://doi.org/10.1007/978-3-030-59618-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-59618-7_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59617-0
Online ISBN: 978-3-030-59618-7
eBook Packages: Computer ScienceComputer Science (R0)