Computer Science > Performance
[Submitted on 1 Oct 2017 (this version), latest version 16 Sep 2018 (v3)]
Title:Asymptotic response time analysis for multi-task parallel jobs
View PDFAbstract:The response time of jobs with multiple parallel tasks is a critical performance metric in many systems, including MapReduce systems, coded data storage systems, etc. However, tight analytical characterizations of the response time of such jobs are largely unknown except for highly degenerate cases. The difficulty is rooted in the fact that a job with multiple tasks is considered complete only when all of its tasks complete processing; i.e., the job response time is the maximum of the response times of its tasks, which is hard to analyze since these task response times are generally not independent.
In this paper, we approach this problem by studying when the response times of a job's tasks are close to being independent. We consider a limited fork-join model with $n$ servers, where each job consists of $k^{(n)}\le n$ tasks. Upon arrival, each job chooses $k^{(n)}$ distinct servers uniformly at random and sends one task to each server. We assume Poisson job arrivals and generally distributed task service times. We establish that under the condition $k^{(n)} = o(n^{1/4})$, the steady state response times at any $k^{(n)}$ servers are asymptotically independent, as $n$ grows large. This result greatly generalizes the asymptotic-independence type of results in the literature where asymptotic independence is shown only for a fixed constant number of queues. We then further show that the job response time converges to the maximum of independent task response times, in a proper sense. This gives the first asymptotically tight analytical characterization of the response time of a multi-task parallel job. To complement the asymptotic independence result, we also show that when $k^{(n)}=\Theta(n)$, any number of multiple queues are not asymptotically independent. Analysis for the regime of $k^{(n)}$ between $o(n^{1/4})$ and $\Theta(n)$ remains open.
Submission history
From: Weina Wang [view email][v1] Sun, 1 Oct 2017 05:19:19 UTC (421 KB)
[v2] Fri, 27 Oct 2017 18:16:21 UTC (421 KB)
[v3] Sun, 16 Sep 2018 01:50:55 UTC (422 KB)
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.