CLC number:
On-line Access: 2021-12-10
Received: 2021-06-24
Revision Accepted: 2021-11-02
Crosschecked: 0000-00-00
Cited: 0
Clicked: 910
Yuzhao WANG, Junqing YU, Zhibin YU. Resource scheduling techniques in cloud froma view of coordination: a holistic survey[J]. Frontiers of Information Technology & Electronic Engineering, 1998, -1(-1): .
@article{title="Resource scheduling techniques in cloud froma view of coordination: a holistic survey",
author="Yuzhao WANG, Junqing YU, Zhibin YU",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="-1",
number="-1",
pages="",
year="1998",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.2100298"
}
%0 Journal Article
%T Resource scheduling techniques in cloud froma view of coordination: a holistic survey
%A Yuzhao WANG
%A Junqing YU
%A Zhibin YU
%J Journal of Zhejiang University SCIENCE C
%V -1
%N -1
%P
%@ 2095-9184
%D 1998
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.2100298
TY - JOUR
T1 - Resource scheduling techniques in cloud froma view of coordination: a holistic survey
A1 - Yuzhao WANG
A1 - Junqing YU
A1 - Zhibin YU
J0 - Journal of Zhejiang University Science C
VL - -1
IS - -1
SP -
EP -
%@ 2095-9184
Y1 - 1998
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.2100298
Abstract: Nowadays, the management of resource contention in shared cloud remains a pending problem. And the evolution and deployment of new application paradigms (e.g., deep learning training and microservices) and custom hardware (e.g., GPU, TPU) have posed new challenges in resource management system design. Current solutions
tend to trade cluster efficiency for guaranteed application performance, e.g., resource over-allocation, leaving lots of
resources underutilized. To overcome this dilemma is not easy, because different components across the software stack
are involved. Nevertheless, massive efforts have been devoted to seeking effective performance isolation and highly-efficient resource scheduling. The goal of this paper is to systematically cover related aspects to deliver the techniques
from coordination perspective, and identify the corresponding trends they indicate. Briefly, four topics are involved.
Firstly, isolation mechanisms deployed at different levels (micro-architecture, system and virtualization level) are
reviewed, including GPU multitasking methods. Second, resource scheduling techniques within individual machine
and at cluster level are investigated, respectively. Particularly, GPU scheduling for deep learning applications is
described in detail. Third, adaptive resource management including the latest microservice-related researches is
thoroughly explored. Finally, future research directions are discussed in the light of advanced work. Hopefully, this
paper will help researchers establish a global view of the landscape of resource management techniques in shared
cloud, and see technology trends more clearly.
Open peer comments: Debate/Discuss/Question/Opinion
<1>