Articles with "coflow scheduling" as a keyword



Photo by avasol from unsplash

Bottleneck-Aware Non-Clairvoyant Coflow Scheduling With Fai

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE Transactions on Cloud Computing"

DOI: 10.1109/tcc.2021.3128360

Abstract: Coflow scheduling is critical to data-parallel applications in data centers. While schemes like Varys can achieve optimal performance, they require a priori information about coflows which is hard to obtain in practice. Existing non-clairvoyant solutions… read more here.

Keywords: coflow scheduling; bottleneck aware; non clairvoyant; bottleneck flows ... See more keywords
Photo by strong18philip from unsplash

Multi-Attributes-Based Coflow Scheduling Without Prior Knowledge

Sign Up to like & get
recommendations!
Published in 2018 at "IEEE/ACM Transactions on Networking"

DOI: 10.1109/tnet.2018.2858801

Abstract: In data centers, the coflow abstraction is proposed to better express the requirements and communication semantics of a group of parallel flows generated by the jobs of cluster computing frameworks. Knowing the coflow-level information, such… read more here.

Keywords: information; based coflow; coflow scheduling; level ... See more keywords
Photo by timothyeberly from unsplash

Beamer: Stage-Aware Coflow Scheduling to Accelerate Hyper-Parameter Tuning in Deep Learning Clusters

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Network and Service Management"

DOI: 10.1109/tnsm.2021.3132361

Abstract: Training a neural network requires retraining the same model many times to search for the configuration of hyper-parameters with the best training result. It is common to launch multiple training jobs and evaluate them in… read more here.

Keywords: coflow scheduling; stage aware; network; stage ... See more keywords
Photo by avasol from unsplash

Distributed Bottleneck-Aware Coflow Scheduling in Data Centers

Sign Up to like & get
recommendations!
Published in 2019 at "IEEE Transactions on Parallel and Distributed Systems"

DOI: 10.1109/tpds.2018.2889685

Abstract: With the booming development of data parallel frameworks, the coflow abstraction has been greatly favored by data center transport designs, for its prominent ability in capturing application-level semantics. To accelerate job completion, coflow completion time… read more here.

Keywords: coflow scheduling; aware coflow; distributed bottleneck; bottleneck aware ... See more keywords