Articles with "batch aware" as a keyword



Photo from wikipedia

BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Parallel and Distributed Systems"

DOI: 10.1109/tpds.2022.3199806

Abstract: Modern intelligent devices usually execute multiple neural networks to improve service quality. However, system performance degrades significantly when the working set exceeds the physical memory capability, a phenomenon called memory oversubscription. To support the execution… read more here.

Keywords: aware resource; neural networks; batch aware; multiple neural ... See more keywords