Sign Up to like & get
recommendations!
2
Published in 2023 at "IEEE Transactions on Parallel and Distributed Systems"
DOI: 10.1109/tpds.2023.3269530
Abstract: In the machine learning era, model inference efficiency is one of the most important issues for machine learning systems. It is a major challenge to find the optimal configuration in a huge search space as…
read more here.
Keywords:
novel inference;
inference optimization;
niot novel;
optimization transformers ... See more keywords