Articles with "novel inference" as a keyword



Photo by andreacaramello from unsplash

NIOT: A Novel Inference Optimization of Transformers on Modern CPUs

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE Transactions on Parallel and Distributed Systems"

DOI: 10.1109/tpds.2023.3269530

Abstract: In the machine learning era, model inference efficiency is one of the most important issues for machine learning systems. It is a major challenge to find the optimal configuration in a huge search space as… read more here.

Keywords: novel inference; inference optimization; niot novel; optimization transformers ... See more keywords